- 1Research Center for Agricultural Information Technology, National Agricultural and Food Research Organization (NARO), Tsukuba, Japan
- 2Institute of Crop Science, National Agricultural and Food Research Organization (NARO), Tsukuba, Japan
- 3Institute for Agro-Environmental Sciences, National Agricultural and Food Research Organization (NARO), Tsukuba, Japan
Unmanned aerial vehicles (UAVs) are powerful tools for monitoring crops for high-throughput phenotyping. Time-series aerial photography of fields can record the whole process of crop growth. Canopy height (CH), which is vertical plant growth, has been used as an indicator for the evaluation of lodging tolerance and the prediction of biomass and yield. However, there have been few attempts to use UAV-derived time-series CH data for field testing of crop lines. Here we provide a novel framework for trait prediction using CH data in rice. We generated UAV-based digital surface models of crops to extract CH data of 30 Japanese rice cultivars in 2019, 2020, and 2021. CH-related parameters were calculated in a non-linear time-series model as an S-shaped plant growth curve. The maximum saturation CH value was the most important predictor for culm length. The time point at the maximum CH contributed to the prediction of days to heading, and was able to predict stem and leaf weight and aboveground weight, possibly reflecting the association of biomass with duration of vegetative growth. These results indicate that the CH-related parameters acquired by UAV can be useful as predictors of traits typically measured by hand.
Introduction
Phenotyping is a fundamental procedure in field testing of crops and is typically done laboriously by hand. To make phenotyping more effective, various methods using unmanned aerial vehicles (UAVs) have been developed for measuring crop physical parameters, especially in the field (Furbank and Tester, 2011; Ninomiya, 2022). UAVs can carry several types of cameras, including RGB (red–green–blue), multispectral, and thermal infrared, to take images of crops (Yang et al., 2017; Sakamoto et al., 2022). From RGB images, the 2D vegetation fraction and vertical canopy height (CH) can be extracted (Ogawa et al., 2021a; Ogawa et al., 2021b). Vegetation indices, obtained by spectral analysis, such as the Normalized Difference Vegetation Index (NDVI), have been used for estimating nitrogen use efficiency (Liang et al., 2021), drought resistance (Jiang et al., 2021), and lodging (Yadav et al., 2017; Singh et al., 2019), and for predicting biomass and yield (Yue et al., 2017; Gong et al., 2018; Di Gennaro et al., 2019; Duan et al., 2019; Wang et al., 2019a). These attempts indicate the usefulness of UAVs for high-throughput phenotyping of crops in the field.
Rice is a staple food, especially in Asia (Muthayya et al., 2014). Crucial to increased and sustainable rice production, yield and biomass are complex traits affected by plant shape and size (Peng et al., 2008; Xing and Zhang, 2010; Ikeda et al., 2013). Culm length (CL), panicle length (PL), and panicle number (PN), values of which reflect the genetic architecture of rice, are roughly related to yield and biomass (Zhao et al., 2011). Breeding for longer culms led to the selection of a rice line with higher grain yield and plant weight (Nomura et al., 2019). A rice line carrying OsglHAT1, which encodes a new-type GNAT-like protein that harbors intrinsic histone acetyltransferase activity, had increased plant size and grain length and width, with increased yield and biomass (Song et al., 2015). Panicle length (PL) and panicle number (PN) are strongly related to rice yield (Agata et al., 2020; Liu et al., 2022). Growth period also influences rice yield and biomass (Endo-Higashi and Izawa, 2011; Gao et al., 2014), and days to heading (DTH) is generally used to evaluate the transition from vegetative to reproductive stage. Conventionally, CL, PL, PN, and DTH are measured by hand at high cost. For rice breeding and examining the cultivation competence of cultivars at lower cost, a practical high-throughput phenotyping system to estimate these traits in the field is required.
In our previous study, rice CH estimated from UAV images was highly correlated with CL (Ogawa et al., 2021b), making it a potential predictor of yield and biomass. A promising approach to make the most of UAV-based CH data for rice phenotyping is time-series monitoring, in which remote sensing has an advantage by being non-invasive and non-destructive. Estimation of plant height in maize inbred lines at four growth stages by UAV showed that temperate lines grew faster at early growth stages, but tropical lines grew faster at later growth stages (Wang et al., 2019b). Time-series observations of rice CH with UAV correlated highly with CH measured by hand, and revealed growth patterns and differences in functional stages of quantitative trait loci for CH (Ogawa et al., 2021b). Use of a cable-suspended phenotyping platform allowed the temperature response of CH in wheat lines to be clarified (Kronenberg et al., 2021). These studies revealed time-series CH dynamism as a new feature different from one-off CH measurement and led to the hypothesis that time-series CH analysis could reveal genetic and phenological characteristics of rice cultivars and predict yield-related traits usually measured by hand.
One of the important challenges in time-series data analysis is handling time-series changes to allow comparison (Giorgino, 2009; Sugihara et al., 2012; Maziarz, 2015). Many time-series models have been proposed for analyzing crop phenology. Such models include shape-model fitting (Sakamoto et al., 2013; Zhou et al., 2020), random regression with the Legendre polynomial (Campbell et al., 2018; Campbell et al., 2019), segmented linear regression (Toda et al., 2021), and non-linear growth curves (Chang et al., 2017; Grados et al., 2020; Poudel et al., 2022). Anderson et al. (2019) applied a three-parameter logistic model (S-shape non-linear curve) to maize CH time-series data measured by UAV over 1 year, applied a linear mixed effects (LME) model to the logistic parameters, decomposed the parameter variance into genetic and environmental effects: they showed that some of the parameters could be used as predictors of grain yield. Borra-Serrano et al. (2020) and Chang et al. (2017) applied similar S-shape non-linear curves to, respectively, soybean and sorghum CH time-series data measured by UAV. In contrast to these crops, in which CH increases with plant growth, rice CH decreases in the reproductive stage. Therefore, it is necessary to develop a new model to incorporate the effects of the CH decrease and its timing, and to apply it to CH time-series data covering various rice lines.
In this study, we aimed at revealing how UAV-derived time-series CH data are useful for predicting yield and biomass and related traits such as DTH, CL, PL and PN. We developed a novel time-series model incorporating both CH increase and decrease during the growth period, unlike previous models developed for maize, soybean, and sorghum. To develop our model, we used data covering 3 years and 30 cultivars, enabling us to evaluate its robustness and to analyze the cultivar effects by LME models. Through this analysis, we developed a practical and high-throughput method for the prediction of rice traits from CH-related parameters.
Materials and methods
Growing of rice cultivars
Seeds of 30 rice cultivars in Japan, including those developed for high grain yield, lodging resistance, disease resistance, and brown rice quality (Supplementary Table 1), were sown in seedling medium on 17 April 2019, 20 April 2020, and 20 April 2021. We transplanted 3 seedlings per hill at a density of 22.2 plants/m2 into a paddy field in Tsukubamirai city (36°00′33″N, 140°01′20″E), Japan, on 17 May 2019, 15 May 2020, and 13 May 2021. The paddy field was divided into 60 plots, two per cultivar (Figure 1). The size of each plot was 2.7 m2. The plants were grown in the field for about 5 months.
Figure 1 Framework of our analysis for the traits prediction from CH-related parameters. Thirty rice cultivars including AKT and TKA were grown with 2 replicates in 2019, 2020, and 2021. Aerial photogrammetry was conducted weekly. CH time-series data were obtained by generating a DSM and CH-related parameters were calculated. Several traits were measured by hand. Statistical analysis included linear regression for the traits prediction and linear mixed effects modeling for the variance decomposition of the traits and CH-related parameters.
UAV-based aerial photography
Aerial observations were made about once a week as in our previous studies (Ogawa et al., 2019; Ogawa et al., 2021a; Ogawa et al., 2021b). We used a Phantom 4 Pro UAV (P4P; DJI, Shenzhen, China) to capture RGB images with an onboard 20-megapixel camera that flew automatically at 1.0 m/s over the paddy field at an altitude of 10.3 m. DJI GS Pro software controlled the flight path and set the following photogrammetry conditions: capture mode, time interval; front overlap ratio, 80%; side overlap ratio, 80%; gimbal pitch angle, −90°, white balance, cloudy; aperture, auto; shutter, auto. Each flight took 150–200 images covering the field, each measuring 5472 × 3648 pixels. To set the focus, the P4P was manually raised to 10.3 m, the camera was focused automatically on a region of the canopy, and then the focus mode was changed to manual. We placed seven ground control point (GCP) markers on the ground around the test field. We obtained the altitude, longitude, and latitude of each GCP by real-time kinematic positioning using a DG-PRO1RWS receiver (BizStation Corp., Matsumoto, Japan).
Generation of digital surface model and quantification of CH
As previously, Agisoft MetaShape Professional v. 1.6.5 software (Agisoft, St. Petersburg, Russia) generated a digital surface model (DSM) from each image set (150–200 images per set) by the date of photogrammetry in the following procedure: (1) align photos (high accuracy), (2) import GCPs, (3) optimize camera, (4) build dense cloud, (5) build digital elevation model (source data to be dense cloud), and (6) export the digital elevation model. The coordinate system was set to UTM zone 54N (WGS-84) and the resolution to 2 mm/pixel. Next, QGIS (3.20.0) software (QGIS Development Team) cut out the area of the paddy field from each DSM image and identified the position of each plot to create shape files. Finally, a script written in Python (3.9.7: Python Software Foundation) cut out the portion in the DSM images corresponding to each plot in reference to the shape files. The computer was an AMD Ryzen Threadripper 2990WX (32-Core Processor, 3.00 GHz, 128 GB RAM, GeForce RTX 2080 Ti GPU) running the 64-bit Windows 10 Pro operating system.
We defined the canopy position as the 95th percentile in the cut-out DSMs corresponding to each plot. CH was defined as the difference between canopy position and ground level. We defined ground level as the 2nd percentile just after transplanting in the cut-out DSMs corresponding to each plot.
Fitting time-series model to the CH data
For the statistical modeling of the CH time-series data, we adopted a three-parameter logistic as the typical model for the S-shape plant growth (Paine et al., 2012). Since the logistic asymptotically approaches the maximum saturation value K, we modified it to incorporate the CH decrease in the late growth phase to develop the following time-series model:
where x is days after sowing, d0 is the time point at the highest growth rate, d1 is the time point at the maximum CH, r is the growth rate, and a is the CH decrease rate from the logistic S-shape curve in the late growth phase (Figure 2).
Figure 2 Fitting time-series model to CH time-series data. Red points are CH data obtained by UAV. The time-series model applied to the data is the red S-shaped curve. The model was prescribed by 5 parameters: K, the maximum saturation value; r, growth rate; a, CH decrease in the late growth phase; d0, time point at the highest growth rate; d1, time point at maximum CH. This curve was described in equation (1).
For parameter estimation, we used a two-step procedure to prevent false convergence in the estimation algorithm. First, we calculated K and d1, taking the maximum value in each CH time-series data set as K. We fitted a cubic polynomial to each data set and obtained d1 as the time point when the cubic polynomial was at its maximum. We applied the following cubic polynomial:
Second, we fitted equation 1 given the values of K and d1. Except for K, all parameters were obtained by means of the nonlinear least squares method implemented in R (R Core Team, 2021). For parameter estimation for equation 1, we used the R function nls, adopting the nl2sol algorithm and setting the initial values to d0=50 , r=0.05 , and a=1.0 × 10−4 .
Manual measurement of traits related to yield
Heading date was defined as the date when panicles emerged from about half of the stems in each plot. DTH (days) was the period from the sowing date to the heading date. CL (m) and PL (cm) of the longest culm of each plant were measured and PN was counted once from 2 to 4 weeks after heading. CL was defined as the length from the ground to the panicle base, and PL was defined as the length from there to the tip of the spikelet. Mean values from 10 plants per cultivar were used for CL, PL, and PN. For aboveground dry weight (ADW; g), 50 plants per plot at maturity were harvested from the ground and air-dried for more than 2 weeks before the measurement. Stem and leaf weight (SLW; g) was obtained as ADW − grain weight (GW; g). These traits were measured in each plot. All trait names are listed in Table 1.
Statistical analysis of traits and parameters
To decompose the traits and CH-related parameters into cultivar, year, and cultivar × year interaction effects, we applied the linear mixed effects (LME) model:
where Xlyb is a parameter or trait of cultivar l in year y and plot b; µX is the fixed effect for the average value; , , and are random effects of cultivar, year, and cultivar × year interaction, respectively; and is the residual. We defined heritability as the ratio of cultivar variance to the total variance:
The R package lme4 (Bates et al., 2015) estimated the parameters and the best linear unbiased predictors (BLUPs) of the LME model by the REML method. Total variance was calculated as follows:
We used the linear regression model to predict the yields from the CH-related parameters and evaluated whether the phenology data contained enough information about yield. To reveal what CH-related parameters are useful for the prediction of traits, we examined Pearson’s correlations (cor). Since multicollinearity impairs the accuracy of regression coefficients, we used backward variable selection to prevent it. We calculated the variance inflation factor in the R package car (Fox and Weisberg, 2019) for variable selection and adopted four parameters as predictors without multicollinearity (variance inflation factor< 5): K, d0, d1, and r (Supplementary Table 2). All four predictors were standardized to have a mean of 0 and standard deviation of 1. Next, we constructed linear regression models by the ordinary least squares method to predict CL, DTH, ADW, GW, and SLW. The prediction accuracies were evaluated by cross-validation (CV), splitting data by year and by cultivar (Figure 3). Finally, the regression coefficients were estimated from all data (n = 180). As measures of accuracy, we used cor and root-mean-square error (RMSE) between observed and predicted values of test data. RMSE evaluates the accuracy of predicting the exact values, and cor evaluates the accuracy of predicting the magnitude of correlation.
Figure 3 Schemes of CV to predict manually measured traits. (A) Threefold CV, where data were split by year: two years were used for training data and the other as test data. (B) Tenfold CV, where data were split by cultivar: 27 cultivars as training data and the other 3 as test data.
Influence of accumulated daily mean temperature on CH-related parameters d0 and d1
We transformed d0 and d1 values to accumulated temperature d0temp and d1temp, starting from the planting date to the date of d0 or d1:
where Td is the mean temperature of day d and dp is the date of planting. The base temperature was set to 0°C. We used the LME model in equation 2. Daily mean temperature (°C) in the paddy field is shown in Supplementary Data 1-3.
Results
Relationships between manually measured traits and CH-related parameters in 30 rice cultivars
We characterized phenotype data of 30 Japanese rice cultivars (Supplementary Table 1) in 2019, 2020, and 2021 from the aspect of genetics and examined how to use the CH data for the prediction of traits usually measured by hand (Figure 1).
The sizes of interannual differences in phenotypic distribution depended on trait (Figure 4; Table 2; Supplementary Table 3). Distributions of DTH, CL, PL, and PN were highly overlapped among years, and heritability was high: that of DTH was 0.80, CL 0.81, PL 0.90, and PN 0.63 (Table 2). On the other hand, the phenotypic distribution of GW was wider than that of SLW, especially between 2021 and the other 2 years (Figure 4). Consistent with this, heritability of GW was 0.29, much lower than that of SLW at 0.77. Therefore, GW was more susceptible to year effect than SLW (Table 2). ADW is SLW + GW, and its heritability (0.70) was positioned between theirs.
Figure 4 Frequency distributions of manually measured traits and CH-related parameters, shown as histograms.
Table 2 Summary statistics of CH-related parameters and traits obtained by the linear mixed effects model.
We collected 2834 CH data of the 30 Japanese cultivars over the 3 years (Supplementary Table 4). Given the huge size of the dataset, we obtained CH-related parameters by applying time-series curves to the CH data by plot (Supplementary Figure 1). The range of phenotypic distributions tended to differ by year (Figure 4; Supplementary Table 3). Heritabilities of K (0.54) and d1 (0.63) were higher than those of r (0.21) and d0 (0.29), suggesting that r and d0, parameters of vegetative growth, might also be more susceptible to year effects than K and d1, parameters of the reproductive stage (Table 2).
We obtained correlation plots between the parameters and traits (Figure 5; Supplementary Figure 2). K was positively correlated with CL. This result is consistent with our previous data showing high correlation between CH and CL in several rice lines (Ogawa et al., 2021b). d1 was positively correlated with DTH, ADW, and SLW. These results motivated us to use K and d1 to predict traits.
Figure 5 Correlation plot of parameters and traits in 2019. Values are correlation coefficients (cor); circles present them by color and size.
Prediction of CL and DTH from CH-related parameters
CV indicated the accuracy of predicting CL and DTH from CH-related parameters (Table 3). In predicting the magnitudes of CL, coryear = 0.82 and corcultivar = 0.68; and of DTH, coryear = 0.89 and corcultivar = 0.85. The scatter plots between observed and predicted CL and DTH were highly correlated (Figure 6). In predicting the exact values of CL, RMSEyear = 0.05 m and RMSEcultivar = 0.04 m; and of DTH, RMSEyear = 5.2 days and RMSEcultivar = 4.2 days (Table 3). These RMSE values were smaller than the total standard deviations, the square root of the total variances (Table 2). These results indicate that the CH-related parameters had information that could be used to predict CL and DTH.
Figure 6 Observed and predicted values of CL, DTH, ADW, SLW, and GW. Plots show the results of CV by (left) year and (right) cultivar. The points plot datasets of predicted and observed values, colored by year; the lines are linear regressions applied to them.
We calculated the regression coefficients in the regression models. Since all predictors were standardized, the importance of each parameter in the model was quantified as the absolute value of each coefficient. The regression model to predict CL was
and the coefficient of determination was R2=0.684. The model to predict DTH was
and
In predicting CL, the coefficient of K (0.06, significant by t-test at 0.1%; Supplementary Table 5) had the largest absolute value, more than 4× the second largest one, that of d0 (0.01). In predicting DTH, the coefficient of d1 (6.82, significant by t-test at 0.1%; Supplementary Table 5) had the largest absolute value, more than 5× the second largest one, that of d0 (1.34). Therefore, in the linear regression model, K was the most important predictor of CL, and d1 was the most important predictor of DTH.
The linear regression models based on CH-related parameters explained the total variances of the manually measured traits, but it was still uncertain whether the relations between the two were derived from the characteristics of each cultivar. The LME model, which decomposed the total variance into cultivar, year, cultivar × year interactions, and residual, extracted the cultivar effects as the BLUPs from the whole data. First, the high heritabilities of CL (0.81) and DTH (0.80) imply that a large proportion of the total variance derived from cultivar effects. Note that heritability is the ratio of cultivar effect to the total variance. Second, the cultivar effects of CL, DTH, K, and d1, quantified as the BLUPs of each cultivar, showed a clear tendency that the cultivars with smaller K had a smaller CL, and those with smaller d1 had a smaller DTH (Figure 7). The correlation of cultivar BLUPs between K and CL was cor = 0.89, and that of d1 and DTH was cor = 0.94. These results indicate that the total variances of CL and DTH were largely prescribed by the cultivar effects of K and d1, respectively.
Figure 7 Scatter plots of cultivar effects between CH-related parameters and manually measured traits. (A) Flow chart of the process to generate scatter plots. The cultivar effect (Xl) on traits and parameters was extracted from the linear mixed effects model as BLUPs from the 3-year experiment. (B, C) Plots of cultivar effects showing correlations between (B) K and CL and between (C) d1 and DTH, ADW, SLW, or GW. Correlation coefficients (cor) are shown in plots. Cultivar name codes are shown in red.
Prediction of ADW, GW, and SLW from CH-related parameters
As it did for DTH and CL, our CV method gave the accuracy of prediction of ADW, GW, and SLW (Table 3). In predicting the magnitudes of ADW, coryear = 0.72 and corcultivar = 0.62 and of SLW, coryear = 0.81 and corcultivar = 0.74 (Figure 6). These values were better than predicting the magnitude of GW: coryear = 0.00 and corcultivar = 0.31. The scatter plots between observed and predicted SLW and ADW were highly correlated. In predicting the exact values of ADW, cross-validation by year (RMSEyear = 406.7 g) had better accuracy than that by cultivar (RMSEcultivar = 638.6 g), as had that of SLW (Table 3). The CH-related parameters contained information with which to predict ADW and SLW, but yearly fluctuations could increase RMSE. By contrast, as shown in the scatter plot between observed and predicted GW (Figure 6), the slopes were almost flat and the model explained little of the GW variance. Therefore, CH-related parameters held little information with which to predict GW, at least under our linear regression model.
The regression coefficients indicated the importance of each parameter in our regression models. (All predictors were standardized.) The regression model to predict ADW was
and R2=0.475 . The model to predict SLW was
and R2=0.684 . In predicting ADW and SLW, all four parameters were significant by t-test at 0.1%, and d1 had the largest absolute values (Supplementary Table 5). For ADW, the coefficient of d1 was 518, more than 2× the absolute value of d0 (−242), the second largest. Similarly, for SLW, the coefficient of d1 was 540, more than 2× the absolute value of d0 (−198). Therefore, d1 was the most important predictor of ADW and SLW.
The results of the LME model uncovered the effect of each cultivar on the total variance of ADW and SLW. The high heritabilities of ADW (0.70) and SLW (0.77) imply that a large proportion of total variance derived from cultivar effects. The cultivar effects of ADW, SLW, and d1, quantified as the BLUPs of each cultivar, showed a clear tendency in which cultivars with smaller d1 had smaller ADW and SLW (Figure 7C). On the other hand, the cultivar BLUPs of GW had little relation with those of d1. The correlations of cultivar BLUPs of d1 with SLW (cor = 0.93) and ADW (cor = 0.87) were higher than that with GW (cor = −0.05). The cultivar effects of d1 clearly reflected those of SLW. As ADW = SLW + GW, since the low correlation indicates that the cultivar effects of d1 and GW were almost independent, the result that the cultivar effects of d1 corresponded to those of ADW derives from the relation of the cultivar effects of d1 and SLW (Figure 7C; Supplementary Figures 3-5).
Sensitivity of accumulated daily mean temperature to CH data
The frequency distributions of the CH-related parameters differed among years (Figure 4), indicating that those may be influenced by environmental factors. In the developmental rate model, which is well known for the prediction of DTH in rice, daylength and daily mean temperature are explanatory variables (Horie et al., 1995). We asked whether the change of CH is affected by accumulated daily mean temperature instead of daylength, because the former varied among years (Supplementary Data 1-3), whereas daylength was almost constant owing to the similar planting dates. CH-related parameters d0 and d1 are based on time-series data, but d0temp and d1temp are based on accumulated daily mean temperature. Transforming d0 into d0temp increased heritability from 0.29 to 0.58 (Supplementary Table 6) and decreased the year effect from 61% to 21% (Figure 8), meaning that the year effect on d0 was explained mostly by the accumulated temperature. On the other hand, the heritability of d1temp (0.62) was almost the same as that of d1 (Supplementary Table 6; Supplementary Figure 6). These results indicate that the time point at the maximum CH (d1), which is linked to heading date, is insensitive to accumulated daily mean temperature, but that at the highest CH growth rate (d0) is sensitive to it.
Figure 8 Visualization of the linear mixed effects model for (A) d0 and (B) d0temp. Left, proportions of 4 variance components; right top, BLUPs of cultivar and cultivar × year interactions of 30 cultivars; right bottom, BLUPs of 3 years. Cultivars are sorted by d1. The numbers in cultivar components indicate percentage heritability.
Discussion
We constructed a time-series model and applied it to the data of 30 rice cultivars in 2019, 2020, and 2021, which were derived from UAV-based time-series aerial photography. In the case of maize (Anderson et al., 2019), soybean (Borra-Serrano et al., 2020) and sorghum (Chang et al., 2017), CH continues to increase, and there is little need to consider the difference of CH growth in between vegetative and reproductive stages. On the other hand, the CH decrease in the reproductive stage is distinct in rice. In our model, we introduced the parameters d1, the time point at the maximum CH; and a, the rate of CH decrease in the late growth phase, in addition to K, the maximum saturation value; d0, the time point at the highest growth rate; and r, the growth rate. Our model proved suitable for predicting CL, DTH, SLW, and ADW. The highly heritable CH-related parameters d1 and K contributed to the prediction of DTH and CL. Notably, d1 also contributed to the prediction of SLW and ADW, possibly reflecting the association of biomass with duration of vegetative growth. The cultivar effects of traits measured by hand (CL, DTH, SLW, and ADW) and their corresponding CH-related parameters were highly correlated. These results indicate that CH-related parameters are useful for the prediction of traits usually measured by hand, reinforcing the significance of time-series monitoring by UAV in high-throughput phenotyping.
Desai et al. (2019) proposed a method to precisely estimate heading date by detecting flowering panicles in RGB images taken with a fixed camera every 5 min. Their method has the advantage of directly detecting panicles but is unsuitable for use by UAV because it requires a higher shooting frequency and a lower shooting altitude. On the other hand, our UAV method enabled us to predict DTH by focusing on the features of time-series CH changes in images taken weekly. Similarly, Zhao et al. (2021) proposed a method to predict wheat heading date by applying a logistic curve to growth data obtained by UAV, extracting the date when the second derivative is minimum. Our and their studies indicate that time-series models derived from UAV data can reveal developmental changes in crops in the field.
Our approach relies on applying a time-series model to CH data spanning crop growth from the vegetative stage to the reproductive stage in the field, and uses CH-related parameters as summary statistics of each trajectory. Time-series or longitudinal trait data have been modeled in several ways, including random regression with the Legendre polynomial. Although this polynomial can be incorporated into the expectation-maximization algorithm (Yang et al., 2006) and kernel methods (Campbell et al., 2018; Campbell et al., 2019), it is difficult to interpret the coefficients in the models. The coefficients of our CH-related parameters, on the other hand, have explicit meaning in the context of phenology and allow better interpretability.
The LME models decomposed each CH-related parameter into cultivar, year, and cultivar × year interaction effects. We considered year effect as an environmental effect and examined the influence of accumulated daily mean temperature on CH data. Our results indicate that d0, a CH-related parameter in the vegetative stage, is sensitive to the accumulated daily mean temperature, but d1, in the reproductive stage, is not. It is possible that DTH, associated with d1, is regulated by daylength, but growth is affected by temperature. In terms of cultivar effect, we showed strong correlations between K and CL, and between d1 and DTH, ADW, or SLW, suggesting the high contribution of these CH-related parameters to the prediction of each trait. This analysis can be useful in cultivar characterization. For example, in the case of cultivars “HKR” and “TYM”, the BLUPs of ADW with d1 deviated from linear (Figure 7C), probably reflecting their high yield and biomass.
CV by using cor and RMSE evaluated the robustness of the regression models to predict CL, DTH, ADW, GW, and SLW in an untested year and in untested cultivars. CV using cor estimates the magnitude of the correlation. In predicting CL, DTH, ADW, and SLW, values of cor by both CV methods were high. CV using RMSEs, which estimates the accuracy at predicting exact values of test data, can evaluate model robustness from the viewpoint of model variance, the phenomenon by which the prediction fluctuates with the training data, which results in variance of the predicted values (Bishop, 2006; Hastie et al., 2009). In predicting CL and DTH, RMSEs were similar by both types of CV methods. However, in predicting ADW and SLW, RMSEs of CV by year were about 1.5 times higher than those of CV by cultivar. Therefore, the prediction of ADW and SLW had model variance derived from year.
This study provides a novel method to predict traits that would usually be measured by hand from CH-related parameters extracted from aerial time-series data. The parameters did not prove useful in the prediction of GW, which manually measured data showed was not heritable. This indicates that GW is more sensitive to environment, suggesting the necessity of environmental data for the prediction of GW. We will examine new models for GW prediction from environmental data and other UAV-derived time-series data.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author contributions
ST, J-iY, and DO conceptualized the research; ST, TS, RI, YN, HT, AG, KM, SO, HM, YT, and TI performed the investigations; ST and TS developed the methodology; YN, AG, KM, SO, HM, YT, and TI provided the resources; ST, TS, RI, and YN curated data; HM, YT, TI, J-iY, and DO helped with funding acquisition; ST and DO wrote the manuscript; TS and JY reviewed and edited the manuscript. All authors contributed to the article and approved the submitted version.
Acknowledgments
We thank Hitoshi Koyano, Takeshi Hayashi, and Hiromi Kajiya-Kanegae for scientific discussion. This study was supported by grants from the Ministry of Agriculture, Forestry and Fisheries of Japan (Smart-breeding System for Innovative Agriculture; grant numbers BAC1003 and BAC2001).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.998803/full#supplementary-material
Supplementary Figure 1 | Trajectory of each CH time-series curve. The LME model calculated the cultivar effects of each CH-related parameter, which generated the cultivar-specific time-series curve (“Cultivar E”).
Supplementary Figure 2 | Correlation plot of parameters and traits in (A) 2020 and (B) 2021. Values are correlation coefficients (cor); circles present them by color and size.
Supplementary Figure 3 | Visualization of the LME model for GW. Left, proportion of 4 variance components; right top, BLUPs of cultivar and cultivar × year interactions of 30 cultivars; right bottom, BLUPs of 3 years. Cultivars are sorted by d1.
Supplementary Figure 4 | Visualization of the LME model for ADW. Left, proportion of 4 variance components; right top, BLUPs of cultivar and cultivar × year interactions of 30 cultivars; right bottom, BLUPs of 3 years. Cultivars are sorted by d1.
Supplementary Figure 5 | Visualization of the LME model for SLW. Left, proportion of 4 variance components; right top, BLUPs of cultivar and cultivar × year interactions of 30 cultivars; right bottom, BLUPs of 3 years. Cultivars are sorted by d1.
Supplementary Figure 6 | Visualization of the linear mixed effects model for (A) d1 and (B) d1temp. Left, proportion of 4 variance components; right top, BLUPs of cultivar and cultivar × year interactions of 30 cultivars; right bottom, BLUPs of 3 years. Cultivars are sorted by d1.
References
Agata, A., Ando, K., Ota, S., Kojima, M., Takebayashi, Y., Takehara, S., et al. (2020). Diverse panicle architecture results from various combinations of Prl5/GA20ox4 and Pbl6/APO1 alleles. Commun. Biol. 3 (1), 1–17. doi: 10.1038/s42003-020-1036-8
Anderson, S. L., Murray, S. C., Malambo, L., Ratcliff, C., Popescu, S., Cope, D., et al. (2019). Prediction of maize grain yield before maturity using improved temporal height estimates of unmanned aerial systems. Plant Phenome J. 2 (1), 1–15. doi: 10.2135/tppj2019.02.0004
Bates, D., Machler, M., Bolker, B. M., Walker, S. C. (2015). Fitting linear mixed-effects models using lme4. J. Stat. Software 67 (1), 1–48. doi: 10.18637/jss.v067.i01
Borra-Serrano, I., De Swaef, T., Quataert, P., Aper, J., Saleem, A., Saeys, W., et al. (2020). Closing the phenotyping gap: High resolution UAV time series for soybean growth analysis provides objective data from field trials. Remote Sens. 12 (10), 1644. doi: 10.3390/rs12101644
Campbell, M., Momen, M., Walia, H., Morota, G. (2019). Leveraging breeding values obtained from random regression models for genetic inference of longitudinal traits. Plant Genome 12 (2), 180075. doi: 10.3835/plantgenome2018.10.0075
Campbell, M., Walia, H., Morota, G. (2018). Utilizing random regression models for genomic prediction of a longitudinal trait derived from high-throughput phenotyping. Plant Direct 2 (9), e00080. doi: 10.1002/pld3.80
Chang, A. J., Jung, J. H., Maeda, M. M., Landivar, J. (2017). Crop height monitoring with digital imagery from unmanned aerial system (UAS). Comput. Electron. Agric. 141, 232–237. doi: 10.1016/j.compag.2017.07.008
Desai, S. V., Balasubramanian, V. N., Fukatsu, T., Ninomiya, S., Guo, W. (2019). Automatic estimation of heading date of paddy rice using deep learning. Plant Methods 15, 1–11. doi: 10.1186/s13007-019-0457-1
Di Gennaro, S. F., Toscano, P., Cinat, P., Berton, A., Matese, A. (2019). A low-cost and unsupervised image recognition methodology for yield estimation in a vineyard. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.00559
Duan, B., Fang, S. H., Zhu, R. S., Wu, X. T., Wang, S. Q., Gong, Y., et al. (2019). Remote estimation of rice yield with unmanned aerial vehicle (UAV) data and spectral mixture analysis. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.00204
Endo-Higashi, N., Izawa, T. (2011). Flowering time genes Heading date 1 and Early heading date 1 together control panicle development in rice. Plant Cell Physiol. 52 (6), 1083–1094. doi: 10.1093/pcp/pcr059
Furbank, R. T., Tester, M. (2011). Phenomics - technologies to relieve the phenotyping bottleneck. Trends Plant Sci. 16 (12), 635–644. doi: 10.1016/j.tplants.2011.09.005
Gao, H., Jin, M. N., Zheng, X. M., Chen, J., Yuan, D. Y., Xin, Y. Y., et al. (2014). “Days to heading 7, a major quantitative locus determining photoperiod sensitivity and regional adaptation in rice,” in Proceedings of the National Academy of Sciences of the United States of America, Vol. 111 (46), 18399–18399. doi: 10.1073/pnas.1422341112
Giorgino, T. (2009). Computing and visualizing dynamic time warping alignments in r: the dtw package. J. Stat. Software 31 (7), 1–24. doi: 10.18637/jss.v031.i07
Gong, Y., Duan, B., Fang, S. H., Zhu, R. S., Wu, X. T., Ma, Y., et al. (2018). Remote estimation of rapeseed yield with unmanned aerial vehicle (UAV) imaging and spectral mixture analysis. Plant Methods 14 (70), 1–14. doi: 10.1186/s13007-018-0338-z
Grados, D., Reynarfaje, X., Schrevens, E. (2020). A methodological approach to assess canopy NDVI-based tomato dynamics under irrigation treatments. Agric. Water Manage. 240, 106208. doi: 10.1016/j.agwat.2020.106208
Hastie, T., Tibshirani, R., Friedman, J. (2009). The elements of statistical learning (New York: Springer).
Horie, T., Nakagawa, H., Centeno, H., Kropff, M. (1995). “The rice crop simulation model SIMRIW and its testing,” in Modeling the impact of climate change on rice production in Asia. Eds. Matthews, R. B., Kropff, M. J., Bachelet, D., van Laar., H. H. (Los Banos, Philippines: International Rice Research Institute), 51–66.
Ikeda, M., Miura, K., Aya, K., Kitano, H., Matsuoka, M. (2013). Genes offering the potential for designing yield-related traits in rice. Curr. Opin. Plant Biol. 16 (2), 213–220. doi: 10.1016/j.pbi.2013.02.002
Jiang, Z., Tu, H. F., Bai, B. W., Yang, C. H., Zhao, B. Q., Guo, Z. Y., et al. (2021). Combining UAV-RGB high-throughput field phenotyping and genome-wide association study to reveal genetic variation of rice germplasms in dynamic response to drought stress. New Phytol. 232 (1), 440–455. doi: 10.1111/nph.17580
Kronenberg, L., Yates, S., Boer, M. P., Kirchgessner, N., Walter, A., Hund, A. (2021). Temperature response of wheat affects final height and the timing of stem elongation under field conditions. J. Exp. Bot. 72 (2), 700–717. doi: 10.1093/jxb/eraa471
Liang, T., Duan, B., Luo, X. Y., Ma, Y., Yuan, Z. Q., Zhu, R. S., et al. (2021). Identification of high nitrogen use efficiency phenotype in rice (Oryza sativa l.) through entire growth duration by unmanned aerial vehicle multispectral imagery. Front. Plant Sci. 12. doi: 10.3389/fpls.2021.740414
Liu, Y. Q., Wang, H. R., Jiang, Z. M., Wang, W., Xu, R. N., Wang, Q. H., et al. (2022). Genomic basis of geographical adaptation to soil nitrogen in rice. Nature 590 (7847). doi: 10.1038/s41586-022-05362-0
Muthayya, S., Sugimoto, J. D., Montgomery, S., Maberly, G. F. (2014). An overview of global rice production, supply, trade, and consumption. Tech. Considerations Rice Fortification Public Health 1324, 7–14. doi: 10.1111/nyas.12540
Ninomiya, S. (2022). High-throughput field crop phenotyping: current status and challenges. Breed. Sci. 72 (1), 3–18. doi: 10.1270/jsbbs.21069
Nomura, T., Arakawa, N., Yamamoto, T., Ueda, T., Adachi, S., Yonemaru, J., et al. (2019). Next generation long-culm rice with superior lodging resistance and high grain yield, monster rice 1. PLoS One 14 (8), e0221424. doi: 10.1371/journal.pone.0221424
Ogawa, D., Sakamoto, T., Tsunematsu, H., Kanno, N., Nonoue, Y., Yonemaru, J. I. (2021a). Haplotype analysis from unmanned aerial vehicle imagery of rice MAGIC population for the trait dissection of biomass and plant architecture. J. Exp. Bot. 72 (7), 2371–2382. doi: 10.1093/jxb/eraa605
Ogawa, D., Sakamoto, T., Tsunematsu, H., Kanno, N., Nonoue, Y., Yonemaru, J. I. (2021b). Remote-sensing-combined haplotype analysis using multi-parental advanced generation inter-cross lines reveals phenology QTLs for canopy height in rice. Front. Plant Sci. 12. doi: 10.3389/fpls.2021.715184
Ogawa, D., Sakamoto, T., Tsunematsu, H., Yamamoto, T., Kanno, N., Nonoue, Y., et al. (2019). Surveillance of panicle positions by unmanned aerial vehicle to reveal morphological features of rice. PLoS One 14 (10), e0224386. doi: 10.1371/journal.pone.0224386
Paine, C. E. T., Marthews, T. R., Vogt, D. R., Purves, D., Rees, M., Hector, A., et al. (2012). How to fit nonlinear plant growth models and calculate growth rates: an update for ecologists. Methods Ecol. Evol. 3 (2), 245–256. doi: 10.1111/j.2041-210x.2011.00155.x
Peng, S. B., Khush, G. S., Virk, P., Tang, Q. Y., Zou, Y. B. (2008). Progress in ideotype breeding to increase rice yield potential. Field Crops Res. 108 (1), 32–38. doi: 10.1016/j.fcr.2008.04.001
Poudel, P., Bello, N. M., Lollato, R. P., Alderman, P. D. (2022). A hierarchical Bayesian approach to dynamic ordinary differential equations modeling for repeated measures data on wheat growth. Field Crops Res. 283, 108549. doi: 10.1016/j.fcr.2022.108549
R Core Team (2021). R: A language and environment for statistical computing (Vienna, Austria: R Foundation for Statistical Computing).
Sakamoto, T., Gitelson, A. A., Arkebauer, T. J. (2013). MODIS-based corn grain yield estimation model incorporating crop phenology information. Remote Sens. Environ. 131, 215–231. doi: 10.1016/j.rse.2012.12.017
Sakamoto, T., Ogawa, D., Hiura, S., Iwasaki, N. (2022). Alternative procedure to improve the positioning accuracy of orthomosaic images acquired with agisoft metashape and DJI P4 multispectral for crop growth observation. Photogrammetric Eng. Remote Sens. 88 (5), 323–332. doi: 10.14358/Pers.21-00064r2
Singh, D., Wang, X., Kumar, U., Gao, L. L., Noor, M., Imtiaz, M., et al. (2019). High-throughput phenotyping enabled genetic dissection of crop lodging in wheat. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.00394
Song, X. J., Kuroha, T., Ayano, M., Furuta, T., Nagai, K., Komeda, N., et al. (2015). Rare allele of a previously unidentified histone H4 acetyltransferase enhances grain weight, yield, and plant biomass in rice. Proc. Natl. Acad. Sci. U.S.A., 112, 76–81. doi: 10.1073/pnas.1421127112
Sugihara, G., May, R., Ye, H., Hsieh, C. H., Deyle, E., Fogarty, M., et al. (2012). Detecting causality in complex ecosystems. Science 338 (6106), 496–500. doi: 10.1126/science.1227079
Toda, Y., Kaga, A., Kajiya-Kanegae, H., Hattori, T., Yamaoka, S., Okamoto, M., et al. (2021). Genomic prediction modeling of soybean biomass using UAV-based remote sensing and longitudinal model parameters. Plant Genome 14 (3), e20157. doi: 10.1002/tpg2.20157
Wang, F. L., Wang, F. M., Zhang, Y., Hu, J. H., Huang, J. F., Xie, J. K. (2019a). Rice yield estimation using parcel-level relative spectra variables from UAV-based hyperspectral imagery. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.00453
Wang, X. Q., Zhang, R. Y., Song, W., Han, L., Liu, X. L., Sun, X., et al. (2019b). Dynamic plant height QTL revealed in maize through remote sensing phenotyping using a high-throughput unmanned aerial vehicle (UAV). Sci. Rep. 9, 1–10. doi: 10.1038/s41598-019-39448-z
Xing, Y., Zhang, Q. (2010). Genetic and molecular bases of rice yield. Annu. Rev. Plant Biol. 61 (1), 421–442. doi: 10.1146/annurev-arplant-042809-112209
Yadav, S., Singh, U. M., Naik, S. M., Venkateshwarlu, C., Ramayya, P. J., Raman, K. A., et al. (2017). Molecular mapping of QTLs associated with lodging resistance in dry direct-seeded rice (Oryza sativa l.). Front. Plant Sci. 8. doi: 10.3389/fpls.2017.01431
Yang, G. J., Liu, J. G., Zhao, C. J., Li, Z. H., Huang, Y. B., Yu, H. Y., et al. (2017). Unmanned aerial vehicle remote sensing for field-based crop phenotyping: current status and perspectives. Front. Plant Sci. 8. doi: 10.3389/fpls.2017.01111
Yang, R. Q., Tian, Q., Xu, S. Z. (2006). Mapping quantitative trait loci for longitudinal traits in line crosses. Genetics 173 (4), 2339–2356. doi: 10.1534/genetics.105.054775
Yue, J. B., Yang, G. J., Li, C. C., Li, Z. H., Wang, Y. J., Feng, H. K., et al. (2017). Estimation of winter wheat above-ground biomass using unmanned aerial vehicle-based snapshot hyperspectral sensor and crop height improved models. Remote Sens. 9 (7), 708. doi: 10.3390/rs9070708
Zhao, L. C., Guo, W., Wang, J., Wang, H. Z., Duan, Y. L., Wang, C., et al. (2021). An efficient method for estimating wheat heading dates using UAV images. Remote Sens. 13 (16), 3067. doi: 10.3390/rs13163067
Zhao, K., Tung, C.-W., Eizenga, G. C., Wright, M. H., Ali, M. L., Price, A. H., et al. (2011). Genome-wide association mapping reveals a rich genetic architecture of complex traits in oryza sativa. Nat. Commun. 2 (1), 467. doi: 10.1038/ncomms1467
Keywords: UAV, canopy height, rice, heading date, biomass, time-series, BLUP
Citation: Taniguchi S, Sakamoto T, Imase R, Nonoue Y, Tsunematsu H, Goto A, Matsushita K, Ohmori S, Maeda H, Takeuchi Y, Ishii T, Yonemaru J-i and Ogawa D (2022) Prediction of heading date, culm length, and biomass from canopy-height-related parameters derived from time-series UAV observations of rice. Front. Plant Sci. 13:998803. doi: 10.3389/fpls.2022.998803
Received: 20 July 2022; Accepted: 28 November 2022;
Published: 13 December 2022.
Edited by:
Sindhuja Sankaran, Washington State University, United StatesReviewed by:
René Gislum, Aarhus University, DenmarkKang Yu, Technical University of Munich, Germany
Chongyuan Zhang, Purdue University, United States
Copyright © 2022 Taniguchi, Sakamoto, Imase, Nonoue, Tsunematsu, Goto, Matsushita, Ohmori, Maeda, Takeuchi, Ishii, Yonemaru and Ogawa. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Daisuke Ogawa, Daisuke.Ogawa@affrc.go.jp