- 1The National Fund for Scientific and Technological Development, Talca, Chile
- 2Plant Breeding and Phenomic Center, Faculty of Agricultural Sciences, Universidad de Talca, Talca, Chile
- 3Department of Food Biofunctionality, Institute of Nutritional Sciences, University of Hohenheim, Stuttgart, Germany
- 4Institute of Biological Sciences, University of Talca, Talca, Chile
Plants produce a wide diversity of specialized metabolites, which fulfill a wide range of biological functions, helping plants to interact with biotic and abiotic factors. In this study, an integrated approach based on high-throughput plant phenotyping, genome-wide haplotypes, and pedigree information was performed to examine the extent of heritable variation of foliar spectral reflectance and to predict the leaf hydrogen cyanide content in a genetically structured population of a cyanogenic eucalyptus (Eucalyptus cladocalyx F. Muell). In addition, the heritable variation (based on pedigree and genomic data) of more of 100 common spectral reflectance indices was examined. The first profile of heritable variation along the spectral reflectance curve indicated the highest estimate of genomic heritability (=0.41) within the visible region of the spectrum, suggesting that several physiological and biological responses of trees to environmental stimuli (ex., light) are under moderate genetic control. The spectral reflectance index with the highest genomic-based heritability was leaf rust disease severity index 1 (=0.58), followed by the anthocyanin reflectance index and the Browning reflectance index (=0.54). Among the Bayesian prediction models based on spectral reflectance data, Bayes B had a better goodness of fit than the Bayes-C and Bayesian ridge regression models (in terms of the deviance information criterion). All models that included spectral reflectance data outperformed conventional genomic prediction models in their predictive ability and goodness-of-fit measures. Finally, we confirmed the proposed hypothesis that high-throughput phenotyping indirectly capture endophenotypic variants related to specialized metabolites (defense chemistry), and therefore, generally more accurate predictions can be made integrating phenomics and genomics.
Introduction
Plants produce an outstandingly wide diversity of specialized metabolites (or secondary metabolites), fulfilling a wide range of biological functions, and helping plants to better cope with abiotic and biotic factors. These molecules are usually divided into three groups, including phenolic compounds, terpenes, and nitrogen-containing compounds (Zeng et al., 2020; Balestrini et al., 2021; Mieres-Castro et al., 2021). The major classes of nitrogen-containing specialized metabolites in the plant kingdom are cyanogenic glucosides, alkaloids, and non-protein amino acids. In particular, cyanogenic glycosides are present in various trees and plants, many of which are utilized as a source of food for humans and animals (Gleadow and Møller, 2014). These specialized metabolites are part of a plant’s strategy against herbivores and therefore play a major role in the ecosystem as defense-related compounds (Zenk and Juenger, 2007). The endogenous plant enzymes can react with cyanogenic glycosides and release hydrogen cyanide (hereafter referred to as HCN), a process known as cyanogenesis, which may be toxic to generalist herbivores and pathogens (Appenteng et al., 2021). Therefore, the effectiveness of cyanogenesis is a phytochemical defense strategy, dependent on cyanogenic plants’ capability to release HCN in sufficient quantities to be considered toxic.
There are over 3,000 cyanogenic plant species, representing more than 130 families, including Fabaceae, Leguminosae, Myrtaceae, Rosaceae, and many others (Gleadow and Møller, 2014; Appenteng et al., 2021; Mora-Poblete et al., 2021). In addition to serving as important compounds, cyanogenic glycosides play multiple roles in physiological functions involved in phenotypic plasticity during specific developmental stages and particularly under challenging environmental conditions, such as drought (Gleadow and Møller, 2014; Pičmanová et al., 2015; Rosati et al., 2019). In fact, cyanogenic glycosides can play an important role in primary metabolism processes as nitrogen and glucose transporters (Møller, 2010). The metabolic profile in cyanogenic species is modulated according to the interaction between genes and environmental conditions, resulting in a physiological trade-off between the production of defense metabolites and growth-related tasks (Gleadow and Woodrow, 2000; Gleadow and Møller, 2014).
The biosynthesis, degradation, biological functions, polymorphism, and regulation of cyanogenic glycosides have been widely studied in plant species (Sun et al., 2018), such as sorghum (Darbani et al., 2016), cassava (Zidenga et al., 2017), Prunus spp. (Thodberg et al., 2018), and Eucalyptus spp. (Goodger et al., 2004; Hansen et al., 2018). More than 20 species of the Eucalyptus genus have been identified to be cyanogenic (Gleadow et al., 2008), which serves as a powerful experimental system to study cyanogenesis as a chemical defense response against herbivores and pathogens. Among the Eucalyptus species studied to examine their cyanogenic content are E. nobilis, E. polyanthemos, E. yarraensis, E. dalrympleana, E. camphora, E. viminalis, and E. cladocalyx (Gleadow et al., 2003, 2008; Goodger et al., 2004; Neilson et al., 2006; Hansen et al., 2018). Gleadow et al. (2003) and Neilson et al. (2006) reported concentrations of HCN ranging from 0 to 0.153 mg HCN g−1 dw and from 0.01 to 0.5 mg HCN g−1 dw in leaves of E. nobilis and E. camphora, respectively. In E. polyanthemus, Goodger et al. (2004) found that these compounds can vary from 0.002 to 0.2 mg HCN g−1 dw in leaves by supplementing the plants with different concentrations of nitrogen. Moreover, prunasin concentration is highly variable depending on tissue type and leaf age in E. cladocalyx (Gleadow and Møller, 2014). In fact, prunasin concentration of less than 20 ug HCN g−1 dw can be found in adult leaves, while a concentration of 60 ug HCN g−1 dw can be found in immature flower buds (Hansen et al., 2018). Furthermore, Mora-Poblete et al. (2021) reported that the concentration in adult leaves can reach up to 1.5 mg HCN g−1 dw under dry land conditions. Different types of cyanogenic glycosides have been identified in Eucalyptus trees, such as prunasin (the predominant type); amygdalin; sambunigrin (the epimer of prunasin); neoamygdalin (the epimer of amygdalin); and eucalyptosin A, B, and C (Gleadow et al., 2008; Neilson et al., 2011; Hansen et al., 2018). These specialized metabolites are synthesized from phenylalanine, a process in which several cytochrome P450s (CYP) proteins are involved, to produce monoglycosides (Yamaguchi et al., 2014; Hansen et al., 2018). Particularly, defense mechanisms against pathogens and pests have rarely been reported in cyanogenic Eucalyptus spp., unlike other crops (Yactayo-Chang et al., 2020). On the other hand, it has been shown that the biosynthesis of these metabolites is induced by different abiotic factors (i.e., soil nutrients and water level, light, and temperature; Gleadow and Woodrow, 2002; Goodger et al., 2002, 2004; Simon et al., 2010).
In E. cladocalyx, prunasins were mainly found in floral and vegetative tissues, along with amygdalin in a minor concentration (Hansen et al., 2018). In this cyanogenic species, three CYP proteins and one UDP-glucuronosyltransferase protein (UGT) are involved in prunasin biosynthesis. First, L-phenylalanine is converted into phenyl acetaldoxime by CYP79A125 and is then dehydrated into phenyl acetonitrile by CYP706C55. Phenyl acetonitrile is converted into mandelonitrile by CYP71B103. Subsequently, UGT85A59 catalyzes the conversion of mandelonitrile into prunasin. In other cyanogenic plants, it has also been reported that another UGT protein catalyzes the conversion of cyanogenic monoglycosides (such as prunasin) into diglycosides (amygdalin; Yamaguchi et al., 2014; Del Cueto et al., 2017; Thodberg et al., 2018).
Despite the importance of cyanogenic glycosides as a source of defense-related compounds, and the advances made in the biosynthesis, catabolism, transport, and storage of these specialized metabolites, few published works have focused on the prediction of the amounts of cyanogenic glycosides in Eucalyptus trees. The complex nature of the genetic architecture of cyanogenic glycoside content may be a bottleneck for genomic prediction studies, especially when high-density marker panels are not available or when study species are poorly represented in commercial DNA arrays (Aguirre et al., 2019; Ballesta et al., 2020; Lebedev et al., 2020). According to Pryce et al. (2014), the use of low-density marker panels will inevitably affect the accuracy of the genomic prediction of target traits to some degree. In this sense, Mora-Poblete et al. (2021) found a moderate predictive ability for HCN content (of up to 0.47) in a genetically structured breeding population of cyanogenic E. cladocalyx, using a low-density marker panel of single nucleotide polymorphisms (SNPs) and haplotype blocks constructed from them, which had a better predictive ability (slightly higher) than the individual SNPs for predicting HCN content in leaves. Consistently, these findings are in line with other studies, which have concluded that the use of haplotypes, instead of individual SNPs, could have a higher predictive ability of certain quantitative traits (Contreras-Soto et al., 2017; Ballesta et al., 2019; Valenzuela et al., 2021). Consequently, in the present study, we examined a strategy of prediction that integrates conventional pedigree information, genomic data, and high-throughput phenotyping techniques, which is an effective method with which to break through the bottleneck of low-density marker panels.
Similarly, an integrated genomic and phenomic selection strategy that has been performed in breeding programs of plants and forest trees (Rincent et al., 2018; Krause et al., 2019), in which spectral reflectance data are used as regressors (or reflectance-derived relationship matrices) for accelerating the breeding progress of complex traits. Interestingly, Rincent et al. (2018) and Krause et al. (2019) observed that prediction models using spectral signatures performed similarly or superior to marker- and pedigree-based genomic selection models when predicting within and across environments. We hypothesized that high-throughput phenotyping platforms could indirectly capture endophenotypic variants, which could be related to specialized metabolites, and, therefore, we expected to be able to perform a more robust prediction, considering the spectral reflectance relationship among the trees. Consequently, indirect endophenotypic measures could enable these phenomic prediction methods to be incorporated into studies of specialized metabolites, such as cyanogenic glycosides. Additionally, we presented the first profile of heritable variation (genomic- and pedigree-based approaches) in the leaves of cyanogenic Eucalyptus (E. cladocalyx) along the spectral reflectance curve for each 1-nm wavelength interval (from 400 to 2,400 nm). The genomic- and pedigree-based heritability of the most widely used spectral reflectance indices (SRIs) was also estimated, such as the photochemical reflectance index (PRI), the green normalized difference vegetation index (GNDVI), water-related SRIs, the normalized difference vegetation index (NDVI), and the normalized pigment chlorophyll ratio index (NPCI), among others. This represents the first exploration of the genomic variation along the reflectance curve in a tree species and supports high-throughput phenotyping as a suitable approach for the prediction of specialized metabolites.
Materials and Methods
Plant Materials
The study was carried out in a provenance/progeny trial of Eucalyptus cladocalyx, established in 2001, and situated in the southern Atacama Desert, in Chile, Choapa Province (31° 55′ S, 71° 27′ W, 167 m.a.s.l.). The climate in this area is classified as predominantly arid, according to the De Martonne aridity index (Arriagada et al., 2018). The trial consisted of a genetically structured population with 49 half-sib families (details in Valenzuela et al., 2021) according to a randomized complete block design, with 30 blocks and single-tree plots.
The concentration of hydrogen cyanide (HCN) was determined in the leaves of 310 trees with three replications using the protocol developed by (Brinker and Seigler, 1989). Fully expanded and fresh mature leaves (~10 leaves per tree) were collected in the first third and from the northern side of the tree canopy (Woodrow et al., 2002). The methods for obtaining and quantifying HCN for each leaf sample can be reviewed with details in Mora-Poblete et al. (2021). Briefly, a hydrolysis of cyanogenic glucosides from plant tissue was performed, trapping the resulting HCN in a well containing 1 M NaOH. The cyanide captured was quantified using the König reaction. The final absorbance was measured at 595 nm using a Genesys 10UV spectrophotometer (ThermoSpectronic). The amount of cyanide was determined by interpolation into a calibration curve built with sodium cyanide (0.2–1.2 μg/ml, R2 = 0.9935). All samples were analyzed in triplicate, and results were presented as mean mg HCN equivalents g−1 dry weight (dw). The cyanide concentration was expressed as mg HCN equivalents g−1 dry weight (dw) primarily for the glucoside prunasin (Gleadow and Woodrow, 2000). The trees exhibited a HCN content from <0.0001 mg HCN g−1 dw up to 1.54 mg HCN g−1 dw in their leaves.
Spectral Reflectance Assessments
Absolute reflectance measurements of leaves (0.1 g lyophilized leaf powder per sample) were performed using a portable FieldSpec® 4 HiRes spectroradiometer (ASD Inc., Boulder, CO, United States), which covers the 350–2,500 nm range (the full range of the solar irradiance spectrum) with a 2.3-mm-diameter optical fiber. The spectral range between 350–399 and 2,400–2,500 nm was removed. RS3 software (ASD Inc., Boulder, CO, United States) was used to calibrate and control the spectrometer and acquire spectral signatures. The equipment was configured to integrate three samples per scan. The reflectance data were extracted using View Spec Pro 2008 software (ASD Inc., Boulder, CO, United States). The spectral data were pre-processed according to method of Rincent et al. (2018), in the R package Prospectr (Stevens and Ramirez-Lopez, 2014), in which the reflectance measures were normalized (centered and scaled), and their first derivative was computed using a Savitzky–Golay filter, with a window size of 37 data points.
The raw spectral reflectance data were also used to calculate more than 125 previously characterized spectral reflectance indices (SRIs), using either normalized or simple ratios of reflectance measures in the hsdar package (Lehnert et al., 2018). These SRIs have been shown to correlate with different physiological and biochemical components in plants and to provide information about several physiological (and agronomic) traits (Lobos and Poblete-Echeverría, 2017). A stepwise regression was conducted to evaluate and determine an appropriate number of predictor variables (SRIs) for HCN content. The optimal model was selected based on Akaike’s information criterion (AIC). Multicollinearity among SRIs was examined using the variance inflation factor (VIF). The VIF value for each SRI was interpreted as follows (Liu et al., 2019; Qian et al., 2019): 0 < VIF < 10, 10 ≤ VIF < 100 and VIF ≥ 100, indicating that there is no strong and severe multicollinearity, respectively.
DNA Isolation and Genotyping
Genomic DNA was extracted from the leaves of individual samples of E. cladocalyx, according to the method of Ballesta et al. (2019, 2020)>. All individuals were genotyped using an array (Illumina Infinium) of ~60,000 single nucleotide polymorphisms (SNPs). SNP markers with a call rate < 90% and a minor frequency allele <0.05 were removed from the SNP data matrix. A total of 3,897 SNPs were retained, which were distributed across the 11 chromosomes (~ 350 SNPs per chromosome) of Eucalyptus, with a density of one SNP for every 11,000 bp. After applying these filters, haplotype blocks were constructed according to the solid spine method in Haploview v. 4.2 (Barrett et al., 2005), which were later used in the haplotype-based genomic prediction of HCN. Only haplotype blocks with a D ‘value ≥ 0.9 and a LOD score ≥ 2 were considered for the further analyses. According to previous studies, genomic prediction models based on SNPs forming haplotypes have a slightly higher predictive ability of HCN content than individual markers (Mora-Poblete et al., 2021). Therefore, this antecedent was considered in the present study, in such a way that all genomic prediction models were implemented using haplotypes as predictor variables, instead of SNP markers.
Heritability in Single Wavelengths and Spectral Reflectance Indices of Leaf
The following prediction models were used to estimate the heritable variation along the reflectance curve and for all SRIs evaluated in Eucalyptus leaves:
where corresponds to phenotype records, i.e., SRIs or spectral reflectance measurements for each wavelength. , , , , and are the incidence matrices of the associated vectors. corresponds to the vector of the block effect (experimental design). and are the vectors of genetic population structure (Pritchard et al., 2000) and provenance (seed source) effects, respectively. is the vector of genomic values with ∼ N(0, G1 ), where G1 is the genomic relationship matrix (based on filtered SNP data), and corresponds to the genomic variance component. is the vector of the polygenic background effects (based on pedigree information), in which ∼ N(0, A), where A corresponds to the pedigree-based numerator relationship matrix and is the additive genetic variance, and is the vector of residual effects, distributed as ∼ N(0, I), where I is the identity matrix and is the residual variance. Heritability estimates (narrow sense heritability and genomic heritability ) of each wavelength measure and SRI were calculated according to the following formulas: and .
Prediction Models for HCN in Eucalyptus Leaves
The following prediction models were performed using the pedigree, genomic, and spectral information. The first model included only genomic data, as follows:
where corresponds to the vector of adjusted phenotypes (HCN) for block effects (experimental design). is the incidence matrix associated with the vector, which corresponds to genomic values based on haplotypes (Mora-Poblete et al., 2021). The Model 4 can be expressed in matrix form as:
where the terms and are described in the 2.4 section (see Models 1 and 2). The Model 5 included the population genetic structure and polygenic effects (see above):
The fourth prediction model (Model 6) combined the polygenic and genomic effects:
The following model (Model 7) included the spectral data as regressors in the prediction model:
where, is the incidence matrix associated to vector, which corresponds to the vector of wavelength effects (Gonçalves et al., 2021). The prediction Model 8 comprised the genomic and wavelength effects:
Similarly, the Model 9 included the polygenic and wavelength effects:
The Models 10, 11, and 12 included the SRIs effects, as follows:
where, is the incidence matrix for the vector (fixed effect), which corresponds to SRIs’ effects (1, …, n). The vector contains a single SRI or multiple SRIs (selected according to the stepwise regression method); as a covariate (s) in the model. The Model 13 included the wavelength effects in combination with polygenic and genomic effects:
The Model 14 corresponded to the full prediction model, which integrated all effects above described:
The following three Bayesian regression methods were used to predict the effect of each wavelength: Bayes B, Bayesian ridge regression (BRR), and Bayes C (Meuwissen et al., 2001; Gianola et al., 2006; Habier et al., 2011). These methods were selected because they have different analytical assumptions, which allows the analysis of traits with different genetic architectures, and are traditionally used in the context of genomic prediction. BRR method assumes that the regressors (For instance, SNPs or reflectance measurements) have a common variance and have the same contraction effect. The predictor effect () is distributed , and the common variance is distributed scaled- inverse Chi-squared ((, where and corresponds to degree freedom and scale parameters, respectively. Contrarily, Bayes B denotes that each regressor has its own variance, and uses a mixed distribution with a mass at zero, such that the prior distribution of the effects of the all regressors is assumed as:
The π parameter represents the probability that the regressor effect tends to be zero. Bayes C is a method that combines assumptions of Bayes B and BRR, in which the predictors effects have a common variance and it assigns a non-null prior probability for the predictor effect to be equal to zero (see more details about methods in Meuwissen et al., 2001, Gianola et al., 2006 and Habier et al., 2011). The Bayesian Generalized Linear Regression (BGLR) library in R (Pérez and De los Campos, 2014) was used for fitting all models and making predictions. The BGLR procedure considered a run with 1,000,000 iterations, a burn-in period of 100,000 and a thin of 50.
Assessing Model Fitting and Predictive Ability
Prediction models 3 to 14 were evaluated and compared in terms of their predictive ability (PA) and goodness of fit using the deviance information criterion (DIC; Spiegelhalter et al., 2002). A DIC difference > 10 between two competitive models was considered to be supported against a model with higher DIC; a DIC difference between 3 and 10 was considered as substantial difference between models, while a difference < 3 was considered as not significant. All models were calibrated using ~300 individuals (which were also used for the estimation of genetic parameters) and validated considering the 20% of the total population. The PA of each model was calculated as the correlation between the adjusted phenotypes from the validation dataset and the predicted phenotypes (; Ballesta et al., 2020; Mora-Poblete et al., 2021). A process of fivefold cross-validation was used to evaluate the PA of all models.
Results
Heritable Variation in Single Wavelengths and SRIs
The estimates of genomic heritability for 1-nm wavelength bands along the reflectance curve (400–2,400 nm) ranged from 0.18 to 0.41, whereas pedigree-based estimates varied from 0.19 to 0.46 (Figure 1). The pedigree- and genomic-based heritability patterns were highly correlated (Pearson’s coefficient, r = 0.60). The highest estimate of genomic heritability ( = 0.41) was found within the visible region: 460–490 nm. The estimates of pedigree-based heritability peaked at two different wavelength regions. The first one was at 575–590 nm (= 0.46; the visible region), and the second one was at 700–715 nm ( = 0.46; the red edge spectral region).
Figure 1. Estimation of pedigree-based heritability (blue) and genomic heritability (green) along the reflectance curve (in the range of 400–2,400 nm) in leaves of a cyanogenic species of Eucalyptus. The upper axis indicates the range of the three main regions of the electromagnetic spectrum: visible, near infrared (NIR), and shortwave infrared (SWIR).
The SRI with the highest genomic-based heritability was leaf rust disease severity index 1 (LRDSI1; = 0.58; Table 1; Supplementary Table S1), which was calculated with reflectance measurements at 605 and 455 nm, followed by the anthocyanin reflectance index (ARI) and the Browning reflectance index (BRI; = 0.54); whereas the least heritable index was the enhanced vegetation index (EVI; = 0.22), which was calculated using reflectance at 800, 670, and 475 nm. The index with the highest heritability based on pedigree was the REP_LE index ( =0.48), which was calculated with reflectance measures within the red edge region, followed by the double difference Index (DD), REP_LE, the derivative index (D2), and the Datt index ( = 0.47), whereas the least heritable index was the EVI index (EVI; = 0.18). According to the stepwise regression method, four spectral reflectance indices were significantly associated with HCN (p < 0.05); i.e., simple ratio 10 (SR10), normalized difference lignin index (NDLI), normalized difference nitrogen index (NDNI), and normalized pigment chlorophyll index (NPCI). The multiple regression adjusted by the three SRIs explained ~18% of the total variation (p = 7.336*10−11). The SR10, NDLI, NDNI, and NPCI indices explained ~10, 8, 2, and 6% of the total variation, respectively. In this way, these four SRIs were considered for predicting HCN. Heritability estimates based on pedigree and genomic data (SNP markers) were equivalent for the indices SR10 ( = 0.41; = 0.41) and NDNI ( = 0.28; = 0.29; Table 2). The pedigree-derived relationship matrix accounted for a greater portion of the NDLI index variation than the SNP-derived relationship matrix ( = 0.34; = 0.28). In contrast, the estimation of genomic heritability of the NPCI index ( = 0.46) had a higher value than that based on pedigree ( = 0.40).
Table 1. The most heritable spectral reflectance indices (SRIs), according to genomic-based heritability measured in adult leaves of cyanogenic Eucalyptus cladocalyx.
Table 2. Estimates of pedigree-based heritability () and genomic heritability () of selected reflectance indices (SRIs): simple ratio 10 (SR10), normalized difference lignin index (NDLI), normalized difference nitrogen index (NDNI), normalized pigment chlorophyll index (NPCI), and hydrogen cyanide (HCN) content.
Prediction of HCN Content in Leaves of Adult Eucalyptus Trees Using an Integrated Approach
The goodness-of-fit measures for all the prediction models of HCN content that consider spectral reflectance data, based on different Bayesian regression methods (i.e., Bayes B, Bayes C, and Bayesian ridge regression), are shown in Table 3. On average, the Bayes B method had the best goodness-of-fit statistics (deviance information criterion; DIC) for the majority of the study models. Therefore, based on this Bayesian method, predictive ability (PA) estimates ranged from 0.41 to 0.60 (Table 4). Models that included spectral reflectance data as regressors outperformed conventional genomic prediction models in terms of PA and goodness-of-fit measures (Table 4). In fact, the goodness of fit of the prediction model based exclusively on genomic data (Model 3) was significantly increased with the incorporation of spectral data (Models 8 and 10; ∆DIC = 30 and 70, respectively).
Table 3. Goodness-of-fit testing for all prediction models of HCN content that consider spectral reflectance data, based on different Bayesian regression methods: Bayes B, Bayes C, and Bayesian ridge regression (BRR).
Table 4. Predictive ability (PA) and goodness-of-fit measures for all models used for predicting cyanide (HCN) content in Eucalyptus trees.
The prediction model with the best goodness-of-fit measure (DIC = −395.2) was the one that included genomic, spectral reflectance, and pedigree information (Model 13), which had a predictive ability of 0.59. In terms of each component of Model 13, spectral reflectance data explained the greatest percentage of the phenotypic variation (24%), whereas genomic and pedigree data explained ~22% and ~ 12% of the total variation in HCN content, respectively. The model that included reflectance indices (SRIs; as covariates) showed a slight reduction in terms of goodness of fit (DIC = −392.7), whereas the PA value was maintained. The solution of wavelength effects accomplished using Model 13 is shown in Figure 2. Seven specific points of the reflectance spectrum had a greater effect on HCN: two regions within the visible spectrum (400–410 and 530–540 nm), one region within the visible–red edge spectral regions (660–670 nm) and four regions in the near and shortwave infrared regions (1,200–1,210, 1,490–1,500, 1,650–1,660, and 2,125–2,130.
Figure 2. Absolute values of the wavelength effects along the reflectance curve (400–2,400 nm), estimated by the model with the best goodness-of-fit results (Model 13). The upper axis shows the three main regions of the spectrum: visible, near infrared (NIR), and shortwave infrared (SWIR).
Ten out of twelve models that included either spectral reflectance data or SRIs as regressors had a PA value above 0.5, whereas the models that exclusively included genomic and/or pedigree data had a PA below this value. The models that included the spectral reflectance component (h) had a PA value varying between 0.57 and 0.6. It should be noted that the goodness-of-fit statistics of Model 7 (exclusively based on spectral data) were increased by ~3 and 6% when genomic (Model 8) and pedigree (Model 9) data were included, respectively. The measures of the PA and goodness of fit of the model based exclusively on genomic data (Model 3; DIC = −307.4; PA = 0.47) were enhanced with the integration of genomic data and spectral reflectance indices (Model 10; DIC = −338.7; PA = 0.56). In particular, the prediction model that combined genomic data and the SR10 index had a lower DIC (DIC = −339.5) than the model based exclusively on genomic data and a 21% higher PA value. Additionally, the NPCI index increased the PA of model 3 by 10%, and improved its goodness of fit by 5% (DIC = −324.0). In contrast, the model that combined spectral data and SRIs (Model 11) had a lower goodness-of-fit statistic than the model based exclusively on spectral reflectance data (Model 7).
Discussion
Heritable Variation in Single Wavelengths
The spectral reflectance of the E. cladocalyx leaves showed a relatively moderate genetic control (h2 = 018–0.46) across the 400-to-2,400-nm spectral range. The heritability distribution along the reflectance curve indicates that additive gene effects fluctuate along the reflectance curve (Čepl et al., 2018). The reflectance regions with the highest genetic control (either genomic- or pedigree-based heritability) corresponded to the visible spectrum, between the wavelengths 460–490 and 575–590 nm, and the red edge region (700–715 nm). In agreement with this, Čepl et al. (2018), found that the maximum pedigree-based heritability value (i.e., h2 = 0.39) corresponded to the red edge inflection point (722 nm wavelength band) measured in a half-sib progeny test of Pinus sylvestris. The canopy reflectance at wavelengths between 400 and 1,000 nm is predominantly influenced by plant pigments (i.e., chlorophylls a and b) and cell structures (Schlerf et al., 2010; Raper and Varco, 2015; Zarco-Tejada et al., 2019). For instance, Schlerf et al. (2010), reported that the reflectance in the visible and red edge spectrum regions, measured in the canopy of Picea abies, can be used as a proxy for chlorophyll and nitrogen content. These findings can be observed in trees grown in different types of climates, such as tropical, Mediterranean, and oceanic climates (Schlerf et al., 2010; Li et al., 2019; Zarco-Tejada et al., 2019). In this context, chlorophyll content and other components in leaf cells could not only be regulated by abiotic and/or biotic factors, but also may be determined by inheritance patterns. Li et al. (2019) determined that the chlorophyll content and other variables related to the leaf color in Sassafras tzumu exhibit a high additive genetic control. In contrast, McKown et al. (2014) and Guerra et al. (2016) showed that ecophysiological parameters, such as chlorophyll content, photosynthetic rate, and nitrogen content, exhibit relatively low genetic control in Populus spp. This implies that the genetic control of the cellular components in the leaves (e.g., pigments) and their reflectance could be plant-specific. According to the results of this study, the cellular components of E. cladocalyx leaves, associated with reflectance between wavelengths 460 and 700 nm (e.g., chlorophylls and other pigments), could show relatively moderate genetic control.
In the present study, the estimates of the heritability of reflectance based on pedigree and genomic approaches in the short-wave infrared region (1,400–2,400 nm) were relatively lower than those in the visible spectrum region (including the red edge region). The reflectance in the short-wave infrared region has been associated with different primary and secondary metabolites, such as starch, celluloses, lignin, and other carbohydrates (Türker-Kaya and Huck, 2017; Čepl et al., 2019). In this sense, Paaso et al. (2017) reported that lignin exhibits a relatively low level of genetic control in senescent leaves of Betula pendula. Moreover, both the cellulose and lignin content in Eucalyptus wood are traits of moderate genetic control (Denis et al., 2013; Chen et al., 2018), which is consistent with the present study results.
Heritable Variation in Spectral Reflectance Indices Associated With HCN Content in Leaves
Among the studied SRIs, only 3% of them were significantly associated with HCN. According to stepwise regression analysis, the SR10, NPCI, and NDLI indices were negatively correlated with HCN, whereas NDNI was positively correlated. These four selected SRIs had a similar pattern of heritability to those estimated at each wavelength. The indices calculated based on reflectance in the visible spectrum (i.e., SR10 and NPCI) showed greater heritable variation than the SRIs calculated within the SWNIR spectrum (i.e., NDLI and NDNI), which is in agreement with the heritability estimates in the different regions of the full spectrum (400–2,400 nm). SR10 (an index measured at 685 and 655 nm) has been previously related to chlorophylls and the efficient use of light (Chen et al., 2018). At the same time, the NPCI index was originally developed to evaluate the chlorophyll content in leaves (Peñuelas et al., 1994; Hatfield and Prueger, 2010). Previous studies have confirmed that this index could also be a proxy for the nitrogen content and could be an indicator of water stress and a good predictor of photochemical quenching (Filella et al., 1995; Huang et al., 2014; Maimaitiyiming et al., 2017). On the other hand, the NDLI index has been proposed as an indicator of lignin content (Serrano et al., 2002; Daughtry et al., 2004). However, it has also been considered to predict leaf biomass in rice (Cheng et al., 2017). According to Serrano et al. (2002), this index has been used as an indicator of the nitrogen content in leaves in different plants (Wang et al., 2016; Wang and Wei, 2016; Liang et al., 2018). In this context, most of the indices significantly correlated with HCN have been used to predict nitrogen content in leaves. According to Gleadow and Møller (2014), the content of cyanogenic glycosides is closely related to the availability of nitrogen in the soil, such that high doses of fertilizers increase the cyanogenic capacity of different crops. Relatedly, Gleadow and Woodrow (2000) and Simon et al. (2010) reported that the cyanogenic capacity of E. cladocalyx trees is increased with a greater availability of nitrogen in the soil (or substrate). Additionally, Mora-Poblete et al. (2021) reported a negative (although not significant) relationship between nitrogen and chlorophyll content (indirectly measured) and HCN in the leaves of E. cladocalyx. In fact, reflectance indices related to chlorophyll content can also be good predictors of nitrogen content in various tree species, such as Picea, Acer, and Sorbus (Wang et al., 2016).
Regarding the relationship between the NDLI index and HCN, the lignin content could be indirectly related to the HCN content. In fact, both prunasin and lignin correspond to secondary metabolites that come from a biosynthetic pathway that begins with the amino acid phenylalanine (Phe). The enzyme phenylalanine ammonia lyase converts Phe into cinnamate, which is subsequently converted to lignin through a series of enzymatic reactions. On the other hand, Phe can be hydroxylated by cytochrome P450 (in E. cladocalyx, CYP79A125; Hansen et al., 2018) and converted into phenylacetaldoxime, which is subsequently transformed into prunasin, suggesting a possible trade-off between the content of both secondary metabolites. On the other hand, these results should be interpreted with caution due to cyanogenic glycosides content competes for nitrogen with other nitrogen-containing compounds, since cyanogenic glycosides could account for up to 20% of leaf nitrogen (Hansen et al., 2018).
Prediction of HCN Content in Eucalyptus Leaves
In this study, the most suitable model used to predict HCN in leaves of cyanogenic Eucalyptus was the one that included high-throughput phenotyping data, genomic information (haplotypes), and genealogical data (pedigree) in terms of the goodness-of-fit measurements. According to the results, the spectral reflectance explained a higher percentage of the HCN variation than both genomic and pedigree data, considering a half-sib progeny test. Consistently with these findings, Krause et al. (2019) reported that the prediction ability of a model based on a hyperspectral reflectance-derived relationship matrix could be equal to or greater than a prediction model that combined the genomic selection (GS) model with pedigree-derived relationship matrix (GS-A) for grain yields in wheat. Furthermore, the model that combined the three types of information was slightly superior to the GS-A model. In the present study, the prediction ability for HCN content based on spectral reflectance data (Model 7) was superior to that obtained by models based on either genomic or pedigree data. However, the model combining spectral reflectance, pedigree, and haplotype-based genomic data (Model 13) was not superior to Model 7 (in terms of its prediction ability). Gonçalves et al. (2021) reported that the prediction ability based on the spectral reflectance of the fiber and sucrose content of sugarcane stems was superior to that of the model based on genomic data alone. However, the combined use of spectral reflectance and genomic data did not significantly increase the prediction ability of the studied traits.
The hypothesis that a high-throughput phenotyping platform could indirectly capture endophenotypic variants can be related to specialized metabolites. Therefore, the expectation of a more robust prediction, considering the spectral reflectance relationship among trees, was confirmed. This result is consistent with that of Rincent et al. (2018), who showed how predicting the heading date in wheat based on spectral data is more accurate than that based on genomic information. In the same study, however, the prediction of wood-related traits in Populus trees based on spectral data was less precise than that based on genomic data. In another study, Sandhu et al. (2021) reported that the genomic prediction ability for protein content and grain yield in wheat could be increased by 12 and 20%, respectively, through the combined use of genomic data and SRIs. Interestingly, in the present study, the use of SRIs as covariates in predictive models of HCN led to an increase in the goodness-of-fit measures and the predictive ability of the GS model (Model 3). Therefore, the predictive capacities of both information resources could depend on the trait under study, the species, and the type of population. In practice, all types of information available must be tested. Indeed, in another study, Rutkoski et al. (2016) demonstrated that the use of spectral reflectance indices could increase GS precision by up to 70% for predicting the grain yield of wheat. It should be noted that none of the models that combined genomic data and SRIs was superior to the model based exclusively on spectral data in terms of prediction ability. However, SRIs may be relevant to explain the variation of HCN and the accuracy of the HCN prediction may require the use of various regions of the reflectance spectrum. In forage sorghum (Sorghum bicolor), Fox et al. (2012) determined that using a predictive model that combined the visible, NIR, and SW-NIR spectra allowed a better estimation of HCN than a model based on a single region of spectral signatures.
In general, the concentrations of different compounds in plants are estimated through the combined use of NIR spectral data with variable selection and dimensionality reduction models, such as PLS (Assis et al., 2017; Rizvi et al., 2018; Sampaio et al., 2018). However, the PLS method and other related ones are challenging to implement in genetically structured populations (genetic structures) or trials with complete pedigree information (Rincent et al., 2018; Krause et al., 2019). These types of information are relevant to the prediction process. In the present study case, three different Bayesian regression methods were evaluated, in which the Bayes B model offered a better goodness-of-fit measurement and a slightly higher prediction ability compared with both the BRR and Bayes C models. Bayes B is a regression method with a hierarchical Bayesian approach that performs the selection and reduction of the predictor variables (Meuwissen et al., 2001). Some studies have found that Bayesian regression models, such as Bayes B, have been more accurate than the PLS method (Solberg et al., 2009; Ferragina et al., 2015). Moreover, Gonçalves et al. (2021), reported that the Bayes B method was up to approximately two times more accurate than PLS in predicting fiber and sucrose con-tent in sugarcane stems. According to several studies (Kainer et al., 2018; Thistlethwaite et al., 2019; Rio et al., 2021), the Bayesian methods, such as Bayes B, can improve the predictive ability in genome -based evaluations. For instance, Kainer et al. (2018) evaluated the ability of different genomic prediction models of eight traits related to foliar terpene yield in Eucalyptus polybractea, using three different marker densities. According to these authors, Bayes B method outperformed ABLUP and GBLUP methods in the 54% of the total cases (total: 24 cases; 3 marker densities and 8 traits).
The Bayes B method allows for the identification of the most relevant variables to explain the variation of a response variable. In the present study, seven subregions of the spectral reflectance curve had a relatively greater effect on HCN (i.e., two, one, one, and five subregions of the reflectance curve: visible, red edge, near infrared, and shortwave infrared, respectively). Consistently, the SRIs significantly associated with HCN (i.e., SR10, NPCI, NDLI, and NDNI) were calculated from reflectance measurements located within (and near) these seven subregions of the spectrum. For example, the SR10 and NPCI indices were calculated from reflectance measurements located between the visible spectrum and in the red edge region (680 and 685 nm), whereas the NDNI and NDLI indices were calculated from reflectance measurements at 1680, 1510, and 430 nm. Based on the HCN prediction analysis, considering the model with the best good-ness-of-fit (Model 13), the spectral reflectance data explained the highest percentage of the phenotypic variation (24%), whereas genomic and pedigree components explained ~22 and 12% of the total variations in HCN content, respectively. These results suggest that the variation in the HCN content in Eucalyptus leaves could be mainly related to the spectral signature of the individual rather than its genotypic characterization based on haplotypes. However, several factors could affect the accuracy of genomic prediction models, including marker density (Lorenz and Smith, 2015). The density of markers (either haplotype and/or SNPs) used in the present study could be considered relatively low compared to other species of the genus Eucalyptus (Ballesta et al., 2020). In this sense, a greater number of markers should be included to address a greater number of regions in the genome explaining the variation of a specialized metabolite, mainly due to the quantitative nature of its genetic control (Mora-Poblete et al., 2021). Genomic prediction models have shown low to high prediction ability of secondary metabolites in trees (Kainer et al., 2018; Yamashita et al., 2020; Mora-Poblete et al., 2021). Our study proposes an alternative method to predict specialized metabolites in plants, which may be relevant in an economic, ecological and/or human health context. Yamashita et al. (2020) developed genomic prediction models for tea quality-related metabolites in 150 tea accessions, in which six models produced moderate prediction accuracy values for epigallocatechin gallate and caffeine, and low for free amino acids and chlorophylls. Kainer et al. (2018), compared several methods of genomic prediction for eight traits related to foliar terpene yield in Eucalyptus polybractea. PA was higher for the individual compounds, such as foliar α-pinene and 1,8-cineole concentration, than total foliar oil concentration, but the PA was significantly increased as the number of markers increased in most traits.
Conclusion
This is the first study to examine heritable variation along the reflectance curve (in the range of 400–2,400 nm) and several spectral reflectance indices measured in leaves of a cyanogenic Eucalyptus. According to results, the variation in reflectance measurements and indices could be partially genetically driven, such that this information may help to recognize genotypes with specific chemical properties, such as cyanogenic capacity in leaves. The spectral reflectance of leaves of E. cladocalyx showed a relatively moderate level of genetic control. Consequently, we found four indices to be significantly associated with HCN content, all of which have been previously associated with nitrogen content in leaves, which agrees with previous studies examining the relationship between nitrogen and HCN in Eucalyptus leaves.
The model with the best fit to predict HCN content in leaves of cyanogenic Eucalyptus was the one that included spectral reflectance data, genomic information (haplotypes), and genealogical data (pedigree), as determined in terms of goodness-of-fit measures. In particular, its ability to predict HCN content in Eucalyptus leaves based exclusively on spectral data was superior to that obtained by models based on genomic and/or pedigree information. The strategy of prediction that integrates conventional pedigree information and genomic data, along with high-throughput phenotyping techniques, may be beneficial when the genome coverage is low or when the number of molecular markers is limited to predict a complex trait, such as secondary metabolites. Finally, we confirmed that the high-throughput phenotyping platforms indirectly capture endophenotypic variants related to secondary metabolites. Therefore, a more robust prediction can be made, considering the spectral reflectance in cyanogenic plant species.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Author Contributions
FM-P contributed to conceptualization, done funding acquisition, and provided resources. PB and FM-P provided methodology, performed writing—original draft preparation, contributed to project administration, and performed investigation. PB provided software and carried out formal analysis. PB, SA, GL, and FJ-A performed validation. GL, SA, DM-C, and FJ-A performed data curation. FM-P, FJ-A, GL, and DM-C performed writing—review and editing. GL, FM-P, and FJ-A done supervision. All authors contributed to the article and approved the submitted version.
Funding
This study was supported by FONDECYT (grant number 1201973).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.871943/full#supplementary-material
References
Aguirre, N. C., Filippi, C. V., Zaina, G., Rivas, J. G., Acuña, C. V., Villalba, P. V., et al. (2019). Optimizing ddRADseq in non-model species: a case study in Eucalyptus dunnii maiden. Agronomy 9:484. doi: 10.3390/agronomy9090484
Appenteng, M. K., Krueger, R., Johnson, M. C., Ingold, H., Bell, R., Thomas, A. L., et al. (2021). Cyanogenic glycoside analysis in American elderberry. Molecules 26:1384. doi: 10.3390/molecules26051384
Arriagada, O., do Amaral, A. T. Jr., and Mora, F. (2018). Thirteen years under arid conditions: exploring marker-trait associations in Eucalyptus cladocalyx for complex traits related to flowering, stem form and growth. Breed. Sci. 68, 367–374. doi: 10.1270/jsbbs.17131
Assis, C., Ramos, R. S., Silva, L. A., Kist, V., Barbosa, M. H. P., and Teofilo, R. F. (2017). Prediction of lignin content in different parts of sugarcane using near-infrared spectroscopy (NIR), ordered predictors selection (OPS), and partial least squares (PLS). Appl. Spectrosc. 71, 2001–2012. doi: 10.1177/0003702817704147
Balestrini, R., Brunetti, C., Cammareri, M., Caretto, S., Cavallaro, V., Cominelli, E., et al. (2021). Strategies to modulate specialized metabolism in mediterranean crops: from molecular aspects to field. Int. J. Mol. Sci. 22:2887. doi: 10.3390/ijms22062887
Ballesta, P., Bush, D., Silva, F. F., and Mora, F. (2020). Genomic predictions using low-density SNP markers, pedigree and GWAS information: a case study with the non-model species Eucalyptus cladocalyx. Plan. Theory 9:99. doi: 10.3390/plants9010099
Ballesta, P., Maldonado, C., Pérez-Rodríguez, P., and Mora, F. (2019). SNP and haplotype-based genomic selection of quantitative traits in Eucalyptus globulus. Plan. Theory 8:331. doi: 10.3390/plants8090331
Barrett, J. C., Fry, B., Maller, J., and Daly, M. J. (2005). Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21, 263–265. doi: 10.1093/bioinformatics/bth457
Brinker, A. M., and Seigler, D. S. (1989). Methods for the detection and quantitative determination of cyanide in plant materials. Phytochem. Bull. 21:24.
Čepl, J., Scholtz, V., and Scholtzová, J. (2019). Modeling of concentric pattern of Serratia marcescens colony. Arch. Microbiol. 201, 87–92. doi: 10.1007/s00203-018-1575-2
Čepl, J., Stejskal, J., Lhotáková, Z., Holá, D., Korecký, J., Lstibůrek, M., et al. (2018). Heritable variation in needle spectral reflectance of scots pine (Pinus sylvestris L.) peaks in red edge. Remote Sens. Environ. 219, 89–98. doi: 10.1016/j.rse.2018.10.001
Chen, S., Weng, Q., Li, F., Li, M., Zhou, C., and Gan, S. (2018). Genetic parameters for growth and wood chemical properties in Eucalyptus urophylla × E. tereticornis hybrids. Ann. For. Sci. 75, 1–11. doi: 10.1007/s13595-018-0694-x
Cheng, T., Song, R., Li, D., Zhou, K., Zheng, H., Yao, X., et al. (2017). Spectroscopic estimation of biomass in canopy components of paddy rice using dry matter and chlorophyll indices. Remote Sens. 9:319. doi: 10.3390/rs9040319
Contreras-Soto, R. I., Mora, F., de Oliveira, M. A. R., Higashi, W., Scapim, C. A., and Schuster, I. A. (2017). A genome-wide association study for agronomic traits in soybean using SNP markers and SNP-based haplotype analysis. PLoS One 12:e0171105. doi: 10.1371/journal.pone.0171105
Darbani, B., Motawia, M. S., Olsen, C. E., Nour-Eldin, H. H., Møller, B. L., and Rook, F. (2016). The biosynthetic gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor contains its co-expressed vacuolar MATE transporter. Sci. Rep. 6:79. doi: 10.1038/srep37079
Daughtry, C. S. T., Hunt, E. R. Jr., and McMurtrey Iii, J. E. (2004). Assessing crop residue cover using shortwave infrared reflectance. Remote Sens. Environ. 90, 126–134. doi: 10.1016/j.rse.2003.10.023
Del Cueto, J., Ionescu, I. A., Pičmanová, M., Gericke, O., Motawia, M. S., Olsen, C. E., et al. (2017). Cyanogenic glucosides and derivatives in almond and sweet cherry flower buds from dormancy to flowering. Front. Plant Sci. 8:800. doi: 10.3389/fpls.2017.00800
Denis, M., Favreau, B., Ueno, S., Camus-Kulandaivelu, L., Chaix, G., Gion, J.-M., et al. (2013). Genetic variation of wood chemical traits and association with underlying genes in Eucalyptus urophylla. Tree Genet. Genomes 9, 927–942. doi: 10.1007/s11295-013-0606-z
Ferragina, A., de Los Campos, G., Vazquez, A. I., Cecchinato, A., and Bittante, G. (2015). Bayesian regression models outperform partial least squares methods for predicting milk components and technological properties using infrared spectral data. Int. J. Dairy Sci. 98, 8133–8151. doi: 10.3168/jds.2014-9143
Filella, I., Serrano, L., Serra, J., and Peñuelas, J. (1995). Evaluating wheat nitrogen status with canopy reflectance indices and discriminant analysis. Crop Sci. 35, 1400–1405. doi: 10.2135/cropsci1995.0011183X003500050023x
Fox, G. P., O’Donnell, N. H., Stewart, P. N., and Gleadow, R. M. (2012). Estimating hydrogen cyanide in forage sorghum (Sorghum bicolor) by near-infrared spectroscopy. J. Agric. Food Chem. 60, 6183–6187. doi: 10.1021/jf205030b
Gianola, D., Fernando, R. L., and Stella, A. (2006). Genomic-assisted prediction of genetic value with semiparametric procedures. Genetics 173, 1761–1776. doi: 10.1534/genetics.105.049510
Gleadow, R. M., Haburjak, J., Dunn, J. E., Conn, M. E., and Conn, E. E. (2008). Frequency and distribution of cyanogenic glycosides in eucalyptus L’Hérit. Phytochemistry 69, 1870–1874. doi: 10.1016/j.phytochem.2008.03.018
Gleadow, R. M., and Møller, B. L. (2014). Cyanogenic glycosides: synthesis, physiology, and phenotypic plasticity. Annu. Rev. Plant Biol. 65, 155–185. doi: 10.1146/annurev-arplant-050213-040027
Gleadow, R. M., Vecchies, A. C., and Woodrow, I. E. (2003). Cyanogenic Eucalyptus nobilis is polymorphic for both prunasin and specific β-glucosidases. Phytochemistry 63, 699–704. doi: 10.1016/S0031-9422(03)00245-0
Gleadow, R. M., and Woodrow, I. E. (2000). Temporal and spatial variation in cyanogenic glycosides in Eucalyptus cladocalyx. Tree Physiol. 20, 591–598. doi: 10.1093/treephys/20.9.591
Gleadow, R. M., and Woodrow, I. E. (2002). Defense chemistry of cyanogenic Eucalyptus cladocalyx seedlings is affected by water supply. Tree Physiol. 22, 939–945. doi: 10.1093/treephys/22.13.939
Gonçalves, M. T. V., Morota, G., de Costa, P. M. A., Vidigal, P. M. P., Barbosa, M. H. P., and Peternelli, L. A. (2021). Near-infrared spectroscopy outperforms genomics for predicting sugarcane feedstock quality traits. PLoS One 16:e0236853. doi: 10.1371/journal.pone.0236853
Goodger, J. Q. D., Ades, P. K., and Woodrow, I. E. (2004). Cyanogenesis in Eucalyptus polyanthemos seedlings: heritability, ontogeny and effect of soil nitrogen. Tree Physiol. 24, 681–688. doi: 10.1093/treephys/24.6.681
Goodger, J. Q. D., Capon, R. J., and Woodrow, I. E. (2002). Cyanogenic polymorphism in Eucalyptus polyanthemos Schauer subsp. vestita L. Johnson and K. Hill (Myrtaceae). Biochem. Syst. Ecol. 30, 617–630. doi: 10.1016/S0305-1978(01)00141-7
Guerra, F. P., Richards, J. H., Fiehn, O., Famula, R., Stanton, B. J., Shuren, R., et al. (2016). Analysis of the genetic variation in growth, ecophysiology, and chemical and metabolomic composition of wood of Populus trichocarpa provenances. Tree Genet. Genomes 12:6. doi: 10.1007/s11295-015-0965-8
Habier, D., Fernando, R. L., Kizilkaya, K., and Garrick, D. J. (2011). Extension of the Bayesian alphabet for genomic selection. BMC Bioinform. 12:186. doi: 10.1186/1471-2105-12-186
Hansen, C. C., Sørensen, M., Veiga, T. A. M., Zibrandtsen, J. F. S., Heskes, A. M., Olsen, C. E., et al. (2018). Reconfigured cyanogenic glucoside biosynthesis in Eucalyptus cladocalyx involves a cytochrome P450 CYP706C55. Plant Physiol. 178, 1081–1095. doi: 10.1104/pp.18.00998
Hatfield, J. L., and Prueger, J. H. (2010). Value of using different vegetative indices to quantify agricultural crop characteristics at different growth stages under varying management practices. Remote Sens. 2, 562–578. doi: 10.3390/rs2020562
Huang, W., Yang, Q., Pu, R., and Yang, S. (2014). Estimation of nitrogen vertical distribution by bidirectional canopy reflectance in winter wheat. Sensors 14, 20347–20359. doi: 10.3390/s141120347
Kainer, D., Stone, E. A., Padovan, A., Foley, W. J., and Külheim, C. (2018). Accuracy of genomic prediction for foliar terpene traits in eucalyptus polybractea. G3: genes. Genom. Genetics 8, 2573–2583. doi: 10.1534/g3.118.200443
Krause, M. R., González-Pérez, L., Crossa, J., Pérez-Rodríguez, P., Montesinos-López, O., Singh, R. P., et al. (2019). Hyperspectral reflectance-derived relationship matrices for genomic prediction of grain yield in wheat. Genes Genom. Genet. 9, 1231–1247. doi: 10.1534/g3.118.200856
Lebedev, V. G., Lebedeva, T. N., Chernodubov, A. I., and Shestibratov, K. A. (2020). Genomic selection for forest tree improvement: methods, achievements and perspectives. Forests 11:1190. doi: 10.3390/f11111190
Lehnert, L. W., Meyer, H., Obermeier, W. A., Silva, B., Regeling, B., and Bendix, J. (2018). Hyperspectral data analysis in R: the hsdar package. J. Stat. Softw. 89:877. doi: 10.18637/jss.v089.i12
Li, Y., Sun, Y., Jiang, J., and Liu, J. (2019). Spectroscopic determination of leaf chlorophyll content and color for genetic selection on sassafras tzumu. Plant Methods 15, 73–11. doi: 10.1186/s13007-019-0458-0
Liang, L., Di, L., Huang, T., Wang, J., Lin, L., Wang, L., et al. (2018). Estimation of leaf nitrogen content in wheat using new hyperspectral indices and a random forest regression algorithm. Remote Sens. 10:1940. doi: 10.3390/rs10121940
Liu, P., Liu, Z., Hu, Y., Shi, Z., Pan, Y., Wang, L., et al. (2019). Integrating a hybrid back propagation neural network and particle swarm optimization for estimating soil heavy metal contents using hyperspectral data. Sustainability 11:419. doi: 10.3390/su11020419
Lobos, G. A., and Poblete-Echeverría, C. (2017). Spectral knowledge (SK-UTALCA): software for exploratory analysis of high-resolution spectral reflectance data on plant breeding. Front. Plant Sci. 7:1996. doi: 10.3389/fpls.2016.01996
Lorenz, A., and Smith, K. P. (2015). Adding genetically distant individuals to training populations reduces genomic prediction accuracy in barley. Crop Sci. 55, 2657–2667. doi: 10.2135/cropsci2014.12.0827
Maimaitiyiming, M., Ghulam, A., Bozzolo, A., Wilkins, J. L., and Kwasniewski, M. T. (2017). Early detection of plant physiological responses to different levels of water stress using reflectance spectroscopy. Remote Sens. 9:745. doi: 10.3390/rs9070745
McKown, A. D., Guy, R. D., Klápště, J., Geraldes, A., Friedmann, M., Cronk, Q. C. B., et al. (2014). Geographical and environmental gradients shape phenotypic trait variation and genetic structure in Populus trichocarpa. New Phytol. 201, 1263–1276. doi: 10.1111/nph.12601
Meuwissen, T. H., Hayes, B. J., and Goddard, M. E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829. doi: 10.1093/genetics/157.4.1819
Mieres-Castro, D., Ahmar, S., Shabbir, R., and Mora-Poblete, F. (2021). Antiviral activities of eucalyptus essential oils: their effectiveness as therapeutic targets against human viruses. Pharmaceuticals 14:1210. doi: 10.3390/ph14121210
Møller, B. L. (2010). Functional diversifications of cyanogenic glucosides. Curr. Opin. Plant Biol. 13, 337–346. doi: 10.1016/j.pbi.2010.01.009
Mora-Poblete, F., Ballesta, P., Lobos, G. A., Molina-Montenegro, M., Gleadow, R., Ahmar, S., et al. (2021). Genome-wide association study of cyanogenic glycosides, proline, sugars, and pigments in Eucalyptus cladocalyx after 18 consecutive dry summers. Physiol. Plant. 172, 1550–1569. doi: 10.1111/ppl.13349
Neilson, E. H. J., Goodger, J. Q. D., Motawia, M. S., Bjarnholt, N., Frisch, T., Olsen, C. E., et al. (2011). Phenylalanine derived cyanogenic diglucosides from Eucalyptus camphora and their abundances in relation to ontogeny and tissue type. Phytochemistry 72, 2325–2334. doi: 10.1016/j.phytochem.2011.08.022
Neilson, E. H., Goodger, J. Q. D., and Woodrow, I. E. (2006). Novel aspects of cyanogenesis in Eucalyptus camphora subsp. humeana. Funct. Plant Biol. 33, 487–496. doi: 10.1071/FP05293
Paaso, U., Keski-Saari, S., Keinänen, M., Karvinen, H., Silfver, T., Rousi, M., et al. (2017). Intrapopulation genotypic variation of foliar secondary chemistry during leaf senescence and litter decomposition in silver birch (Betula pendula). Front. Plant Sci. 8:1074. doi: 10.3389/fpls.2017.01074
Peñuelas, J., Gamon, J. A., Fredeen, A. L., Merino, J., and Field, C. B. (1994). Reflectance indices associated with physiological changes in nitrogen-and water-limited sunflower leaves. Remote Sens. Environ. 48, 135–146. doi: 10.1016/0034-4257(94)90136-8
Pérez, P., and De los Campos, G. (2014). BGLR: a statistical package for whole genome regression and prediction. Genetics 198, 483–495. doi: 10.1534/genetics.114.164442
Pičmanová, M., Neilson, E. H., Motawia, M. S., Olsen, C. E., Agerbirk, N., Gray, C. J., et al. (2015). A recycling pathway for cyanogenic glycosides evidenced by the comparative metabolic profiling in three cyanogenic plant species. Biochem. J. 469, 375–389. doi: 10.1042/BJ20150390
Pritchard, J. K., Stephens, M., and Donnelly, P. (2000). Inference of population structure using multilocus genotype data. Genetics 155, 945–959. doi: 10.1093/genetics/155.2.945
Pryce, J. E., Johnston, J., Hayes, B. J., Sahana, G., Weigel, K. A., McParland, S., et al. (2014). Imputation of genotypes from low density (50,000 markers) to high density (700,000 markers) of cows from research herds in Europe, North America, and Australasia using 2 reference populations. Int. J. Dairy Sci. 97, 1799–1811. doi: 10.3168/jds.2013-7368
Qian, X., Zhang, Y., Liu, L., and Du, S. (2019). Exploring the potential of leaf reflectance spectra for retrieving the leaf maximum carboxylation rate. Int. J. Remote Sens. 40, 5411–5428. doi: 10.1080/01431161.2019.1579940
Raper, T. B., and Varco, J. J. (2015). Canopy-scale wavelength and vegetative index sensitivities to cotton growth parameters and nitrogen status. Precis. Agric. 16, 62–76. doi: 10.1007/s11119-014-9383-4
Rincent, R., Charpentier, J.-P., Faivre-Rampant, P., Paux, E., Le Gouis, J., Bastien, C., et al. (2018). Phenomic selection is a low-cost and high-throughput method based on indirect predictions: proof of concept on wheat and poplar. Genes Genom. Genet. 8, 3961–3972. doi: 10.1534/g3.118.200760
Rio, S., Gallego-Sánchez, L., Montilla-Bascón, G., Canales, F. J., Sánchez, J. I. Y. S., and Prats, E. (2021). Genomic prediction and training set optimization in a structured Mediterranean oat population. Theor. Appl. Genetics 134, 3595–3609. doi: 10.1007/s00122-021-03916-w
Rizvi, T. S., Mabood, F., Ali, L., Al-Broumi, M., Al Rabani, H. K. M., Hussain, J., et al. (2018). Application of NIR spectroscopy coupled with PLS regression for quantification of total polyphenol contents from the fruit and aerial parts of Citrullus colocynthis. Phytochem. Anal. 29, 16–22. doi: 10.1002/pca.2710
Rosati, V. C., Blomstedt, C. K., Møller, B. L., Garnett, T., and Gleadow, R. (2019). The interplay between water limitation, dhurrin, and nitrate in the low-cyanogenic sorghum mutant adult cyanide deficient class 1. Front. Plant Sci. 10:1458. doi: 10.3389/fpls.2019.01458
Rutkoski, J., Poland, J., Mondal, S., Autrique, E., Pérez, L. G., Crossa, J., et al. (2016). Canopy temperature and vegetation indices from high-throughput phenotyping improve accuracy of pedigree and genomic selection for grain yield in wheat. Genes Genom. Genet. 6, 2799–2808. doi: 10.1534/g3.116.032888
Sampaio, P. S., Soares, A., Castanho, A., Almeida, A. S., Oliveira, J., and Brites, C. (2018). Optimization of rice amylose determination by NIR-spectroscopy using PLS chemometrics algorithms. Food Chem. 242, 196–204. doi: 10.1016/j.foodchem.2017.09.058
Sandhu, K. S., Mihalyov, P. D., Lewien, M. J., Pumphrey, M. O., and Carter, A. H. (2021). Combining genomic and phenomic information for predicting grain protein content and grain yield in spring wheat. Front. Plant Sci. 12:613300. doi: 10.3389/fpls.2021.613300
Schlerf, M., Atzberger, C., Hill, J., Buddenbaum, H., Werner, W., and Schüler, G. (2010). Retrieval of chlorophyll and nitrogen in Norway spruce (Picea abies L. karst.) using imaging spectroscopy. Int. J. Appl. Earth Obs. Geoinf. 12, 17–26. doi: 10.1016/j.jag.2009.08.006
Serrano, L., Peñuelas, J., and Ustin, S. L. (2002). Remote sensing of nitrogen and lignin in Mediterranean vegetation from AVIRIS data: decomposing biochemical from structural signals. Remote Sens. Environ. 81, 355–364. doi: 10.1016/S0034-4257(02)00011-1
Simon, J., Gleadow, R. M., and Woodrow, I. E. (2010). Allocation of nitrogen to chemical defence and plant functional traits is constrained by soil N. Tree Physiol. 30, 1111–1117. doi: 10.1093/treephys/tpq049
Solberg, I. C., Lygren, I., Jahnsen, J., Aadland, E., Høie, O., Cvancarova, M., et al. (2009). Clinical course during the first 10 years of ulcerative colitis: results from a population-based inception cohort (IBSEN Study). Scand. J. Gastroenterol. 44, 431–440. doi: 10.1080/00365520802600961
Spiegelhalter, D. J., Best, N. G., Carlin, B. P., and Van Der Linde, A. (2002). Bayesian measures of model complexity and fit. J. R. Stat. Soc. Series B Stat. Methodol. 64, 583–639. doi: 10.1111/1467-9868.00353
Stevens, A., and Ramirez-Lopez, L. (2014). An introduction to the prospectr package. R package vignette, Report No.: R Package Version 0.1, 3.
Sun, Z., Zhang, K., Chen, C., Wu, Y., Tang, Y., Georgiev, M. I., et al. (2018). Biosynthesis and regulation of cyanogenic glycoside production in forage plants. Appl. Microbiol. Biotechnol. 102, 9–16. doi: 10.1007/s00253-017-8559-z
Thistlethwaite, F. R., Ratcliffe, B., Klápště, J., Porth, I., Chen, C., Stoehr, M. U., et al. (2019). Genomic selection of juvenile height across a single-generational gap in Douglas-fir. Heredity 122, 848–863. doi: 10.1038/s41437-018-0172-0
Thodberg, S., Del Cueto, J., Mazzeo, R., Pavan, S., Lotti, C., Dicenta, F., et al. (2018). Elucidation of the amygdalin pathway reveals the metabolic basis of bitter and sweet almonds (Prunus dulcis). Plant Physiol. 178, 1096–1111. doi: 10.1104/pp.18.00922
Türker-Kaya, S., and Huck, C. W. (2017). A review of mid-infrared and near-infrared imaging: principles, concepts and applications in plant tissue analysis. Molecules 22:168. doi: 10.3390/molecules22010168
Valenzuela, C. E., Ballesta, P., Ahmar, S., Fiaz, S., Heidari, P., Maldonado, C., et al. (2021). Haplotype-and SNP-based GWAS for growth and wood quality traits in Eucalyptus cladocalyx trees under arid conditions. Plan. Theory 10:148. doi: 10.3390/plants10010148
Wang, Z., Wang, T., Darvishzadeh, R., Skidmore, A. K., Jones, S., Suarez, L., et al. (2016). Vegetation indices for mapping canopy foliar nitrogen in a mixed temperate forest. Remote Sens. 8:491. doi: 10.3390/rs8060491
Wang, L., and Wei, Y. (2016). Revised normalized difference nitrogen index (NDNI) for estimating canopy nitrogen concentration in wetlands. Optik 127, 7676–7688. doi: 10.1016/j.ijleo.2016.05.115
Woodrow, I. E., Slocum, D. J., and Gleadow, R. M. (2002). Influence of water stress on cyanogenic capacity in Eucalyptus cladocalyx. Funct. Plant Biol. 29, 103–110. doi: 10.1071/PP01116
Yactayo-Chang, J. P., Tang, H. V., Mendoza, J., Christensen, S. A., and Block, A. K. (2020). Plant defense chemicals against insect pests. Agronomy 10:1156. doi: 10.3390/agronomy10081156
Yamaguchi, T., Yamamoto, K., and Asano, Y. (2014). Identification and characterization of CYP79D16 and CYP71AN24 catalyzing the first and second steps in l-phenylalanine-derived cyanogenic glycoside biosynthesis in the Japanese apricot, Prunus mume Sieb. Et Zucc. Plant Mol. Biol. 86, 215–223. doi: 10.1007/s11103-014-0225-6
Yamashita, H., Uchida, T., Tanaka, Y., Katai, H., Nagano, A. J., Morita, A., et al. (2020). Genomic predictions and genome-wide association studies based on RAD-seq of quality-related metabolites for the genomics-assisted breeding of tea plants. Sci. Rep. 10, 17480–17410. doi: 10.1038/s41598-020-74623-7
Zarco-Tejada, P. J., Hornero, A., Beck, P. S. A., Kattenborn, T., Kempeneers, P., and Hernández-Clemente, R. (2019). Chlorophyll content estimation in an open-canopy conifer forest with sentinel-2A and hyperspectral imagery in the context of forest decline. Remote Sens. Environ. 223, 320–335. doi: 10.1016/j.rse.2019.01.031
Zeng, L., Zhou, X., Liao, Y., and Yang, Z. (2020). Roles of specialized metabolites in biological function and environmental adaptability of tea plant (Camellia sinensis) as a metabolite studying model. J. Adv. Res. 34, 159–171. doi: 10.1016/j.jare.2020.11.004
Zenk, M. H., and Juenger, M. (2007). Evolution and current status of the phytochemistry of nitrogenous compounds. Phytochemistry 68, 2757–2772. doi: 10.1016/j.phytochem.2007.07.009
Keywords: genomic and phenomic prediction, genomic heritability, defense chemistry, spectral reflectance indexes, spectroradiometer, specialized metabolite
Citation: Ballesta P, Ahmar S, Lobos GA, Mieres-Castro D, Jiménez-Aspee F and Mora-Poblete F (2022) Heritable Variation of Foliar Spectral Reflectance Enhances Genomic Prediction of Hydrogen Cyanide in a Genetically Structured Population of Eucalyptus. Front. Plant Sci. 13:871943. doi: 10.3389/fpls.2022.871943
Edited by:
Nunzio D’Agostino, University of Naples Federico II, ItalyReviewed by:
Vadim Lebedev, Branch of Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, RussiaKaransher Singh Sandhu, Bayer Crop Science (United States), United States
Copyright © 2022 Ballesta, Ahmar, Lobos, Mieres-Castro, Jiménez-Aspee and Mora-Poblete. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Freddy Mora-Poblete, bW9yYXBvYmxldGVAZ21haWwuY29t