- 1Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
- 2Agriculture Victoria, Grains Innovation Park, Horsham, VIC, Australia
- 3School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
Genomic selection (GS) uses associations between markers and phenotypes to predict the breeding values of individuals. It can be applied early in the breeding cycle to reduce the cross-to-cross generation interval and thereby increase genetic gain per unit of time. The development of cost-effective, high-throughput genotyping platforms has revolutionized plant breeding programs by enabling the implementation of GS at the scale required to achieve impact. As a result, GS is becoming routine in plant breeding, even in minor crops such as pulses. Here we examined 2,081 breeding lines from Agriculture Victoria’s national lentil breeding program for a range of target traits including grain yield, ascochyta blight resistance, botrytis grey mould resistance, salinity and boron stress tolerance, 100-grain weight, seed size index and protein content. A broad range of narrow-sense heritabilities was observed across these traits (0.24-0.66). Genomic prediction models were developed based on 64,781 genome-wide SNPs using Bayesian methodology and genomic estimated breeding values (GEBVs) were calculated. Forward cross-validation was applied to examine the prediction accuracy of GS for these targeted traits. The accuracy of GEBVs was consistently higher (0.34-0.83) than BLUP estimated breeding values (EBVs) (0.22-0.54), indicating a higher expected rate of genetic gain with GS. GS-led parental selection using early generation breeding materials also resulted in higher genetic gain compared to BLUP-based selection performed using later generation breeding lines. Our results show that implementing GS in lentil breeding will fast track the development of high-yielding cultivars with increased resistance to biotic and abiotic stresses, as well as improved seed quality traits.
1 Introduction
Lentil is a self-pollinating, diploid (2n = 14), cool season legume crop grown in temperate climates, with world production increased from 3.15 to 6.54 million metric tons in the last two decades (Kaale et al., 2023). Canada is a major producer of lentils, contributing to more than 50% to global trade, while Australia makes up another 10-15% (Sadras et al., 2021). In 2020-21, Australia produced 900,000 tons of lentil of which 864,403 tons were exported, accounting for 13.3% of global trade (ABARES, 2021; Semmler, 2021). Over the past 3 decades, the National Australian lentil breeding program achieved an average annual genetic gain of 1.2% using conventional breeding and management practices (Sadras et al., 2021). These findings align with global trends in lentil productivity over the past six decades, as reported by Kumar et al. (2021). Despite this good rate of genetic gain, it is below the level required to meet increasing global demand caused by population growth, as well as the changing dietary habits in western countries, where people are opting for nutritious, sustainable, and healthier foods. Lentil grain has about 26% protein content, which makes it an attractive choice for plant-based diets (Ismail et al., 2020). To keep up with this ever-increasing demand, it is critical to increase and stabilize lentil crop production through the: 1) development of varieties with high yield potential; 2) cropping area expansion; and 3) continuous germplasm improvement to withstand changing climatic conditions.
Recent studies have demonstrated considerable yield variations in lentil cultivation across diverse environments in Australia, with an average production of 1.2 tons/ha (Sadras et al., 2021; Suri et al., 2022). These variations arise primarily from the influence of various biotic and environmental stress factors. Of these, ascochyta blight, botrytis grey mould, boron and salinity have been identified as key limiting factors across a wide range of production regions (Davidson et al., 2016; Sudheesh et al., 2016; Rodda et al., 2017; Rodda et al., 2018; Dissanayake et al., 2020; Kumar et al., 2021) that can result in yield losses of 30-40% (Kumar et al., 2013). Climate variability and complex genotype-by-environment (G × E) interactions on the expression of phenotypic traits also contribute to low genetic gain in lentil breeding (Kumar and Ali, 2006; Kumar et al., 2020). As more than 95% of the Australian lentil crop is exported, grain quality traits such as grain weight, seed size and protein content are also important in meeting market demands. The abovementioned traits, as well as crop yield improvement, remain the primary focus of lentil breeding in Australia.
The advent of DNA markers opened new opportunities in plant breeding that have enabled breeders to make more informed and accurate selections through marker-assisted selection (MAS) (Kaur et al., 2014; Jain et al., 2017). In lentils, quantitative trait loci have been identified for ascochyta blight resistance, boron and salinity toxicity tolerance, yield, winter hardiness, seed weight, seed size and milling quality (Taylor et al., 2006; Barrios et al., 2007; Kaur et al., 2014; Verma et al., 2015; Sudheesh et al., 2016; Singh et al., 2017; Subedi et al., 2018; Singh et al., 2020). While some success has been achieved in the use of MAS to accelerate genetic gain in breeding programs, it is neither effective nor practical for quantitative traits, which are controlled by multiple genes with minor effects (Galli et al., 2020).
GS uses genome-wide marker information to estimate the genetic potential of an individual and has emerged as a dominant approach for genetic improvement in plant and livestock breeding, where selections are made based on genomic estimated breeding values (GEBVs) (Meuwissen et al., 2001). GS involves developing genomic prediction equations using genotyping and phenotyping data obtained from a training population, which are then used to estimate the GEBVs of individuals in a testing population that have not been phenotyped. GS is more beneficial for traits with low heritability and that are difficult to measure. It can outperform conventional phenotypic and MAS in terms of genetic gain per unit time and cost (Jannink et al., 2010). The primary advantage of GS is the ability to predict the phenotypic performance of individuals earlier in the breeding cycle, which reduces the cross-to-cross generation interval and increases genetic gain (Lin et al., 2021). GS was shown to provide a three-fold and two-fold increase in genetic gain in maize and wheat, respectively, compared to MAS (Heffner et al., 2010). The adoption of GS in maize breeding has reduced overall breeding costs by 30-50% (Crossa et al., 2017; Beyene et al., 2019). Other studies using simulated data have shown GS is superior to phenotypic selection, both in terms of shortening the breeding interval and increasing genetic gain per unit of time (Gaynor et al., 2017; Gorjanc et al., 2018; Li et al., 2022). Previous studies on lentils have also revealed that GS can increase genetic gain for economically important agronomic traits by reducing both cost and breeding cycle time (Haile et al., 2020; Janghel and Sharma, 2022; Li et al., 2022). Additionally, GS has been also studied in other grain legumes, such as common bean, field pea, and chickpea, as evidenced by several published studies (Roorkiwal et al., 2018; Annicchiarico et al., 2019; Keller et al., 2020; Diaz et al., 2021). However, it is worth noting that these studies have limited discussions on the practical implications of implementing GS in crop breeding.
Another advantage of GS is that it can handle G×E interactions effectively, allowing breeders to select germplasm across multiple environments. Several studies have assessed the impact of incorporating G×E into genomic prediction models and reported an increase in prediction accuracy. For example, a multi-environment study in barley showed an increase in prediction accuracy from 0.37 to 0.45 when G×E was included in the prediction model (Lin et al., 2021). Other studies have shown that the incorporation of environmental covariates and crop models into genomic prediction models can increase prediction accuracy by up to 11% in both tested and untested environments (Heslot et al., 2014; Jarquín et al., 2017). More recently, (Jighly et al., 2021) described an extension to existing GS models that incorporates genotype plus genotype-environment (GGE) analysis, which provided a 70% increase in prediction accuracy compared to GS models that only included G×E.
In the current study, we utilised historical phenotyping data collected across 10 years (2010-2020) for key target traits from Agriculture Victoria’s national lentil breeding program (NLBP). The objectives of this study were to: 1) develop genomic prediction models for target traits and calculate GEBVs to make selections, and 2) compare the expected rate of change in genetic gain per unit of time using GS and BLUP-based phenotypic selections.
2 Materials and methods
A total of 2,081 lentil genotypes including advanced breeding lines – Stage 2 (F4:F7, 1,496 genotypes) and Stage 3 (F4:F8, 569 genotypes) and 16 commercial cultivars sourced from the NLBP were used as a training population (Table 1). Phenotyping data for nine traits were sourced including: grain yield (GYD), ascochyta blight (AB) resistance (two pathovars; one associated with lentil variety PBA Hurricane XT (ABH), and another one with variety Nipper (ABN)), boron (BOR) tolerance, salinity (SAL) tolerance, botrytis grey mould (BGM) resistance, 100-grain weight (GWT), seed size index (SSI) and protein content (PRO). GYD and BGM were evaluated under field conditions whereas ABN, ABH, BOR and SAL were assessed under controlled environment conditions. Grain quality traits PRO, SSI and GWT were measured in a seed phenomics laboratory. All the phenotyping experiments included the use of released cultivars as checks and controls. A total of 241 individual experiments from 16 locations were analysed in this study. Table 1 summarises the traits investigated in this study.
Table 1 A descriptive summary of the total number of observations and trials for each trait assessed in the current study.
2.1 Trait phenotyping
2.1.1 Grain yield
GYD was evaluated in 132 field trials across 16 locations over 20 growing seasons between 2010 and 2020 in four Australian states: South Australia (SA), Victoria (VIC), New South Wales (NSW) and Western Australia (WA) (Supplementary Table 1). These trials followed a randomised complete block design with released varieties as checks (at least five checks per trial) and were managed under rainfed conditions. Stage 2 trials were replicated twice, whilst stage 3 trials were replicated three times. GYD was expressed in tons/ha and extrapolated from the plot harvest. Each plot size was 1.25 m (wide) × 5 m (long) (6.25 m² area) with 0.25 m spacing between rows. A summary of the GYD trials is provided in Supplementary Table 2.
2.1.2 Ascochyta blight resistance screening
AB screening was performed as described in (Davidson et al., 2016; Sudheesh et al., 2016) with minor modifications using two different Ascochyta lentis isolates (virulent to PBA Hurricane XT and Nipper lentil varieties) (Supplementary Table 3). Seeds of each lentil line were seeded into two replications per line, with two to three seeds per pot, and then placed in plastic tents (160×80×80 cm) in a controlled environment room (CER) at 15°C, 12/12-h light/dark. Trials were set up in a randomised complete block design, with one replication per tent and susceptible and resistant check plants included in the set. Experiments used a randomised complete block design, with one replicate per tent with susceptible and resistant checks included. Seedlings were inoculated two weeks after sowing and disease symptoms on each seedling were assessed and visually scored 10-14 days later as percentage area of plant disease (% APD).
2.1.3 Boron tolerance screening
BOR toxicity tolerance was assessed using a hydroponic screening method as described in (Rodda et al., 2018)), where six plants per genotype were grown in trays of peat plugs floating in a dilute boric acid solution for two weeks. Lentil varieties previously characterized as intolerant (PBA Blitz and Cassab) and tolerant (ILL2024) were used as internal controls to determine the ideal assessment time. The genotypes were scored based on percentage necrosis using a 0-100 scale, where low score values indicated tolerance and high score values indicated susceptibility to boron toxicity.
2.1.4 Salt tolerance screening
A pot-based salinity tolerance screening method was used as described in (Dissanayake et al., 2020). In brief, the study included 1,755 lines with each experiment conducted as a randomized complete block design with two replications and six plants per pot. Salt was applied in the form of diluted commercial nutrient solution (Nitrosol®, Amgrow Pty. Ltd., Lidcombe, New South Wales, Australia (nitrogen:phosphorous:potassium (NPK) 4:1:3) and Ca(NO3)2H2O at 20% of the recommended concentration) in increments of 2 ds/m per day to reach the desired level of 6 ds/m. Each plant’s response to salinity stress was assessed 10 weeks post-sowing using the visual growth response scale (1-10) developed by Maher and Connor (2003). The average growth response scale values for each pot were used for analysis.
2.1.5 Botrytis grey mould tolerance screening
BGM resistance was assessed in the field under natural infection conditions that occurred in GYD field trials (stages 2 and 3) over two years (four locations in 2013 and seven locations in 2016). The Horsham, Melton, Mallala, and Williamulka experiments were scored in 2013, and Beulah, Curyo, Kadina, Mallala, Melton, Rupanyup, and Wagga Wagga were scored in 2016. Each experiment used a randomised complete block design with two and three replications for stages 2 and 3 genotypes, respectively. Disease symptoms were assessed on individual plots using a 1-9 scoring scale, where 1 indicated no infection and 9 indicated over 50% of plants within a plot were infected.
2.1.6 Grain quality traits screening
Grain quality traits were assessed in the seed phenomics laboratory, Horsham, Victoria. SSI was obtained using the EyeFoss™ (FOSS Analytical, Hoganas, Sweden) image analysis as described in LeMasurier et al. (2014). Near-infrared spectroscopy (NIRS) was used to determine PRO, whereas GWT was calculated as described by (Revilla et al., 2019).
2.2 Genotyping and SNP data calling
All stage 2 and 3 materials (2010-2020) were genotyped using the imputation enabled multispecies pulse 30K SNP array, containing 10,528 lentil specific SNPs. In brief, DNA extractions were performed from 6 seeds per sample using a modified CTAB protocol (Tibbits et al., 2008). A total of 200 ng DNA per sample was used for the genotyping assay following the manufacturer’s protocols for the Infinium XT SNP bead chip array (Illumina Inc., San Diego, USA). Initial analysis was performed using GenomeStudio 2.0 Polyploid software (Illumina) using the manufacturer’s supplied crop-specific SNP manifest file. Theta and normalized R values were exported from GenomeStudio and used to call SNP using the custom genotype calling pipeline. Phasing and filling of missing data was performed using Eagle/Beagle (Browning and Browning, 2007; Browning and Browning, 2016). Imputation to the whole genome sequence level (3,528,788 SNPs) was achieved using Minimac3 (Das et al., 2016) and whole genome sequence (WGS)-based reference haplotypes. The imputed SNP data was filtered for linkage disequilibrium (r2 > 0.99) with a window size of 250kb, minor allelic frequency (MAF<0.05), heterozygosity per SNP (>20%) and heterozygosity per sample (>50%) using vcftools (Danecek et al., 2011) and bcftools (Li, 2011). A final set of 64,781 filtered SNPs were used for GS. Due to the usage of proprietary breeding material in this study, the genotype and marker names have been de-identified in the genotyping data file (Supplementary text file 1). However, the distribution of SNP markers along the lentil genome is shown in Supplementary Figure 1.
2.3 Grain yield clustering
Pairwise genetic correlations calculated between different experiments across multiple environments were found to be quite low. Consequently, as an alternative approach to include G×E in the analysis, six environmental variables (daily evaporation (mm), solar radiation (MJ/m2), rainfall (mm), vapour pressure (KPa), and minimum and maximum temperatures (degree in Celsius)) were used to cluster the GYD trials. Data for the environmental variables corresponding to each of the 132 GYD trials was obtained from the Scientific Information for Land Owners (SILO) (https://www.longpaddock.qld.gov.au/silo/) for the duration of lentil growing season (May- early December; 2010-2020), with the nearest meteorological station selected. The number of clusters was then determined by applying hierarchical clustering to the environmental variable data. For hierarchical clustering, dissimilarity values between clusters were computed using the Euclidean distance (dE) in (Equation 1), (Fox and Rosielle, 1982).
where Xik is the value for environmental variable k for cluster i,Xjk is the value for environmental variable k for cluster j where i and j = 1,…,4, and k = 1,…,6. We applied the hclust function in R to implement clustering by the distance matrix (Smoliński et al., 2002).
2.4 Statistical analyses
2.4.1 Spatial adjustment
The broad-sense heritability (H2) of traits measured from each trial was estimated using the linear mixed model shown in Equation 2.
where y is the vector of phenotypic observations, b is a vector of fixed effects (the population mean and replicates), and g is a vector of random total genetic effects with a normally-distributed variance structure , where is the total genetic variance and I is the identity matrix; and e is a vector of residuals with a variance matrix R. X and Z are incidence matrices that link phenotypic observations to the fixed and random effects, respectively. The residual variance matrix R is decomposed into spatially dependent () and spatially independent (η) residuals by fitting autoregression (AR) of rows and columns with the formula below in Equation 3 (Dutkowski et al., 2006):
where is the spatially dependent residual variance, is the spatially independent residual variance, and are the autoregression parameters on column and row, respectively.
The broad-sense heritability was calculated as . For genomic analyses, the best linear unbiased estimator (BLUE) of individuals tested within a trial was estimated using (Equation 2) by fitting individuals as fixed effects and no random effects.
2.4.2 Genomic selection analyses
The vector of BLUEs of measured traits were modelled with a general form of the mixed linear models as shown in Equation 4:
where b is a vector of fixed effects (the population mean), a is a vector of the additive genetic effects and e is a vector of the residual effects. X and Z are incidence matrices that link phenotypic observations to the fixed and random effects, respectively. For genomic selection, the marker effects were estimated first in the training population and then genomic breeding values of the validation population were calculated. The marker effects were estimated using a linear mixed model shown in Equation 4 and implemented with BayesR package developed by (Breen et al., 2022).
The vector of the additive genetic effects a are the SNP effects in BayesR with an incidence matrix Z with , where is the genotype of individual i at SNP j. The SNP effects were modelled by a mixture of four categories of distributions as defined by (Breen et al., 2022)): one for SNPs with zero effect, one for very small effects, one for small to medium effects and one medium to large, with a cumulative genetic variance explained by SNPs. e is a vector of residual effects following , is the residual variance. The BayesR was implemented using a Markov Chain Monte Carlo (MCMC) process with 50,000 iterations and 25,000 burn-in in 5 chains. Narrow-sense heritability was calculated as . Genomic estimated breeding values (, GEBVs) were predicted as the linear combination of marker effects as under (Equation 5):
where X is an incidence matrix of SNP genotypes and a is a vector of the SNP effects that were estimated from Equation 4. Prediction accuracy of the GEBVs was estimated as the Pearson correlation between GEBVs and BLUEs.
2.4.3 Genetic correlation for GYD between clusters
Pair-wise genetic correlations between GYD clusters were estimated using ASReml-R (Butler et al., 2009). The genetic correlation between a pair of clusters was calculated as . In Equation 4, the random additive genetic effects a have a normal distribution following a normal distribution , where , where is the additive genetic variance for cluster i, is the additive genetic variance for cluster j, is the additive genetic covariance between cluster i and cluster j, G is the genomic relationship matrix (GRM) estimated from SNP genotyped using VanRaden et al. (2009), denotes the Kronecker product. The residual effects follow a normal distribution , where , where is the residual variance for cluster i, is the residual variance for cluster j.
2.4.4 Investigation of genomic prediction accuracy
Three validation scenarios were used to investigate the prediction accuracy of genomic breeding values: five-fold, Leave One year Out Validation (LOOV) and forward validation. In the five-fold validation, individuals were equally divided into five groups randomly, and the GEBVs of one group were predicted using training data from the remaining four groups. In LOOV, the GEBVs of one year were predicted using data of the remaining years for model training. For forward validation, the GEBVs of current years were predicted using data of previous years for model training. For all scenarios, the genotypes that overlapped with the training were removed from the validation to prevent overestimating the prediction accuracy.
Within-cluster and across-cluster validation were used to develop prediction accuracies for GYD. All three validation scenarios – namely five-fold, LOOV, and forward validation – were applied for prediction accuracy within-clusters. Furthermore, all validation scenarios were applied to the entire population (without clustering on the GYD data) in a non-G×E validation method. For the across-cluster validation data, validation was initially carried out between clusters, with one cluster being predicted using data from all other clusters as a training set. Secondly, validation was performed between clusters by predicting one cluster using data from another cluster. For all other target traits, non-G×E interactions were used in all validation scenarios across the whole population.
2.4.5 Efficiency of GS over BLUP selection
To evaluate the efficiency of GS over BLUP-based selection, the best linear unbiased prediction (BLUP) EBVs of traits examined were estimated based on BLUEs and pedigree information using the model specified in Equation 4 and implemented with ASReml-R (Butler et al., 2009). The random additive genetic effects a follow a normal distribution with , where is the additive genetic variance and A is the numerator relationship matrix calculated from pedigree (Mrode, 2014). Forward validation was conducted based on BLUP EBVs to derive the accuracy of BLUP EBVs for the comparison between GS and BLUP selection.
Expected genetic gain of GS over BLUP selection based on EBVs was calculated for different scenarios of selecting parents from different stages of a breeding cycle: Stage 2, F6, F2 and F1. The generation intervals were chosen as defined by Li et al. (2022): 8.5, 5, 1 and 0.5 years for selecting parents from Stage 2, F6, F2 and F1 in GS and 8 years for BLUP selection. Additional expected genetic gain obtained from GS over BLUP selection was calculated in Equation 7 as that described by (Li et al., 2019):
where i is the selection intensity, is the accuracy of GS estimated from the forward validation, is the accuracy of BLUP selection estimated from the forward validation based on EBV, is the square root of the additive genetic variance, Lg is the generation interval of lentil breeding population under GS and La is the generation interval of lentil breeding population under BLUP selection.
3 Results
3.1 Clustering of grain yield
Based on six environmental variables, the 132 yield trials were grouped into four clusters using hierarchical clustering (Figure 1). Most of the variation (86.8%) was explained by Dimensions 1 and 2. Total rainfall and evaporation were found to be the most important climatic factors. GYD-C1 was marked by high rainfall and low evaporation, GYD-C2 with high rainfall and high evaporation, GYD-C3 with low rainfall and low evaporation, and GYD-C4 with low rainfall and high evaporation. The number of trials grouped into GYD-C1, GYD-C3 and GYD-C4 was 33, 38 and 47 which were screened across 8, 9 and 8 years, respectively (Table 2). Only 14 trials from 4 years were grouped into GYD-C2.
Figure 1 Clustering of GYD data based on environmental variables of daily evaporation, solar radiation, rainfall, vapour pressure, and temperatures (Min and Max).
Table 2 Summary of environmental variables for trial sites within each grain yield cluster, as well as all trials sites for grain yield.
3.2 Phenotypic and genotypic variations of target traits
The average mean BLUE for GYD (tons/ha) was 1.8, with GYD-C1 and GYD-C2 having the highest and GYD-C3 having the lowest yield (Table 3). For all other traits, spatial adjustment was performed for each individual trial. We also evaluated connectedness (number of genotypes in common between environment), where the trials shared genotypes ranging from 3 to 341 (Supplementary Table 4). For most traits, moderate to high levels of pairwise genetic correlations were observed among different environments over years, except (>=0.7) yield where poor correlations were found (-1 to 1) (Supplementary Table 5).
3.3 Heritability and variance components
Broad sense heritabilities for GYD, the biotic and abiotic stress tolerance traits, as well as grain quality traits ranged from 0.47 to 0.83, whilst narrow sense heritabilities were lower, ranging from 0.24 to 0.66 (Tables 4, 5).
Table 4 Broad-sense heritability (H2), narrow-sense heritability (h2), the additive genetic variance () and residual variance () for grain yield evaluation trial sites.
Table 5 Broad sense heritability (H2), narrow sense heritability (h2), the additive genetic variance () and residual variance () for measured biotic and abiotic stress tolerance and grain quality traits.
3.4 Genomic prediction accuracies for grain yield
When comparing the three validation scenarios, random five-fold validation exhibited the highest prediction accuracy (0.47-0.57), where GYD-C2 showed the highest accuracy (0.57) and GYD-C1 the lowest (0.47) (Figure 2). The prediction accuracies for LOOV ranged from 0.27 to 0.42, with GYD-C4 (0.41) having the highest and GYD-C2 having the lowest (0.27) accuracy. The prediction accuracies for forward cross-validation ranged from 0.25 to 0.41, with GYD-C4 having the highest (0.41) and GYD-C1 having the lowest (0.25) accuracy. For yield without clustering (GYD), the accuracies ranged from 0.42-0.63 for all validation methods.
Figure 2 Prediction accuracies for grain yield clusters (standard error shown in error bar) achieved using three cross-validation methods: five-fold, LOOV and forward validation.
3.5 Genetic correlation between GYD clusters and prediction accuracies
The prediction accuracy across clusters was found to be highly correlated to the genetic correlation between clusters, where higher genetic correlation led to higher prediction accuracy (Figure 3). Genetic correlations among GYD-C1, GYD-C3 and GYD-C4 were 0.83-1.00, implying a low level of G×E interaction among clusters. This led to a moderate prediction accuracy, ranging from 0.38 to 0.56. The correlation between GYD-C2 and the other clusters (GYD-C3, GYD-C4, and GYD-C1) was lower (0.36-0.76), indicating higher G×E interaction between GYD-C2 and other clusters. This resulted in weak to moderate prediction accuracies (0.19-0.44).
Figure 3 Relationship between prediction accuracy and genetic correlation between yield clusters. The text beside each dot point represents the training (left to hyphen) and the validation cluster (right to hyphen).
3.6 Prediction accuracies for abiotic and biotic stress tolerances and grain quality traits
The prediction accuracy for ABH ranged from 0.45 to 0.57, with the highest achieved in LOOV and lowest in forward cross-validation (Figure 4). For ABN, similar prediction accuracy (0.60-0.64) was obtained with each validation method. For BGM, moderate prediction accuracy (0.63) was achieved using the five-fold method. The prediction accuracy could not be calculated for BGM using the LOOV and forward methods due to complete overlap of genotypes between the training and test sets.
Figure 4 Prediction accuracies (standard error shown in error bar) from random five-fold, LOOV and forward cross-validation for abiotic and biotic stress tolerance and grain quality traits.
For SAL, the prediction accuracy ranged from 0.39 to 0.52, with the highest achieved in random five-fold and lowest in forward cross-validation. For BOR, the prediction accuracy ranged from 0.47 to 0.72, with the highest achieved in random five-fold and lowest in forward cross-validation. The five-fold method had the highest prediction accuracies for both BOR (0.72) and SAL (0.52), while the forward prediction method had the lowest prediction accuracies, 0.47 and 0.39 respectively.
Prediction accuracies for grain quality traits GWT and SSI were 0.80 in all cross-validation methods. The prediction accuracy for PRO ranged from 0.51 to 0.65, with random five-fold achieving the highest and forward cross-validation having the lowest.
3.7 Comparison of GS to BLUP selection
The expected genetic gain per year from GS and BLUP selection when parents were selected at the stage 2, F6, F2 and F1 generation in the breeding cycle is shown in Table 6. In all scenarios, GS outperformed BLUP selection for all traits. When parents were chosen from stage 2, GS outperformed BLUP selection by 0.08 to 2.32-fold increase for all traits. Similarly, selecting parents from the F6 generation led to GS outperforming BLUP selection by 0.84 to 4.65 times. The additional expected genetic gain from GS over BLUP selection was 9.4 to 27.3-fold when selecting parents from the F2 generation, and 17.4 to 55.5-fold for selecting parents from the F1 generation.
Table 6 Expected genetic gain obtained from GS over BLUP selection. Genetic gain was calculated using the accuracy obtained from forward cross-validation on the parents from stage 2, F6, F2 and F1..
4 Discussion
Over the years, lentil breeding in Australia has achieved significant success in enhancing grain yield through the utilization of conventional breeding methods and effective management practices. This continuous effort has resulted in an encouraging annual increase in the rate of genetic gain, averaging c. 1.2% over the past three decades (Sadras et al., 2021; Silva-Perez et al., 2022). However, to address increasing global demand for plant-based protein, it is crucial to explore new tools and technologies such as GS, which has the potential to further increase the rate of genetic gain, particularly in environments that are prone to abiotic and biotic stresses. GS enables more accurate and informed breeding selections, cost savings, and reduced breeding cycle times compared to traditional phenotype-based breeding approaches. Collectively, these advantages of GS lead to increased rates of genetic gain per unit time. Here, we report the implementation of GS for target traits in Agriculture Victoria’s national lentil breeding program. Ten years of historical phenotyping data (2010-2020) captured from advanced breeding stages under field and controlled environment conditions was used to train genomic prediction models for yield, biotic stress resistance, abiotic stress tolerance and seed quality traits. Various cross-validation methods achieved moderate to high prediction accuracies, which varied depending on trait complexity. Simple traits like boron tolerance had higher accuracies compared to complex traits like GYD. Despite this, GYD still had moderate prediction accuracies, making GS a preferred method for lentil breeders. The observed accuracies are comparable to previous studies in pulse crops (Li et al., 2018; Annicchiarico et al., 2019; Keller et al., 2020; Bari et al., 2021; Diaz et al., 2021).
In commercial breeding programs, addressing the G × E interaction is crucial to achieve yield stability, as it results in varying genotype performance across different environments. Including G × E components in GS models has been reported to improve prediction accuracies. To assess the G x E levels in the current dataset, pairwise genetic correlations were calculated among various GYD trials conducted from 2010 to 2020 (data not shown). The results revealed relatively low correlations, which could be attributed to the limited number of overlapping entries between these trials, resulting in low connectedness. As an alternative approach, environmental variables were utilized to cluster the GYD data into four groups, primarily based on total rainfall and evaporation, which accounted for most of the variation. Clustering of mega-environments into groups based on environmental variables has been reported in barley (Heslot et al., 2013; Lin et al., 2021) and wheat (Dawson et al., 2013). Genetic correlations were high among GYD-C1, GYD-C3, and GYD-C4 (exceeding 0.80), but lower with GYD-C2 (0.36 to 0.76). Predicting GYD-C2 using GYD-C3 and GYD-C4, and vice versa, resulted in lower accuracies (0.19-0.25). This could be due to GYD-C2 having only 14 trials over four years (2010-2014), while GYD-C3 and GYD-C4 spanned 33-47 trials over ten years (2010-2020), leading to lower connectedness between GYD-C2 and the other clusters. This would also mean the training population of GYD-C2 would be genetically more distant (less related) when compared to GYD-C3 and GYD-C4. Despite this, GYD-C2 exhibited higher heritability estimates, possibly due to higher relatedness within its training population. Overall, clustering based on environmental variables did not significantly improve GYD predictions compared to non-clustering methods, as environmental variables are just one factor influencing plant performance.
In Australia, lentils are a high-value cash crop, with more than 95% of the harvest exported to the global market. Consequently, grain quality traits such as seed size, protein content, and seed colour are priority breeding targets. However, breeding for quality traits poses further challenges because of the commonly observed negative correlation between seed protein content and grain yield in pulse crops such as chickpea, common bean and pigeon pea (Saxena et al., 1987; Gaur et al., 2016; Keller et al., 2020). When traits are antagonistically connected, it adds another layer of complexity for the simultaneous improvement of multiple traits. To address this complexity, different strategies can be implemented in a breeding program. One strategy is to apply selection at an early stage of the breeding program, when a larger number of early generation progenies are available and stronger selection can be applied. With larger progeny numbers, the likelihood of identifying individuals with favourable alleles for both grain yield and protein content is increased. A wheat GS study found a positive association between grain yield and protein content in some progeny groups, despite the fact that the corresponding parents were negatively correlated (Yao et al., 2018). However, while this may result in short-term success, it will result in the loss of the highest yielding genotypes in the breeding program due to segregation. Another option is the use of a selection index, where multiple traits of importance are indexed simultaneously (weighted selection index, culling and tandem selection, index selection or independent culling) (Batista et al., 2021; Covarrubias-Pazaran et al., 2021). A weighted selection index is calculated by assigning a certain weight to each trait based on their relative economic values. In the absence of economic weights, breeders can define targeted genetic gain based on long-term breeding goals. This allows the balance between favorable genetic gain and targeted genetic gain to be optimized to overcome the problem of targeted gain in which selection for one trait may lead to unfavorable changes in other traits due to correlations between traits. Breeders may also aim to develop product classes (also known as target product profiles) to maintain balance in the breeding program and meet export demands for different markets. For instance, Agriculture Victoria’s national lentil breeding program is targeting different export markets who have demand for small and large seed size lentils. To clearly define these product profiles, we are implementing weighted selection indices to underpin parental selections for hybridization programs. Based on these decisions, crossing designs, cross evaluation, and selection tasks are carried out for each individual product profile. This means that selection decisions are made at various breeding stages including intercrosses, bulk-up, preliminary yield trials, Stage 1 trials, and Stage 2 trials (evaluation of advanced breeding lines).
As grain yield and quality in lentils can be significantly impacted by biotic and abiotic stresses, (Araujo et al., 2015; Li et al., 2015; Scheelbeek et al., 2018; Dutta et al., 2022), it is imperative to target breeding for disease resistance and abiotic stress tolerance. Both AB and BGM affect lentil production in Australia. For biosecurity reasons, glasshouse assays are used to screen genotypic responses to AB resistance. The prediction accuracy (0.45-0.64) for AB resistance observed in this study was moderately high indicating that GS could replace some of the phenotypic screens as a selection tool for this trait. BGM assays, on the other hand, rely on field endemics that occur under high rainfall conditions. Given the pathogen’s complexity and the low frequency of high rainfall seasons across lentil growing regions in Australia, breeders have relied on opportunistic field scores. In this study, moderately high prediction accuracy (0.62) was obtained using data captured from two seasons (2013 and 2016), which is sufficient to enable GS to be applied for this trait in the breeding program. Similarly, moderate to high prediction accuracies observed for both boron and salt tolerance (0.39-0.72) (Figure 4) demonstrated the utility of GS for developing abiotic stress tolerant lentil varieties. In conclusion, incorporating these GS results into a breeding strategy will help to accelerate the development of lentil cultivars with quantitative disease resistance and abiotic stress tolerance.
Early parent selection in GS is critical for shortening the breeding cycle and lowering breeding costs. When GS is used in the breeding program, it is expected to outperform phenotypic selection in terms of accuracy and genetic gain per unit time (Matei et al., 2018; Li et al., 2022). In this study, GS was shown to provide significant genetic gain over BLUP based selection when parents were selected from stage 2, F6, F2 and F1 (Table 6). Noteworthy was that prediction accuracy was reduced when parents were selected from earlier generations. This was likely caused by the training population only being updated with phenotyped Stage 2 and 3 materials. Historically, updating the parents for selection of the next breeding cycle could take up to 8 years (F8 or stage 2). However, with the availability of genomic prediction, parental selection can be carried out earlier in the breeding program. This is beneficial for shortening the breeding cycle and cycling back superior individuals from crosses as parents into subsequent hybridization cycles. (Li et al., 2022) suggested that using phenotyped F2 families to update the prediction model each breeding cycle improved prediction accuracy. Another point worth mentioning is that increased genetic gain per unit of time resulting from the selection of earlier generation parents will result in the rapid loss of genetic diversity (inbreeding). To mitigate this, breeders can use different strategies including introducing exotic germplasm into crossing blocks and using computational algorithms that help chose optimal parents while maintaining diversity (Lin et al., 2017). Li et al. (2022) developed a strategy to maximise genetic gain while preserving genetic diversity by restricting the co-ancestry of selected parents and the number of alleles fixed. Similarly, (Gorjanc et al., 2018) developed a method for comparing optimal cross-selection and truncation selection, and optimal cross-selection to increase long-term genetic gain while preserving diversity. In conclusion, this study demonstrates the potential of GS for making informed selections in breeding programs and for cycling parents back from earlier stages to accelerate the development of high yielding, biotic and abiotic stress tolerant, lentil varieties with superior grain quality.
Data availability statement
The minimum data set required to undertake the analysis included in the manuscript is provided in Supplementary files. We note that both the sample and marker names in the genotyping data file have been de-identified due to the data originating from a commercial breeding program. For any additional inquiries, please contact the corresponding author.
Author contributions
AG: Formal Analysis, Investigation, Methodology, Software, Visualization, Writing – original draft. YL: Conceptualization, Methodology, Software, Supervision, Visualization, Writing – review & editing. AS: Data curation, Methodology, Validation, Writing – review & editing. SS: Data curation, Formal Analysis, Methodology, Writing – review & editing. HV-K: Formal Analysis, Investigation, Methodology, Software, Visualization, Writing – review & editing. MH: Funding acquisition, Project administration, Resources, Writing – review & editing. GR: Funding acquisition, Methodology, Project administration, Resources, Supervision, Validation, Writing – review & editing. SK: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was funded by Agriculture Victoria Research and Grain Research Development Corporation through project DJP2104-0030PX.
Acknowledgments
We would like to thank the Horsham pulse breeding glasshouses and field operations staff for managing the field experiments, glasshouse work, and data collection. We would like to thank Dr. Zibei Lin for reading the manuscript and making useful suggestions. Furthermore, we would like to thank our interstate collaborators for managing the trials and phenotyping experimental trials. We would also like to thank AgriBio’s molecular genetics team for their genotyping efforts. Finally, we would like to thank Agriculture Victoria Research and the Grain Research Development Corporation for the financial support of this research.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1284781/full#supplementary-material
References
ABARES (2021). Australian grains, oilseeds and pulses (Canberra, Australia: Australian Bureau of Agricultural and Resource Economics and Sciences). Available at: https://www.agriculture.gov.au/abares/research-topics/agricultural-outlook/previous-reports.
Annicchiarico, P., Nazzicari, N., Pecetti, L., Romani, M., Russi, L. (2019). Pea genomic selection for Italian environments. BMC Genomics 20, 603. doi: 10.1186/s12864-019-5920-x
Araujo, S. S., Beebe, S., Crespi, M., Delbreil, B., Gonzalez, E. M., Gruber, V., et al. (2015). Abiotic stress responses in legumes: strategies used to cope with environmental challenges. Crit. Rev. Plant Sci. 34, 237–280. doi: 10.1080/07352689.2014.898450
Bari, M. A. A., Zheng, P., Viera, I., Worral, H., Szwiec, S., Ma, Y., et al. (2021). Harnessing genetic diversity in the USDA pea germplasm collection through genomic prediction. Front. Genet. 12, 707754. doi: 10.3389/fgene.2021.707754
Barrios, A., Kahraman, A., Aparicio, T., Rodríguez, M., Mosquera, P., García, P., et al. (2007). “Preliminary identification of QTLS for winter hardiness, frost tolerance and other agronomic characters in lentil (Lens culinaris Medik.) for Castilla y León (SPAIN) region,” in Proceedings of the 6th European conference on gain legumes, Lisbon. 12–16.
Batista, L. G., Gaynor, R. C., Margarido, G. R. A., Byrne, T., Amer, P., Gorjanc, G., et al. (2021). Long-term comparison between index selection and optimal independent culling in plant breeding programs with genomic prediction. PloS One 16, e0235554. doi: 10.1371/journal.pone.0235554
Beyene, Y., Gowda, M., Olsen, M., Robbins, K. R., Pérez-Rodríguez, P., Alvarado, G., et al. (2019). Empirical comparison of tropical maize hybrids selected through genomic and phenotypic selections. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.01502
Breen, E. J., MacLeod, I. M., Ho, P. N., Haile-Mariam, M., Pryce, J. E., Thomas, C. D., et al. (2022). BayesR3 enables fast MCMC blocked processing for largescale multi-trait genomic prediction and QTN mapping analysis. Commun. Biol. 5, 661. doi: 10.1038/s42003-022-03624-1
Browning, S. R., Browning, B. L. (2007). Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am. J. Hum. Genet. 81, 1084–1097. doi: 10.1086/521987
Browning, B. L., Browning, S. R. (2016). Genotype Imputation with Millions of Reference Samples. Am. J. Hum. Genet. 98, 116–126. doi: 10.1016/j.ajhg.2015.11.020
Butler, D. G., Cullis, B. R., Gilmour, A. R., Gogel, B. J. (2009). ASReml-R reference manual (Brisbane, Australia: NSW Department of Primary Industries & VSN International Ltd).
Covarrubias-Pazaran, G., Gebeyehu, Z., Gemenet, D., Werner, C., Labroo, M., Sirak, S., et al. (2021). Breeding schemes: what are they, how to formalize them, and how to improve them? Front. Plant Sci. 12, 791859. doi: 10.3389/fpls.2021.791859
Crossa, J., Perez-Rodriguez, P., Cuevas, J., Montesinos-Lopez, O., Jarquin, D., de Los Campos, G., et al. (2017). Genomic selection in plant breeding: methods, models, and perspectives. Trends Plant Sci. 22, 961–975. doi: 10.1016/j.tplants.2017.08.011
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., et al. (2011). The variant call format and VCFtools. Bioinformatics 27, 2156–2158. doi: 10.1093/bioinformatics/btr330
Das, S., Forer, L., Schönherr, S., Sidore, C., Locke, A. E., Kwong, A., et al. (2016). Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287. doi: 10.1038/ng.3656
Davidson, J., Smetham, G., Russ, M. H., McMurray, L., Rodda, M., Krysinska-Kaczmarek, M., et al. (2016). Changes in aggressiveness of the ascochyta lentis population in southern Australia. Front. Plant Sci. 7. doi: 10.3389/fpls.2016.00393
Dawson, J. C., Endelman, J. B., Heslot, N., Crossa, J., Poland, J., Dreisigacker, S., et al. (2013). The use of unbalanced historical data for genomic selection in an international wheat breeding program. Field Crops Res. 154, 12–22. doi: 10.1016/j.fcr.2013.07.020
Diaz, S., Ariza-Suarez, D., Ramdeen, R., Aparicio, J., Arunachalam, N., Hernandez, C., et al. (2021). Genetic architecture and genomic prediction of cooking time in common bean (Phaseolus vulgaris L.). Front. Plant Sci. 11, 622213. doi: 10.3389/fpls.2020.622213
Dissanayake, R., Kahrood, H. V., Dimech, A. M., Noy, D. M., Rosewarne, G. M., Smith, K. F., et al. (2020). Development and application of image-based high-throughput phenotyping methodology for salt tolerance in lentils. Agronomy 10, 1992. doi: 10.3390/agronomy10121992
Dutkowski, G. W., Costa e Silva, J., Gilmour, A. R., Wellendorf, H., Aguiar, A. (2006). Spatial analysis enhances modelling of a wide variety of traits in forest genetic trials. Can. J. For. Res. 36, 1851–1870. doi: 10.1139/x06-059
Dutta, A., Trivedi, A., Nath, C. P., Gupta, D. S., Hazra, K. K. (2022). A comprehensive review on grain legumes as climate-smart crops: challenges and prospects. Environ. Challenges 7, 100479. doi: 10.1016/j.envc.2022.100479
Fox, P. N., Rosielle, A. A. (1982). Reducing the influence of environmental main-effects on pattern analysis of plant breeding environments. Euphytica 31, 645–656. doi: 10.1007/BF00039203
Galli, G., Alves, F. C., Morosini, J. S., Fritsche-Neto, R. (2020). On the usefulness of parental lines GWAS for predicting low heritability traits in tropical maize hybrids. PloS One 15, e0228724. doi: 10.1371/journal.pone.0228724
Gaur, P. M., Singh, M. K., Samineni, S., Sajja, S. B., Jukanti, A. K., Kamatam, S., et al. (2016). Inheritance of protein content and its relationships with seed size, grain yield and other traits in chickpea. Euphytica 209, 253–260. doi: 10.1007/s10681-016-1678-2
Gaynor, R. C., Gorjanc, G., Bentley, A. R., Ober, E. S., Howell, P., Jackson, R., et al. (2017). A two-part strategy for using genomic selection to develop inbred lines. Crop Sci. 57, 2372–2386. doi: 10.2135/cropsci2016.09.0742
Gorjanc, G., Gaynor, R. C., Hickey, J. M. (2018). Optimal cross selection for long-term genetic gain in two-part programs with rapid recurrent genomic selection. Theor. Appl. Genet. 131, 1953–1966. doi: 10.1007/s00122-018-3125-3
Haile, T. A., Heidecker, T., Wright, D., Neupane, S., Ramsay, L., Vandenberg, A., et al. (2020). Genomic selection for lentil breeding: Empirical evidence. Plant Genome 13, e20002. doi: 10.1002/tpg2.20002
Heffner, E. L., Lorenz, A. J., Jannink, J.-L., Sorrells, M. E. (2010). Plant breeding with genomic selection: gain per unit time and cost. Crop Sci. 50, 1681–1690. doi: 10.2135/cropsci2009.11.0662
Heslot, N., Akdemir, D., Sorrells, M. E., Jannink, J.-L. (2014). Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions. Theor. Appl. Genet. 127, 463–480. doi: 10.1007/s00122-013-2231-5
Heslot, N., Jannink, J.-L., Sorrells, M. E. (2013). Using genomic prediction to characterize environments and optimize prediction accuracy in applied breeding data. Crop Sci. 53, 921–933. doi: 10.2135/cropsci2012.07.0420
Ismail, B. P., Senaratne-Lenagala, L., Stube, A., Brackenridge, A. (2020). Protein demand: review of plant and animal proteins used in alternative protein product development and production. Anim. Front. 10, 53–63. doi: 10.1093/af/vfaa040
Jain, A., Roorkiwal, M., Pandey, M. K., Varshney, R. K. (2017). “Current status and prospects of genomic selection in legumes,” in Genomic selection for crop improvement: new molecular breeding strategies for crop improvement. Eds. Varshney, R. K., Roorkiwal, M., Sorrells, M. E. (Cham: Springer International Publishing), 131–147.
Janghel, D. K., Sharma, V. (2022). “Genomics-assisted breeding approaches in lentil (Lens culinaris medik),” in Technologies in plant biotechnology and breeding of field crops. Eds. Kiran, K. U., Abdin, M. Z. (Singapore: Springer Nature Singapore), 201–237.
Jannink, J. L., Lorenz, A. J., Iwata, H. (2010). Genomic selection in plant breeding: from theory to practice. Brief Funct. Genomics 9, 166–177. doi: 10.1093/bfgp/elq001
Jarquín, D., Lemes da Silva, C., Gaynor, R. C., Poland, J., Fritz, A., Howard, R., et al. (2017). Increasing genomic-enabled prediction accuracy by modeling genotype × Environment interactions in kansas wheat. Plant Genome. 10, 1–15 doi: 10.3835/plantgenome2016.12.0130
Jighly, A., Hayden, M., Daetwyler, H. (2021). Integrating genomic selection with a genotype plus genotype x environment (GGE) model improves prediction accuracy and computational efficiency. Plant Cell Environ. 44, 3459–3470. doi: 10.1111/pce.14145
Kaale, L. D., Siddiq, M., Hooper, S. (2023). Lentil (Lens culinaris Medik) as nutrient-rich and versatile food legume: A review. Legume Sci. 5, e169. doi: 10.1002/leg3.169
Kaur, S., Cogan, N. O. I., Stephens, A., Noy, D., Butsch, M., Forster, J. W., et al. (2014). EST-SNP discovery and dense genetic mapping in lentil (Lens culinaris Medik.) enable candidate gene selection for boron tolerance. Theor. Appl. Genet. 127, 703–713. doi: 10.1007/s00122-013-2252-0
Keller, B., Ariza-Suarez, D., de la Hoz, J., Aparicio, J. S., Portilla-Benavides, A. E., Buendia, H. F., et al. (2020). Genomic prediction of agronomic traits in common bean (Phaseolus vulgaris L.) under environmental stress. Front. Plant Sci. 11, 1001. doi: 10.3389/fpls.2020.01001
Kumar, S., Ali, M. (2006). GE interaction and its breeding implications in pulses. Botanica 56, 31–36.
Kumar, S., Barpete, S., Kumar, J., Gupta, P., Sarker, A. (2013). Global lentil production: constraints and strategies. SATSA Mukhapatra-Annual Tech. 17, 1–13.
Kumar, J., Gupta, D. S., Tiwari, P. (2020). “Lentil breeding in genomic era: present status and future prospects,” in Accelerated plant breeding, volume 3: food legumes (Cham: Springer International Publishing).
Kumar, J., Sen Gupta, D., Baum, M., Varshney, R. K., Kumar, S. (2021). Genomics-assisted lentil breeding: Current status and future strategies. Legume Sci. 3, e71. doi: 10.1002/leg3.71
LeMasurier, L. S., Panozzo, J. F., Walker, C. K. (2014). A digital image analysis method for assessment of lentil size traits. J. Food Eng. 128, 72–78. doi: 10.1016/j.jfoodeng.2013.12.018
Li, H. (2011). A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993. doi: 10.1093/bioinformatics/btr509
Li, Y., Kaur, S., Pembleton, L. W., Valipour-Kahrood, H., Rosewarne, G. M., Daetwyler, H. D. (2022). Strategies of preserving genetic diversity while maximizing genetic response from implementing genomic selection in pulse breeding programs. Theor. Appl. Genet. Theoretische und angewandte Genetik 135, 1813–1828. doi: 10.1007/s00122-022-04071-6
Li, Y., Klápště, J., Telfer, E., Wilcox, P., Graham, N., Macdonald, L., et al. (2019). Genomic selection for non-key traits in radiata pine when the documented pedigree is corrected using DNA marker information. BMC Genomics 20, 1026. doi: 10.1186/s12864-019-6420-8
Li, H., Rodda, M., Gnanasambandam, A., Aftab, M., Redden, R., Hobson, K., et al. (2015). Breeding for biotic stress resistance in chickpea: progress and prospects. Euphytica 204, 257–288. doi: 10.1007/s10681-015-1462-8
Li, Y., Ruperao, P., Batley, J., Edwards, D., Khan, T., Colmer, T. D., et al. (2018). Investigating drought tolerance in chickpea using genome-wide association mapping and genomic selection based on whole-genome resequencing data. Front. Plant Sci. 9, 190. doi: 10.3389/fpls.2018.00190
Lin, Z., Robinson, H., Godoy, J., Rattey, A., Moody, D., Mullan, D., et al. (2021). Genomic prediction for grain yield in a barley breeding program using genotype × environment interaction clusters. Crop Sci. 61, 2323–2335. doi: 10.1002/csc2.20460
Lin, Z., Shi, F., Hayes, B. J., Daetwyler, H. D. (2017). Mitigation of inbreeding while preserving genetic gain in genomic breeding programs for outbred plants. Theor. Appl. Genet. 130, 969–980. doi: 10.1007/s00122-017-2863-y
Maher, L. A., Connor, D. (2003). “Salt tolerant lentils—A possibility for the future?,” in Solutions for a better environment. Proceedings of the 11th Australian agronomy conference, geelong, victoria, Australia (Geelong, Australia: The Australian Society of Agronomy Inc).
Matei, G., Woyann, L. G., Milioli, A. S., de Bem Oliveira, I., Zdziarski, A. D., Zanella, R., et al. (2018). Genomic selection in soybean: accuracy and time gain in relation to phenotypic selection. Mol. Breed. 38, 1–13. doi: 10.1007/s11032-018-0872-4
Meuwissen, T. H., Hayes, B. J., Goddard, M. E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829. doi: 10.1093/genetics/157.4.1819
Revilla, I., Lastras, C., González-Martín, M. I., Vivar-Quintana, A. M., Morales-Corts, R., Gómez-Sánchez, M. A., et al. (2019). Predicting the physicochemical properties and geographical ORIGIN of lentils using near infrared spectroscopy. J. Food Composition Anal. 77, 84–90. doi: 10.1016/j.jfca.2019.01.012
Rodda, M. S., Davidson, J., Javid, M., Sudheesh, S., Blake, S., Forster, J. W., et al. (2017). Molecular breeding for ascochyta blight resistance in lentil: current progress and future directions. Front. Plant Sci. 8, 1136. doi: 10.3389/fpls.2017.01136
Rodda, M. S., Sudheesh, S., Javid, M., Noy, D., Gnanasambandam, A., Slater, A. T., et al. (2018). Breeding for boron tolerance in lentil (Lens culinaris Medik.) using a high-throughput phenotypic assay and molecular markers. Plant Breed. 137, 492–501. doi: 10.1111/pbr.12608
Sadras, V. O., Rosewarne, G. M., Lake, L. (2021). Australian lentil breeding between 1988 and 2019 has delivered greater yield gain under stress than under high-yield conditions. Front. Plant Sci. 12, 674327. doi: 10.3389/fpls.2021.674327
Saxena, K. B., Faris, D. G., Singh, U., Kumar, R. V. (1987). Relationship between seed size and protein content in newly developed high protein lines of pigeonpea. Plant Foods Hum. Nutr. 36, 335–340. doi: 10.1007/BF01892354
Scheelbeek, P. F., Bird, F. A., Tuomisto, H. L., Green, R., Harris, F. B., Joy, E. J., et al. (2018). Effect of environmental changes on vegetable and legume yields and nutritional quality. Proc. Natl. Acad. Sci. 115, 6804–6809. doi: 10.1073/pnas.1800442115
Silva-Perez, V., Shunmugam, A. S. K., Rao, S., Cossani, C. M., Tefera, A. T., Fitzgerald, G. J., et al. (2022). Breeding has selected for architectural and photosynthetic traits in lentils. Front. Plant Sci. 13, 925987. doi: 10.3389/fpls.2022.925987
Singh, D., Singh, C. K., Singh Tomar, R. S., Pal, M. (2017). Genetics and molecular mapping of heat tolerance for seedling survival and pod set in lentil. Crop Sci. 57, 3059–3067. doi: 10.2135/cropsci2017.05.0284
Singh, D., Singh, C. K., Tomar, R. S. S., Sharma, S., Karwa, S., Pal, M., et al. (2020). Genetics and molecular mapping for salinity stress tolerance at seedling stage in lentil (Lens culinaris Medik). Crop Sci. 60, 1254–1266. doi: 10.1002/csc2.20030
Smoliński, A., Walczak, B., Einax, J. W. (2002). Hierarchical clustering extended with visual complements of environmental data set. Chemometrics Intelligent Lab. Syst. 64, 45–54. doi: 10.1016/S0169-7439(02)00049-7
Subedi, M., Bett, K. E., Khazaei, H., Vandenberg, A. (2018). Genetic mapping of milling quality traits in lentil (Lens culinaris Medik.). Plant Genome 11, 170092. doi: 10.3835/plantgenome2017.10.0092
Sudheesh, S., Rodda, M. S., Davidson, J., Javid, M., Stephens, A., Slater, A. T., et al. (2016). SNP-based linkage mapping for validation of QTLs for resistance to ascochyta blight in lentil. Front. Plant Sci. 7. doi: 10.3389/fpls.2016.01604
Suri, G. K., Braich, S., Noy, D. M., Rosewarne, G. M., Cogan, N. O. I., Kaur, S. (2022). Advances in lentil production through heterosis: Evaluating generations and breeding systems. PloS One 17, e0262857. doi: 10.1371/journal.pone.0262857
Taylor, P. W. J., Ades, P. K., Ford, R. (2006). QTL mapping of resistance in lentil (Lens culinaris ssp. culinaris) to ascochyta blight (Ascochyta lentis). Plant Breed. 125, 506–512. doi: 10.1111/j.1439-0523.2006.01259.x
Tibbits, J., McManus, L. J., Spokevicius, A. V., Bossinger, G. (2008). A rapid method for tissue collection and high-throughput isolation of genomic DNA from mature trees. Plant Mol. Biol. Rep. 24, 81–91. doi: 10.1007/BF02914048
VanRaden, P. M., Tassell, C. P., Wiggans, G. R., Sonstegard, T. S., Schnabel, R. D., Taylor, J. F., et al. (2009). Invited review: reliability of genomic predictions for North American Holstein bulls. J. Dairy Sci. 92, 16–24. doi: 10.3168/jds.2008-1514
Verma, P., Goyal, R., Chahota, R. K., Sharma, T. R., Abdin, M. Z., Bhatia, S. (2015). Construction of a genetic linkage map and identification of QTLs for seed weight and seed size traits in lentil (Lens culinaris medik.). PloS One 10, e0139666. doi: 10.1371/journal.pone.0139666
Keywords: BayesR, multi-environmental trial, genetic gain, genomic selection, lentil
Citation: Gebremedhin A, Li Y, Shunmugam ASK, Sudheesh S, Valipour-Kahrood H, Hayden MJ, Rosewarne GM and Kaur S (2024) Genomic selection for target traits in the Australian lentil breeding program. Front. Plant Sci. 14:1284781. doi: 10.3389/fpls.2023.1284781
Received: 29 August 2023; Accepted: 07 December 2023;
Published: 03 January 2024.
Edited by:
Aditya Pratap, Indian Institute of Pulses Research (ICAR), IndiaReviewed by:
Petr Smýkal, Palacký University in Olomouc, CzechiaShanmugavadivel Ps, Indian Institute of Pulses Research (ICAR), India
Copyright © 2024 Gebremedhin, Li, Shunmugam, Sudheesh, Valipour-Kahrood, Hayden, Rosewarne and Kaur. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sukhjiwan Kaur, Sukhjiwan.Kaur@agriculture.vic.gov.au