Prediction of additive, epistatic, and dominance effects using models accounting for incomplete inbreeding in parental lines of hybrid rye and sugar beet

Kristensen, Peter Skov; Sarup, Pernille; Fé, Dario; Orabi, Jihad; Snell, Per; Ripa, Linda; Mohlfeld, Marius; Chu, Thinh Tuan; Herrström, Joakim; Jahoor, Ahmed; Jensen, Just

doi:10.3389/fpls.2023.1193433

ORIGINAL RESEARCH article

Front. Plant Sci., 02 November 2023

Sec. Functional and Applied Plant Genomics

Volume 14 - 2023 | https://doi.org/10.3389/fpls.2023.1193433

This article is part of the Research TopicCultivation and Breeding of Special CropsView all 9 articles

Prediction of additive, epistatic, and dominance effects using models accounting for incomplete inbreeding in parental lines of hybrid rye and sugar beet

Peter Skov Kristensen^1*

Pernille Sarup²

Dario Fé³

Jihad Orabi²

Per Snell⁴

Linda Ripa⁴

Marius Mohlfeld⁵

Thinh Tuan Chu¹

Joakim Herrström⁴

Ahmed Jahoor^2,5,6

Just Jensen¹

¹Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark
²Research and Development, Nordic Seed A/S, Odder, Denmark
³Research Division, DLF Seeds A/S, Store Heddinge, Denmark
⁴Research and Development, DLF Beet Seed AB, Landskrona, Sweden
⁵Breeding, Nordic Seed Germany GmbH, Nienstädt, Germany
⁶Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden

Genomic models for prediction of additive and non-additive effects within and across different heterotic groups are lacking for breeding of hybrid crops. In this study, genomic prediction models accounting for incomplete inbreeding in parental lines from two different heterotic groups were developed and evaluated. The models can be used for prediction of general combining ability (GCA) of parental lines from each heterotic group as well as specific combining ability (SCA) of all realized and potential crosses. Here, GCA was estimated as the sum of additive genetic effects and within-group epistasis due to high degree of inbreeding in parental lines. SCA was estimated as the sum of across-group epistasis and dominance effects. Three models were compared. In model 1, it was assumed that each hybrid was produced from two completely inbred parental lines. Model 1 was extended to include three-way hybrids from parental lines with arbitrary levels of inbreeding: In model 2, parents of the three-way hybrids could have any levels of inbreeding, while the grandparents of the maternal parent were assumed completely inbred. In model 3, all parental components could have any levels of inbreeding. Data from commercial breeding programs for hybrid rye and sugar beet was used to evaluate the models. The traits grain yield and root yield were analyzed for rye and sugar beet, respectively. Additive genetic variances were larger than epistatic and dominance variances. The models’ predictive abilities for total genetic value, for GCA of each parental line and for SCA were evaluated based on different cross-validation strategies. Predictive abilities were highest for total genetic values and lowest for SCA. Predictive abilities for SCA and for GCA of maternal lines were higher for model 2 and model 3 than for model 1. The implementation of the genomic prediction models in hybrid breeding programs can potentially lead to increased genetic gain in two different ways: I) by facilitating the selection of crossing parents with high GCA within heterotic groups and II) by prediction of SCA of all realized and potential combinations of parental lines to produce hybrids with high total genetic values.

1 Introduction

Hybrid varieties of important crops, such as maize (Zea mays L.), rye (Secale cereale L.), and sugar beet (Beta vulgaris L. ssp. vulgaris), are widely cultivated and perform considerably better than inbred or population varieties (Campbell, 1990; Duvick, 2005; Laidig et al., 2017). The improved performance is due to heterosis or hybrid vigor, which occurs when genetically different lines are crossed (Labroo et al., 2021). However, breeding programs for hybrid crops typically require many resources and have long breeding cycles (Longin et al., 2012). Both of these factors limit genetic gain in hybrid crops compared to line breeding programs. Rye and sugar beet are both crops that are commonly cultivated as hybrids, which are produced by crossing inbred lines from different heterotic groups. In the current study, data from three-way hybrids and their parental components were evaluated. First, two-way crosses between cytoplasmic male-sterile (MS) and non-restorer (NR) lines from one heterotic group are produced, and the resulting male-sterile offspring are then crossed with a pollinator or restorer (R) line from a different heterotic group to produce the three-way hybrids. MS lines do not produce viable pollen. When MS lines are crossed with an NR line, the offspring remain male-sterile, and when MS lines are crossed with a restorer line, the offspring become male-fertile and can produce viable pollen (Vendelbo et al., 2020).

Many studies have shown that genomic prediction can be applied to a broad range of complex traits in animals and crops to increase rate of genetic gain and improve effectiveness in breeding programs (Meuwissen et al., 2016; Crossa et al., 2017; Kristensen et al., 2019). For genomic prediction, numerous of genome-wide markers are used to predict genomic breeding values of lines, based on a training set consisting of lines that have both been genotyped and phenotyped (Meuwissen et al., 2001). Many important agronomic traits have a complex genetic architecture, i.e. they are controlled by many QTL, each having a small effect (Würschum et al., 2011; Hackauf et al., 2017). For such traits, genomic prediction models have been shown to be more accurate than marker-assisted selection based on few markers (Wang et al., 2014; Arruda et al., 2016).

For optimal use of genomic selection in breeding programs for hybrid crops, it is essential to have models that can predict both genomic breeding values of lines within heterotic groups as well as the total genetic value of the hybrids (Technow et al., 2012). However, studies of genomic prediction in hybrid rye and sugar beet have so far been conducted using datasets and models, where it was not possible to separately estimate additive genetic effects of each parent and non-additive genetic effects of the hybrids, which limits the practical use for breeding (Hofheinz et al., 2012; Würschum et al., 2013; Wang et al., 2014; Auinger et al., 2016; Schulthess et al., 2016; Bernal-Vasquez et al., 2017). Non-additive genetic effects consist of dominance and epistatic deviations from the additive allele substitution effects. Dominance deviations are due to interactions between different alleles within a locus. Epistatic deviations are due to interactions between alleles across loci, and can consist of additive by additive, additive by dominance, dominance by dominance, and any higher order interactions (Falconer and Mackay, 1996).

González-Diéguez et al. (2021) developed a “GCA-model” to predict the performance of hybrids made by crossing completely inbred parental lines from two different heterotic groups. In this model, hybrid performance could be split into additive effects from the parental lines in each heterotic group, epistatic deviations both within and across the two groups, and dominance deviations. Genetic effects might differ between heterotic groups due to differences in allele frequencies, differences in linkage disequilibrium between QTL and markers, markers linked to opposite phases of the QTL, or due to QTL that segregate in one group, but not in the other (Vendelbo et al., 2020; Vendelbo et al., 2021a). Therefore, effects were defined separately for each of the two heterotic groups in the model (González-Diéguez et al., 2021). If parental lines are completely inbred, general combining ability (GCA) can be estimated as the sum of the additive and within-line epistatic deviations for each parent, while specific combining ability (SCA) can be estimated as the sum of the dominance and across-group epistatic deviations. If parental lines are not inbred, within-line epistatic deviations are not always transmitted to the offspring due to recombination during meiosis, and GCA is therefore estimated based on additive genetic effects only. González-Diéguez et al. (2021) evaluated their model using data from a hybrid maize breeding program, and the GCA-model resulted in high prediction accuracies for grain yield (0.80 to 0.92 based on different cross-validation strategies).

In the current study, the GCA-model was extended from predicting the performance of two-way hybrids produced from fully inbred lines to predicting the performance of three-way hybrids produced from parental lines with arbitrary levels of inbreeding. The models were evaluated using data from two commercial breeding programs for hybrid rye and sugar beet. The genetic variation and the degree of homozygosity in the parental lines of the different heterotic groups of the breeding programs were investigated. Rye grain yield and sugar beet root yield were analyzed, and different cross-validation strategies were used for evaluating the predictive abilities. The models can be used in breeding programs to predict SCA for all realized and potential hybrids as well as GCA for all genotyped individuals in the two parental heterotic groups.

2 Materials and methods

2.1 Phenotypic and genotypic data

Data from the rye breeding program of the company Nordic Seed and from the sugar beet breeding program of the company DLF Beet Seed was used (Supplementary Tables). The phenotypic data consisted of yield of three-way hybrids tested in replicated multi-location trials across several European countries from 2016-2022 for rye and from 2012-2022 for sugar beet. Environmental effects and spatial variations in the fields were accounted for by the fixed effects in the genomic models described below. For rye, grain yield of plots was corrected to a moisture content of 15%. Replications within a trial were either treated with fungicides and growth regulators or were untreated. For sugar beet, root yield of plots was recorded as fresh weight, and all replicates were treated with fungicides. The three-way hybrids were produced by first crossing an MS and an NR line. The MS lines were derived from NR lines via several generations of backcrossing. Thus, the MS and NR both belong to the same heterotic group. The two-way hybrid from the cross between MS and NR was then crossed with an R line from a second heterotic group to produce a three-way hybrid. The number of tested three-way hybrids and of the parental components used for producing the hybrids are shown in Table 1. Parental components of the hybrids (MS, NR, and R lines) were genotyped with SNP chip arrays. For rye lines, DNA was extracted from leaves of seedlings, and genotyping was carried out by TraitGenetics GmbH (Germany) using a custom Illumina Infinium 15K wheat + 5K rye SNP iSelect ultra HD chip array (Vendelbo et al., 2020). For sugar beet lines, DNA was extracted from first true leaves using the sbeadex™ Magnetic Bead Kit (LGC) in accordance with the manufacturer’s instructions, and genotyping was carried out by Eurofins Genomics Europe (Denmark) using a custom 21K Sugar beet Affymetrix Axiom microarray. After filtering for minor allele frequency and missing values (thresholds of 0.1% and 20%, respectively), 5,768 SNPs were included in the analyses for rye and 6,514 SNPs for sugar beet.

TABLE 1

Table 1 Number of three-way hybrids, parental components, and plots of three-way hybrids.

2.2 Genomic prediction models

Three genomic prediction models were evaluated. Model 1 (M1) was based on the GCA-model developed by González-Diéguez et al. (2021). Here, it is assumed that each hybrid was produced from completely inbred parental lines. The maternal lines belonged to heterotic group 1, and the paternal lines belonged to heterotic group 2. Genotypes of the two-way crosses were imputed from the genotypes of the MS and NR lines, and these genotypes were used in the calculation of genomic relationship matrices. If a SNP marker was heterozygous in a parental line (alleles B₁b₁ in group 1 or B₂b₂ in group 2), it was randomly assigned to one of the two homozygous genotypes (B₁B₁/b₁b₁ or B₂B₂/b₂b₂, respectively).

In model 2 (M2) and model 3 (M3), the genotypes of the MS and NR lines were used directly in the calculations in order to better utilize the genomic relationship between the lines within group 1. In M2, the R lines and the two-way crosses could have arbitrary levels of inbreeding, while the MS and NR lines were assumed completely inbred. Any heterozygous SNPs in the MS and NR lines were randomly assigned to one of the two homozygous genotypes. In M3, the R lines, the two-way crosses, and the MS and NR lines could all have arbitrary levels of inbreeding. If all parental components are completely inbred, then M2 and M3 are equivalent to M1.

Thus, the M1 model was:

\begin{array}{l} \begin{array}{l} y = X b + T_{1} ɡ_{A_{(1)}} + T_{2} ɡ_{A_{(2)}} + T_{3} ɡ_{D} + T_{1} ɡ_{A A_{(1)}} + T_{2} ɡ_{A A_{(2)}} + T_{3} ɡ_{A A_{(3)}} \\ + \\ T_{1} \\ r_{(1)} \\ + \\ T_{2} \\ r_{(2)} \\ + \\ T_{3} \\ r_{(3)} \\ + \\ T_{4} \\ k \\ + \\ T_{5} \\ l \\ + \\ T_{6} \\ m \\ + \\ e \end{array} & (1) \end{array}

where y is the vector of phenotypes of the hybrids; X is the design matrix for fixed effects (year x location x trial. For rye, treatment was included as a second fixed effect); b is the vector of fixed effects; T₁ and T₂ are design matrices to assign hybrids to their parental lines in heterotic group 1 and 2, $ɡ_{A_{(1)}}$ and $ɡ_{A_{(2)}}$ are vectors of additive genetic effects from parental lines from group 1 and 2, respectively, with $ɡ_{A_{(1)}} \sim N (0, G_{A_{(1)}} σ_{A_{(1)}}^{2})$ and $ɡ_{A_{(2)}} \sim N (0, G_{A_{(2)}} σ_{A_{(2)}}^{2})$ , where $σ_{A_{(1)}}^{2}$ and $σ_{A_{(2)}}^{2}$ are additive genetic variances and $G_{A_{(1)}}$ and $G_{A_{(2)}}$ are additive genomic relationship matrices; $ɡ_{A A_{(1)}}$ and $ɡ_{A A_{(2)}}$ are vectors of additive-by-additive epistatic effects within heterotic group 1 and 2, respectively, with $ɡ_{A A_{(1)}} \sim N (0, G_{A A_{(1)}} σ_{A A_{(1)}}^{2})$ and $G_{A A_{(2)}} \sim N (0, G_{A A_{(2)}} σ_{A A_{(2)}}^{2})$ , where $σ_{A A_{(1)}}^{2}$ and $σ_{A A_{(2)}}^{2}$ are epistatic genetic variances within each heterotic group and $G_{A A_{(1)}}$ and $G_{A A_{(2)}}$ are within-group epistatic genomic relationship matrices; T₃ is the design matrix for the effects of the hybrids; $ɡ_{A A_{(3)}}$ is the vector of additive-by-additive epistatic effects between alleles from heterotic group 1 and 2, respectively, with $ɡ_{A A_{(3)}} \sim N (0, G_{A A_{(3)}} σ_{A A_{(3)}}^{2})$ , where $σ_{A A_{(3)}}^{2}$ is epistatic genetic variance between the heterotic groups and $G_{A A_{(3)}}$ is the across-group epistatic genomic relationship matrix; g_D is the vector of genetic dominance deviations due to within locus interactions between alleles from different heterotic groups with $ɡ_{D} \sim N (0, D σ_{D}^{2})$ , where $σ_{D}^{2}$ is genetic dominance variance and D is the dominance relationship matrix across hybrids; r₍₁₎, r₍₂₎ and r₍₃₎ are vectors of residual genetic effects of lines from group 1 and 2 and of the hybrids, respectively, with $r_{(1)} \sim N (0, I_{r_{(1)}} σ_{r_{(1)}}^{2})$ , $r_{(2)} \sim N (0, I_{r_{(2)}} σ_{r_{(2)}}^{2})$ and $r_{(3)} \sim N (0, I_{r_{(3)}} σ_{r_{(3)}}^{2})$ , where $I_{r_{(1)}}$ , $I_{r_{(2)}}$ , and $I_{r_{(3)}}$ are identity matrices and $σ_{r_{(1)}}^{2}$ , $σ_{r_{(2)}}^{2}$ and $σ_{r_{(3)}}^{2}$ are residual genetic variances; T₄, T₅, and T₆ are design matrices for random effects of interactions between year x location and maternal parent, paternal parent or treatment (only included for rye), respectively, and k, l, and m are the vectors of the random effects of the interactions with $k \sim N (0, I_{k} σ_{k}^{2})$ , $l \sim N (0, I_{l} σ_{l}^{2})$ , and $m \sim N (0, I_{m} σ_{m}^{2})$ , where I_k, I_l, and I_m are identity matrices and $σ_{k}^{2}$ , $σ_{l}^{2}$ , and $σ_{m}^{2}$ , are variances for the interactions; e is the vector of random residual effects with $e \sim N (0, I_{e} σ_{e}^{2})$ , where I_e is an identity matrix and $σ_{e}^{2}$ is residual variance.

For M1, genomic relationship matrices were calculated as proposed by González-Diéguez et al. (2021):

Additive genomic relationship matrix for heterotic group 1:

\begin{array}{l} G_{A_{(1)}} = \frac{Z_{1} Z_{1}^{'}}{\sum_{i}^{n s n p} p_{1_{i}} q_{1_{i}}} & (2) \end{array}

where $p_{1_{i}}$ and $q_{1_{i}}$ are the frequencies of allele $B_{1_{i}}$ and $b_{1_{i}}$ for the i^th marker, respectively, and Z₁ = M₁ - P₁; M₁ is a matrix with genotypes of parental lines in group 1 coded as 0 for genotype b₁b₁ and 1 for genotype B₁B₁ for each marker; P₁ is a matrix where each column contains the allele frequencies of $B_{1}$ , and nsnp is number of markers.

Additive-by-additive epistatic relationship matrix for lines within group 1 was calculated as the Hadamard product of the additive genomic relationship matrix for group 1 scaled by the trace of the resulting matrix divided by the number of lines in group 1 to get an average diagonal of 1:

\begin{array}{l} G_{A A_{(1)}} = \frac{G_{A_{(1)}} ⨀ G_{A_{(1)}}}{t r (G_{A_{(1)}} ⨀ G_{A_{(1)}}) / n_{1}} & (3) \end{array}

The additive and epistatic genomic relationship matrices for heterotic group 2 were calculated in same way as for group 1.

Additive-by-additive epistatic relationship matrix between lines in group 1 and 2:

\begin{array}{l} G_{A A_{(3)}} = \frac{T_{1} G_{A_{(1)}} T_{1}^{'} ⨀ T_{2} G_{A_{(2)}} T_{2}^{'}}{t r (T_{1} G_{A_{(1)}} T_{1}^{'} ⨀ T_{2} G_{A_{(2)}} T_{2}^{'}) / n_{H}} & (4) \end{array}

where n_H is the number of hybrids. The matrices $G_{A A_{(3)}}$ and D can both include realized hybrids as well as all potential crosses of the parental lines, so the crosses with the largest effects can be predicted even though they are not yet phenotypically tested.

Dominance relationship matrix of dominance interactions between alleles from different heterotic groups:

\begin{array}{l} D = \frac{W_{1} W_{1}^{'}}{\sum_{i}^{n s n p} (4 p_{1_{i}} q_{1_{i}} p_{2_{i}} q_{2_{i}})} & (5) \end{array}

where $p_{1_{i}}$ , $q_{1_{i}}$ , $p_{2_{i}}$ and $q_{2_{i}}$ are the frequencies of the alleles $B_{1_{i}}$ and $b_{1_{i}}$ in heterotic group 1 and $B_{2_{i}}$ and $b_{2_{i}}$ in heterotic group 2 for the i^th marker, respectively, and W₁ is a matrix with a row for each hybrid and a column for each marker (González-Diéguez et al., 2021). The elements of W₁ are shown in Table 2.

TABLE 2

Table 2 Elements of W₁, W₂, and W₃ for each marker in the hybrids from crosses between parental lines from group 1 and group 2, which are used in the calculation of the dominance relationship matrix for M1, M2, and M3, respectively*.

It should be noted that the mean heterosis of the hybrids is not estimated separately in the model but is included in the overall mean of the hybrid phenotypes. Thus, the across-group epistatic and dominance effects that are estimated are deviations of individual hybrids from the mean heterosis.

In M2, paternal R lines and maternal two-way crosses could have arbitrary levels of inbreeding, while MS and NR lines were assumed completely inbred. Genotypes of MS and NR were used for the calculation of additive and epistatic genomic relationship matrices for heterotic group 1. If an MS and NR lines had the same genotypes for all SNPs, it was only included once in the relationship matrices.

Thus, the M2 model was:

\begin{array}{l} \begin{array}{l} y = X b + (T_{7} + T_{8}) ɡ_{A_{(1, 1)}} + T_{2} ɡ_{A_{(2)}} + T_{3} ɡ_{D} + (T_{7} + T_{8}) ɡ_{A A_{(1, 1)}} + T_{2} ɡ_{A A_{(2)}} + T_{3} ɡ_{A A_{(3)}} \\ + \\ ( \\ T_{7} \\ + \\ T_{8} \\ ) \\ r_{(1, 1)} \\ + \\ T_{2} \\ r_{(2)} \\ + \\ T_{3} \\ r_{(3)} \\ + \\ T_{4} \\ k \\ + \\ T_{5} \\ l \\ + \\ T_{6} \\ m \\ + \\ e \end{array} & (6) \end{array}

where y is the vector of phenotypes of the three-way hybrids; T₇ and T₈ are design matrices for MS and NR, respectively; $ɡ_{A_{(1, 1)}}$ , $ɡ_{A A_{(1, 1)}}$ , and $r_{A_{(1, 1)}}$ are vectors of additive, epistatic, and residual genetic effects for both MS and NR, respectively, with $ɡ_{A_{(1, 1)}} \sim N (0, \frac{1}{2} G_{A_{(1, 1)}} σ_{A_{(1, 1)}}^{2})$ , $ɡ_{A A_{(1, 1)}} \sim N (0, G_{A A_{(1, 1)}} σ_{A A_{(1, 1)}}^{2})$ , and $r_{(1, 1)} \sim N (0, I_{r_{(1, 1)}} σ_{r_{(1, 1)}}^{2})$ , where $σ_{A_{(1, 1)}}^{2}$ , $σ_{A A_{(1, 1)}}^{2}$ and $σ_{r_{(1, 1)}}^{2}$ are additive, within-group epistatic and residual genetic variances for MS and NR, and $G_{A_{(1, 1)}}$ and $G_{A A_{(1, 1)}}$ are additive and epistatic genomic relationship matrices, and $I_{r_{(1, 1)}}$ is an identity matrix. $G_{A_{(1, 1)}}$ was scaled by ½ to account for the first cross between MS and NR, which produced the two-way cross. Additionally, M₂, which was used in the calculation of the additive genomic relationship matrix for group 2 ( $G_{A_{(2)}}$ ) now included heterozygous genotypes B₂b₂ coded as 0.5. The marker matrix for the dominance relationship matrix, W₂, was extended to account for heterozygous genotypes in the two-way crosses and in the R lines, which now have twelve possible crossing combinations instead of four in M1 (Table 2). The additive-by-additive epistatic relationship matrix between lines in group 1 and 2 was calculated as:

\begin{array}{l} G_{A A_{(3)}} = \frac{(T_{7} + T_{8}) G_{A_{(1, 1)}} {(T_{7} + T_{8})}^{'} ⨀ T_{2} G_{A_{(2)}} T_{2}^{'}}{t r ((T_{7} + T_{8}) G_{A_{(1, 1)}} {(T_{7} + T_{8})}^{'} ⨀ T_{2} G_{A_{(2)}} T_{2}^{'}) / n_{H}} & (7) \end{array}

In M3, the same model parameters were used as for M2 (Equation 6), but now every parental line (MS, NR, two-way crosses, and R) could have arbitrary levels of inbreeding. Therefore, M₁, which was used in the calculation of the additive genomic relationship matrix for MS and NR ( $G_{A_{(1, 1)}}$ ) included heterozygous genotypes B₁b₁ coded as 0.5. The marker matrix for the dominance relationship matrix, W₃, was further extended to account for heterozygous genotypes in all parental lines, which now have 27 possible crossing combinations (Table 2).

2.3 Estimation of variance components and heritabilities

Variance components for the random effects included in M1, M2, and M3 were estimated by restricted maximum likelihood using the software package DMU (Madsen and Jensen, 2013). Estimated genetic variances were multiplied with D_K (the mean of the diagonal of the respective relationship matrix minus the overall mean of the matrix) in order to account for the lack of Hardy-Weinberg equilibrium (Legarra, 2016; Vitezica et al., 2017). Narrow-sense heritabilities were calculated as the sum of additive genetic variances divided by total phenotypic variance, and broad-sense heritabilities were calculated as sum of additive and non-additive genetic variances divided by total phenotypic variance. Heritabilities were calculated both at plot level and at entry mean level of the three-way hybrids, i.e. based on the mean of all plot records for each three-way hybrid.

Phenotypic variance at plot level, ${\hat{σ}}_{p_{p l o t}}^{2}$ , was calculated as:

\begin{array}{l} {\hat{σ}}_{p_{p l o t}}^{2} = {\hat{σ}}_{ɡ_{A}}^{2} + {\hat{σ}}_{ɡ_{A A}}^{2} + {\hat{σ}}_{ɡ_{D}}^{2} + {\hat{σ}}_{r}^{2} + {\hat{σ}}_{k}^{2} + {\hat{σ}}_{l}^{2} + {\hat{σ}}_{m}^{2} + {\hat{σ}}_{e}^{2} & (8) \end{array}

where ${\hat{σ}}_{ɡ_{A}}^{2}$ is the estimated sum of additive genetic variances for group 1 and 2, ${\hat{σ}}_{ɡ_{A A}}^{2}$ is the estimated sum of epistatic genetic variances within and across group 1 and 2, ${\hat{σ}}_{ɡ_{D}}^{2}$ is estimated dominance genetic variance, ${\hat{σ}}_{r}^{2}$ is the estimated sum of residual genetic variances of group 1, group 2 and of the hybrids, ${\hat{σ}}_{k}^{2}$ , ${\hat{σ}}_{l}^{2},$ and ${\hat{σ}}_{m}^{2}$ are estimated variances for the year x location interactions defined above for M1, and ${\hat{σ}}_{e}^{2}$ is estimated residual variance.

Phenotypic variance at entry mean level of the three-way hybrids, ${\hat{σ}}_{p_{e n t r y}}^{2}$ , was calculated as:

\begin{array}{l} {\hat{σ}}_{p_{e n t r y}}^{2} = {\hat{σ}}_{ɡ_{A}}^{2} + {\hat{σ}}_{ɡ_{A A}}^{2} + {\hat{σ}}_{ɡ_{D}}^{2} + {\hat{σ}}_{r}^{2} + \frac{{\hat{σ}}_{k}^{2}}{n_{k}} + \frac{{\hat{σ}}_{l}^{2}}{n_{l}} + \frac{{\hat{σ}}_{m}^{2}}{n_{m}} + \frac{{\hat{σ}}_{e}^{2}}{n_{e}} & (9) \end{array}

where n_k is average number of year x location observed per two-way cross, n_l is average number of year x locations observed per R line, n_m is average number of observations per year x location x treatment interaction (only included for rye), and n_e is average number of observations per three-way hybrid.

2.4 Cross-validation strategies and predictive abilities

Predictive abilities of the models were evaluated using four different leave-one-out cross-validation strategies. Phenotypes of three-way hybrids were left out from the training set based on their parental components and predicted based on the remaining data. In the four cross-validations, the phenotypes were left out based on I) the maternal two-way cross, II) the paternal R line, III) the specific combination of the two parents (three-way hybrid), or IV) based on the breeding cycle of the R lines. Predictive abilities were then defined as the correlation between estimated genetic effects and the phenotype corrected for all other effects, which is equivalent to the genetic effects estimated from the full model plus the residual effects. The strategies were chosen to evaluate predictive abilities for total genetic values, for GCA of each parental line, and for SCA. For total genetic values, correlations were calculated based on the sum of all genetic effects, and for GCA and SCA, correlations were based only on the effects of the component that was left out in the cross-validation strategy. GCA was estimated as the sum of additive genetic effects and with-in group epistasis of parental components from each heterotic group, and SCA was estimated as across-group epistasis and dominance of the three-way hybrids. Furthermore, correlations were calculated at plot level and at entry mean level of the three-ways hybrids.

3 Results

3.1 Phenotyping and genotyping

For rye, 570 three-way hybrids were phenotyped for grain yield with a total of 12,326 plot observations from seven years. For sugar beet, 657 three-way hybrids were phenotyped for root yield with a total of 47,703 plot observations from eleven years. The distributions of the phenotypes are shown in Figure 1, and for both traits, they were approximately normally distributed. The average rye grain yield was 8.7 t/ha with a coefficient of phenotypic variation of 21.1%, and the average sugar beet root yield was 82.4 t/ha with a coefficient of phenotypic variation of 25.8%.

FIGURE 1

Figure 1 Histograms of phenotypic observations for (A) rye grain yield and (B) sugar beet root yield.

The number of parental components used for producing the hybrids are shown in Table 1. The MS, NR and R lines had been inbred for several generations and had a high degree of homozygosity based on SNP markers (mean from 88 to 96%, Table 3). The homozygosity based on the SNPs might be different from the homozygosity of QTL, because each QTL is most likely not in complete linkage disequilibrium with one SNP. The two-way crosses were produced by crossing MS and NR lines belonging to the same heterotic group, and their mean homozygosity estimated from the parental genotypes was therefore relatively high (70% and 86% for sugar beet and rye, respectively). Each two-way cross was a mix of plants that were homozygous and plants that were heterozygous for SNPs that were heterozygous in at least one of the parental lines (mean of 7% and 23% of the SNPs for rye and sugar beet, respectively).

TABLE 3

Table 3 Mean homozygosity and heterozygosity of parental components estimated based on SNP markers.

Plots of the first two principal components from a principal component analysis of the SNP genotypes of the parental lines (explaining 39.6% and 5.2% of the variance for rye, and 55.4% and 4.4% for sugar beet) showed that the MS and NR lines were located together in one small group for both crops, while the R lines formed another and more diverse group (Figure 2). For rye, the two groups were clearly separated, while there was a small overlap between the groups for sugar beet.

FIGURE 2

Figure 2 Principal component analysis for (A) rye and (B) sugar beet MS, NR (red circles) and R lines (blue circles) based on their SNP markers.

3.2 Genetic variances and heritabilities

The estimated variance components for the three genomic prediction models M1, M2, and M3 that differed in the assumptions about inbreeding of the parental components are shown in Figure 3. For both crops, the differences in the estimated variances were relatively small when comparing M1, M2, and M3. For M1, the additive genetic variance for R lines was higher, and the additive genetic variance for MS and NR was lower, compared to the estimated variances for M2 and M3. For grain yield in rye, the majority of the phenotypic variance of the three-way hybrids could be explained by additive genetic variance in the R lines (57% for M1 and 50% for M2 and M3). Additive genetic variance of MS and NR and epistatic variances of R lines and of three-way hybrids explained similar, but smaller proportions of the total variance, ranging from 5% to 11%. For sugar beet root yield, additive genetic variances explained large proportions of the total variance for both MS and NR and for R (41% and 36%, respectively, based on M1, and 43% and 30% based on M2 and M3). For M1, variances for epistatic and dominance effects in the three-way hybrids explained equal proportions, while the epistatic variance explained a higher proportion in M2 and M3, and the dominance variance in the three-way hybrids explained less than 1%.

FIGURE 3

Figure 3 Plots of relative variance components for (A) rye grain yield and for (B) sugar beet root yield estimated using M1 (blue), M2 (grey), and M3 (yellow). Variances of additive (a), additive-by-additive (aa), dominance (d) and residual genetic effects (l) in MS+NR, R lines and in three-way hybrids (H), interaction effects between parental components and year-location (MSNR-yl and R-yl), year-location-treatment (ylt) and residuals (e) shown as proportions of total phenotypic variance at entry mean level.

The estimated heritabilities of both traits were high at entry mean level and intermediate at plot level (Table 4). The heritabilities estimated based on the three models were very similar, however the broad-sense heritabilities were slightly higher based on M2 and M3 than based on M1. For sugar beet, the differences between the broad-sense and narrow-sense heritabilities were slightly larger based on M2 and M3 than based on M1. Genetic variances of GCA of MS+NR lines were considerably lower than variances of GCA of R lines for rye, while variances of GCA of MS+NR and of R lines were similar for sugar beet. Genetic variances of SCA were lower than variances of GCA for both crops.

TABLE 4

Table 4 Broad-sense (H²) and narrow-sense (h²) heritabilities estimated from M1, M2 and M3 at entry mean level of the three-way hybrids or at plot level, and estimated genetic variances of GCA for each heterotic group ( $σ_{G C A_{(M S + N R)}}^{2}$ and $σ_{G C A_{(R)}}^{2}$ ) and SCA ( $σ_{S C A}^{2}$ ).

3.3 Genomic predictions

For all three genomic prediction models, the four cross-validation strategies resulted in high predictive abilities of hybrid performance at plot level and especially at entry mean level for both rye and sugar beet (Figures 4, 5). Prediction accuracies of total genetic values were up to 0.55 at plot level and 0.88 at entry mean level for M3 based on the leave three-way hybrid out cross-validations for rye and 0.47 at plot level and 0.89 at entry mean level for sugar beet. The prediction of non-additive effects in the three-way hybrids (SCA) resulted in lower predictive abilities of 0.10 at plot level and 0.32 at entry mean level for M3 for rye and of 0.13 at plot level and 0.50 at entry mean level for sugar beet. Differences between predictive abilities at plot level and at entry mean level were larger for sugar beet than for rye due to more observations per hybrid for sugar beet (average of 21.6 observations for rye and average of 72.6 observations for sugar beet). The differences between the predictions based on the three models were very small, and in most cases, M2 and M3 performed equally well or slightly better than M1. The largest significant differences between the models were for prediction of SCA, where the predictive ability increased from 0.10 at plot level for M1 to 0.13 for M3 for sugar beet and from 0.08 for M1 to 0.11 for M2 for rye. Differences in predictive abilities based on the four cross-validation strategies were mainly found for predictions of GCA or SCA rather than for prediction of total genetic values (Figures 4, 5C, D). In rye, the predictive ability was lower for GCA of MS+NR lines and for SCA than for GCA of R lines. In sugar beet, the predictive ability was lower for SCA than for GCA of both parental groups. The leave-breeding cycle-out cross-validation resulted in lower predictive abilities of total genetic values, particularly for rye.

FIGURE 4

Figure 4 Correlations for grain yield in rye based on M1 (blue), M2 (grey) and M3 (yellow) between total genetic values and phenotypes corrected for non-genetic effects at plot level (A) and at entry mean level (B), and between the genetic effects of the component left out in each of the four cross-validations and phenotypes corrected for other effects at plot level (C) and at entry mean level (D). Asterisks above the bars represent significant differences between the correlations (p-value< 0.05/3).

FIGURE 5

Figure 5 Correlations for root yield in sugar beet based on M1 (blue), M2 (grey) and M3 (yellow) between total genetic values and phenotypes corrected for non-genetic effects at plot level (A) and at entry mean level (B), and between the genetic effects of the component left out in each of the four cross-validations and phenotypes corrected for other effects at plot level (C) and at entry mean level (D). Asterisks above the bars represent significant differences between the correlations (p-value< 0.05/3).

4 Discussion

4.1 Genetic variances and heritabilities

Genetic variance components of additive, epistatic and dominance deviations were estimated based on the genomic models. For rye, the majority of genetic variance for grain yield was due to additive effects from the R lines, while variances of additive genetic effects of the maternal lines and of non-additive genetic deviations were low. The reason for the low genetic variance of maternal lines might be that the tested three-way hybrids were produced from a relatively small number of different maternal lines, and there was less genetic variation between these lines based on the principal component analysis of the SNPs compared to the variation between R lines (Figure 2). For sugar beet, the number of maternal lines was higher, and there was more genetic variation between them based on the principal component analysis of the SNPs and also based on the estimated genetic variances for root yield. For both rye and sugar beet, genetic variance of SCA was considerably lower than variance of GCA, which is in agreement with other studies of yield in hybrid crops (Technow et al., 2014; Wang et al., 2017; Werner et al., 2018; González-Diéguez et al., 2021). However, this does not mean that the overall heterotic effects in the hybrids are small, but that the variance for these effects are low between the hybrids compared to the variance of additive genetic effects or that part of non-additive variances is captured as additive (Reif et al., 2007; Huang and Mackay, 2016).

The entry mean heritabilities for rye grain yield and sugar beet root yield were considerably higher than heritabilities at plot level due to a high average number of observations per hybrid in the datasets. The residual variances were therefore relatively low, when corrected for the average number of observations. Similarly, the correlations between estimated genetic effects and the corrected phenotypes at entry mean level were also higher than correlations based on corrected phenotypes at plot level due to the high average number of observations per hybrid. Entry mean heritabilities in the same high range as in the current study have been reported in previous studies of grain yield in rye (Wang et al., 2014; Auinger et al., 2016; Schulthess et al., 2016). For sugar beet, the differences between broad- and narrow-sense heritabilities based on the three genomic models indicated that a slightly smaller part of the genetic variance was captured as additive in models M2 and M3 accounting for incomplete inbreeding compared to M1. The degree of mean heterozygosity in the parental lines of the hybrids were higher in sugar beet than in rye (Table 3). Thus, larger differences between the estimated variance components and between the predictive abilities of the three models were expected for sugar beet than for rye. The estimates of residual genetic variance were slightly higher for R lines in rye and for three-way hybrids in sugar beet based on M2 and M3 than M1, which could be due to the heterozygosity of SNPs in the parental lines that was included in M2 and M3. Hybrids produced from parents that are heterozygous for some loci, will be a mixture of plants that are homozygous and plants that are heterozygous for those loci, and thereby the effects of the loci are difficult to estimate correctly.

The models M1, M2, and M3 are equivalent if all parental components are completely inbred. In M2 and M3, the genotypes of MS and NR parental lines were directly used in calculations of the genomic relationship matrices, and the heterozygosity of the two-way crosses between them were accounted for. Unlike in M1, which was developed for two-way hybrids. However, the differences between the variance components estimated from the three models were small, especially for rye, due to the low degree of heterozygosity in both parental groups of the evaluated breeding material. Larger differences between the estimates of the models would be expected for breeding programs with higher degrees of heterozygosity in the parental components, e.g. if synthetics are used as restorers for top-cross hybrids (Siekmann et al., 2021; Hackauf et al., 2022).

4.2 Partitioning of genetic variance

The sum of genetic variances estimated based on the three models were similar. However, for M1 the additive variance of the MS+NR lines was lower, and the additive variance of R lines was higher than for M2 and M3. In model M1 that assumes complete inbreeding of parental lines, the genomic relationship matrices all have a mean diagonal equal to 1 and an overall mean very close to 0. Therefore, the estimates of the variance components can be interpreted as genetic variances, although the genotypes are not in Hardy-Weinberg equilibrium (Legarra, 2016; González-Diéguez et al., 2021). However, when including heterozygous marker genotypes in M2 and M3, this no longer holds true. Therefore, estimated variance components were be multiplied with D_k (the mean of the diagonal of the respective relationship matrix minus the overall mean of the matrix) to be practically interpretable as genetic variances of the groups in the breeding programs (Legarra, 2016; Vitezica et al., 2017). Correct partitioning of genetic variances into additive and non-additive variances is important in order to make informed decisions in breeding programs that might affect short- or long-term genetic gain of GCA and SCA (Allier et al., 2019). If all QTL for additive and non-additive effects are assumed to be in linkage equilibrium, genetic variances can be estimated orthogonally, and it should thereby be possible to partition genetic variances correctly into additive, epistatic and dominance variances using the genomic models (Appendix 1) (Cockerham et al., 1954). However, when using empirical data, the assumption of linkage equilibrium is rarely true, and therefore estimated variances of different genetic effects might be correlated. Consequently, the partitioning of genetic variances may change depending on which parameters are included in the models (González-Diéguez et al., 2021; Raffo et al., 2022). Thus, estimated genetic variances should be carefully interpreted, as they might not completely reflect the corresponding underlying biological additive and non-additive gene actions (Huang and Mackay, 2016).

4.3 Dominance and epistatic deviations

High predictive abilities have been reported in studies of hybrid rye (Wang et al., 2014; Auinger et al., 2016) and sugar beet (Hofheinz et al., 2012; Würschum et al., 2013) that used genomic prediction models, where non-additive effects were not explicitly included. This indicates that there is low variance for the non-additive genetic effects or that non-additive genetic effects can be partially captured as additive (Huang and Mackay, 2016; Vitezica et al., 2017), which is in accordance with the results of the current study. Including dominance in genomic prediction models has been shown to result in similar or improved predictive abilities. The improvement depends on the ratio between additive and dominance variances of the trait and on the variability of inbreeding in the studied populations (Nishio and Satoh, 2014; Zhao et al., 2014; Duenk et al., 2017; Wang et al., 2017; Werner et al., 2018; Ramstein et al., 2020; Roth et al., 2022).

Different conclusions have been reached in studies of the effect of including epistasis in genomic prediction models. For wheat, improvements in predictive abilities have been reported when epistasis was included in addition to additive genetic effects (Jiang and Reif, 2015; He et al., 2016; Raffo et al., 2022). For maize, predictive abilities were reported to be similar for models with or without inclusion of epistasis (Jiang and Reif, 2015; González-Diéguez et al., 2021). For other species, examples of reductions in predictive abilities have been reported, when epistasis was modelled (Lorenzana and Bernardo, 2009; Forneris et al., 2017). The parameterization of genetic effects and the degree of linkage disequilibrium between markers and QTL can affect how genetic effects are captured and partitioned in the models (Huang and Mackay, 2016; Schrauf et al., 2020). Thus, including epistasis can potentially improve prediction models, but the effect depends on the species, the genomic relationships of the studied populations, the genetic architecture of the trait, and on marker densities.

Dominance and epistatic deviations were included in the genomic prediction models of the current study. Accurate partitioning of the genetic variances of these non-additive deviations is challenging, because their genomic relationship matrices were highly correlated both in the current study and in González-Diéguez et al. (2021). However, the non-additive genetic variances were small compared to the additive genetic variances. Besides potentially improving predictive abilities, an advantage of including non-additive deviations in the models is that it enables predictions of the best combinations of parental components to produce hybrids. This can especially be helpful for hybrid breeding programs, where there is large genetic variance for SCA.

4.4 Cross-validation strategies and predictive abilities

Four different cross-validation strategies were used to evaluate the predictive ability of the three models for GCA, SCA and total genetic value in different scenarios. Prediction of GCA of parental components is important for selection of lines within heterotic groups, and prediction of SCA and total genetic value of hybrids is important for selection of the optimal combinations of parental lines. The cross-validation strategies leave two-way cross out and leave R line out were used to study the predictive abilities for GCA of untested parental components from each heterotic group. The leave three-way out strategy was used to study predictive ability for SCA, when parental components had been tested in other combinations. These cross-validations reflect the potential of the models in scenarios, where half-sibs and full-sibs of untested components are included in the training set. However, breeders are often interested in predicting genetic values of untested lines based only on previous breeding cycles (Auinger et al., 2016). Thus, leave breeding cycle out cross-validations were used to study the predictive abilities for total genetic values of hybrids and for GCA of R lines within each breeding cycle based on the remaining cycles.

Predictive abilities based on the leave breeding cycle out cross-validations were lower than predictive abilities based on the other cross-validation strategies, because larger parts of the phenotypic data were left out of the training set, and because the genomic relationships between components in training and test sets were lower. However, the reduction in predictive abilities was not as large as in other studies, where similar cross-validation strategies were used on data from barley and wheat breeding programs (Nielsen et al., 2016; Kristensen et al., 2018; Raffo et al., 2022). A reason for this could be that breeding programs for hybrid crops mainly use internally developed lines as crossing parents for new breeding cycles, while breeding programs for line cultivars commonly include cultivars developed in external breeding programs as crossing parents in addition to their internally developed lines (Lüttringhaus et al., 2020). Thus, genomic relationships between lines in training set and lines from new, untested breeding cycles would be expected to be higher within hybrid breeding programs. Another reason could be the high number of breeding cycles in the training data. The reduction in predictive ability based on the leave breeding cycle out cross-validation was lower for sugar beet than for rye, which could indicate that having data from a higher number of breeding cycles would lead to higher predictive abilities (Auinger et al., 2016; Bernal-Vasquez et al., 2017). However, when many breeding cycles are included, predictive abilities based on this cross-validation strategy might be inflated if some lines have been used as crossing parents for new breeding cycles and data based on their offspring was included in the training set. Furthermore, predicted genetic values might be inflated, because linkage disequilibrium between markers and QTL erodes over several generations (Boichard et al., 2022).

4.5 Models accounting for incomplete inbreeding

The predictive abilities for total genetic values of hybrids were equally high for the three genomic models for both rye and sugar beet. The largest differences in predictive abilities between the models were for GCA of MS+NR lines and for SCA, while predictive abilities for GCA of R lines were very similar for the models. This was as expected since R lines were almost completely homozygous and thus better comply with the assumption of complete inbreeding in M1. The two-way crosses were more heterozygous, and the extended models M2 and M3 could therefore capture GCA of MS+NR and SCA more accurately than M1. Even though the differences between predictive abilities of the three models were quite small, it would be advantageous to use M2 or M3 over M1 for breeding of three-way hybrids, because these models enable prediction of GCA for both MS and NR lines and not only for their two-way crosses as in M1. Additionally, models that account for incomplete inbreeding can be useful in a wider range of real or simulated breeding schemes, where lines are more heterozygous than in the datasets used here. For example, genetic values might be estimated more accurately for lines in early generations, before they have reached a high degree of homozygosity (Bernal-Vasquez et al., 2017). Thereby, lines can be selected as crossing parents earlier, and the generation time of breeding cycles can be reduced, which could lead to higher genetic gains.

The three-way rye hybrids evaluated in the current study were based on the Gülzow (G) type cytoplasmic male sterility (CMS) (Melz et al., 2003; Vendelbo et al., 2021b). The frequency of non-restoration alleles for the G type system is low in Central European rye germplasm, which makes it challenging to increase genetic variation of NR lines for breeding (Łapiński and Stojałowski, 2003; Hackauf et al., 2022). For the Owen type CMS in sugar beet, non-restoration alleles are rare in most populations as well (Moritani et al., 2013). Consequently, the genetic variation of the heterotic group of MS and NR lines was low compared to the heterotic group of R lines in both crops, and the degree of homozygosity of the maternal two-way hybrids were high, especially in rye. Thereby, the experimental three-way hybrids resemble two-way hybrids to a large extent. The predominant hybridization system in rye is based on the Pampa (P) type CMS. In contrast to the G type CMS system, non-restoration alleles are common, while restoration alleles are rare for the P type system. Additionally, synthetic restorers from two inbred lines are used for the production of the commercial top-cross hybrids. Thus, the average degree of heterozygosity in restorers of the P type based breeding systems is higher than in the R lines of the current study (Siekmann et al., 2021). The models accounting for incomplete inbreeding could thereby be advantageous to use, not only for three-way hybrids programs, but also for breeding programs based on the P type CMS system.

5 Conclusion

Three genomic models for predicting hybrid performance were evaluated based on data from two commercial breeding programs. The models can be used to predict GCA of parental lines within each of two heterotic groups and to predict SCA of realized and potential three-way hybrids. Estimated genetic variances of GCA (additive and within-group epistasis) were considerably larger than variances of SCA (across-group epistasis and dominance) for both grain yield in rye and root yield in sugar beet. For rye, variance of GCA of R lines were larger than variance of GCA of MS+NR lines, while variances of GCA of both parental groups were more similar for sugar beet. Average levels of heterozygosity in parental components were low, and therefore, the differences between the three models were small. The predictive ability of model M1, which assumes complete inbreeding in parental lines, was similar to or lower than the predictive ability of the extended models M2 and M3 accounting for incomplete inbreeding in parental lines of three-way hybrids. The predictive abilities of the three models were similar for predicting total genetic values of hybrids and for GCA of R lines. For prediction of GCA of MS+NR lines in sugar beet and of SCA in both crops, predictive abilities significantly improved when using the extended models compared to using M1. Promising NR lines can potentially be selected for producing new MS lines via backcrossing at earlier stages in the breeding programs, because the extended models enable prediction of GCA for both MS and NR lines. Additionally, NR or R lines with high GCA can be selected as crossing parents for new breeding cycles within each heterotic group.

The predictive ability of the models was high for prediction of hybrid performance (total genetic value), and the models can therefore be a valuable tool for selecting the most promising parental lines to produce new hybrids. Due to the relatively narrow genetic variation within the heterotic group of MS+NR lines, the differences in performance of hybrids will mainly be affected by the GCA of their parental R lines for grain yield in rye. However, other traits such as flowering time, plant height, and disease resistances should also be considered when selecting the optimal combinations of parents. For root yield in sugar beet, performance of hybrids will be more equally affected by GCA of parents from both heterotic groups and to a smaller extent by the SCA of the combinations.

The models developed here are suitable for a wide range of hybrid breeding programs, where the parental lines can have any level of inbreeding. Thus, the genomic prediction models might improve breeding programs for hybrid crops by facilitating selection of lines within heterotic groups as well as selection of best combinations of lines across groups for the production of new hybrids.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

JJ, AJ, JH, LR, PSn, JO, PSa, and PK contributed to conception and design of the study. Data was collected and curated by PSa, JO, LR, PSn, DF, PK, and MM. PK and PSa performed statistical analysis. PK, PSa, JJ, DF, and TC contributed to choosing statistical methods and interpreting results. PK wrote the first draft of the manuscript, and all authors contributed to manuscript revision.

Funding

The research was funded by Ministry of Food, Agriculture and Fisheries of Denmark under the Green Development and Demonstrations Program (grant no. 34009-19-1603).

Acknowledgments

We would like to thank the breeding assistants and technicians at Nordic Seed and DLF Beet Seed for supporting work in the field, glass house and laboratory, particularly Johannes Hiller, Anette Deterding, Marlene Walbrodt, and Hanne Svenstrup (Nordic Seed).

Conflict of interest

PSa, JO, and AJ were employed by Nordic Seed A/S, and MM was employed by Nordic Seed Germany GmbH. PSn, LR, and JH were employed by DLF Beet Seed AB, and DF was employed by DLF Seeds A/S.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1193433/full#supplementary-material

References

Allier, A., Lehermeier, C., Charcosset, A., Moreau, L., Teyssedre, S. (2019). Improving short- and long-term genetic gain by accounting for within-family variance in optimal cross-selection. Front. Genet. 10. doi: 10.3389/fgene.2019.01006

PubMed Abstract | CrossRef Full Text | Google Scholar

Arruda, M. P., Lipka, A. E., Brown, P. J., Krill, A. M., Thurber, C., Brown-Guedira, G., et al. (2016). Comparing genomic selection and marker-assisted selection for Fusarium head blight resistance in wheat (Triticum aestivum L.). Mol. Breed. 36 (7), 1–11. doi: 10.1007/s11032-016-0508-5

CrossRef Full Text | Google Scholar

Auinger, H. J., Schönleben, M., Lehermeier, C., Schmidt, M., Korzun, V., Geiger, H. H., et al. (2016). Model training across multiple breeding cycles significantly improves genomic prediction accuracy in rye (Secale cereale L.). Theor. Appl. Genet. 129 (11), 2043–2053. doi: 10.1007/s00122-016-2756-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Bernal-Vasquez, A. M., Gordillo, A., Schmidt, M., Piepho, H. P. (2017). Genomic prediction in early selection stages using multi-year data in a hybrid rye breeding program. BMC Genet. 18 (1), 1–17. doi: 10.1186/s12863-017-0512-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Boichard, D., Fritz, S., Croiseau, P., Ducrocq, V., Cuyabano, B., Tribout, T. (2022). “Long-distance associations generate erosion of genomic breeding values of candidates for selection,” in Proceedings of 12th World Congress on Genetics Applied to Livestock Production (WCGALP). (Wageningen, The Netherlands: Wageningen Academic Publishers).

Google Scholar

Campbell, L. G. (1990). Sugarbeet germplasm selected from the USDA collection. North Dakota. Farm. Res. 47 (6), 32–34.

Google Scholar

Cockerham, C. C. (1954). An extension of the concept of partitioning hereditary variance for analysis of covariances among relatives when epistasis is present. Genetics 39 (6), 859–882. doi: 10.1093/genetics/39.6.859

PubMed Abstract | CrossRef Full Text | Google Scholar

Crossa, J., Pérez-Rodríguez, P., Cuevas, J., Montesinos-López, O., Jarquín, D., de los Campos, G., et al. (2017). Genomic selection in plant breeding: methods, models, and perspectives. Trends Plant Sci. 22 (11), 961–975. doi: 10.1016/j.tplants.2017.08.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Duenk, P., Calus, M. P. L., Wientjes, Y. C. J., Bijma, P. (2017). Benefits of dominance over additive models for the estimation of average effects in the presence of dominance. G3 (Bethesda). 7 (10), 3405–3414. doi: 10.1534/g3.117.300113

PubMed Abstract | CrossRef Full Text | Google Scholar

Duvick, D. N. (2005). “The Contribution of Breeding to Yield Advances in maize (Zea mays L.),”. Adv. Agronomy.) 83-145, 83–145. doi: 10.1016/S0065-2113(05)86002-X

CrossRef Full Text | Google Scholar

Falconer, D. S., Mackay, T. F. C. (1996). Introduction to quantitative genetics (Harlow, England: Longman Group Ltd).

Google Scholar

Forneris, N. S., Vitezica, Z. G., Legarra, A., Perez-Enciso, M. (2017). Influence of epistasis on response to genomic selection using complete sequence data. Genet. Sel. Evol. 49 (1), 66. doi: 10.1186/s12711-017-0340-3

PubMed Abstract | CrossRef Full Text | Google Scholar

González-Diéguez, D., Legarra, A., Charcosset, A., Moreau, L., Lehermeier, C., Teyssèdre, S., et al. (2021). Genomic prediction of hybrid crops allows disentangling dominance and epistasis. Genetics 218 (1), 1–16. doi: 10.1093/GENETICS/IYAB026

CrossRef Full Text | Google Scholar

Hackauf, B., Haffke, S., Fromme, F. J., Roux, S. R., Kusterer, B., Musmann, D., et al. (2017). QTL mapping and comparative genome analysis of agronomic traits including grain yield in winter rye. Theor. Appl. Genet. 130 (9), 1801–1817. doi: 10.1007/s00122-017-2926-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Hackauf, B., Siekmann, D., Fromme, F. J. (2022). Improving yield and yield stability in winter rye by hybrid breeding. Plants 11 (19), 1–27. doi: 10.3390/plants11192666

CrossRef Full Text | Google Scholar

He, S., Schulthess, A. W., Mirdita, V., Zhao, Y., Korzun, V., Bothe, R., et al. (2016). Genomic selection in a commercial winter wheat population. Theor. Appl. Genet. 129 (3), 641–651. doi: 10.1007/s00122-015-2655-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Hofheinz, N., Borchardt, D., Weissleder, K., Frisch, M. (2012). Genome-based prediction of test cross performance in two subsequent breeding cycles. Theor. Appl. Genet. 125 (8), 1639–1645. doi: 10.1007/s00122-012-1940-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, W., Mackay, T. F. (2016). The genetic architecture of quantitative traits cannot be inferred from variance component analysis. PloS Genet. 12 (11), e1006421. doi: 10.1371/journal.pgen.1006421

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, Y., Reif, J. C. (2015). Modeling epistasis in genomic selection. Genetics 201 (2), 759–768. doi: 10.1534/genetics.115.177907

PubMed Abstract | CrossRef Full Text | Google Scholar

Kristensen, P. S., Jahoor, A., Andersen, J. R., Cericola, F., Orabi, J., Janss, L., et al. (2018). Genome-wide association studies and comparison of models and cross-validation strategies for genomic prediction of quality traits in advanced winter wheat breeding lines. Front. Plant Sci. 9. doi: 10.3389/FPLS.2018.00069

CrossRef Full Text | Google Scholar

Kristensen, P. S., Jahoor, A., Andersen, J. R., Orabi, J., Janss, L. L., Jensen, J. (2019). Multi-trait and trait-assisted genomic prediction of winter wheat quality traits using advanced lines from four breeding cycles. Crop Breeding. Genet. Genomics. 1, 1–15. doi: 10.20900/cbgg20190010

CrossRef Full Text | Google Scholar

Labroo, M. R., Studer, A. J., Rutkoski, J. E. (2021). Heterosis and hybrid crop breeding: A multidisciplinary review. Front. Genet. 12. doi: 10.3389/fgene.2021.643761

CrossRef Full Text | Google Scholar

Laidig, F., Piepho, H.-P., Rentel, D., Drobek, T., Meyer, U., Huesken, A. (2017). Breeding progress, variation, and correlation of grain and quality traits in winter rye hybrid and population varieties and national on-farm progress in Germany over 26 years. Theor. Appl. Genet. 130 (5), 981–998. doi: 10.1007/s00122-017-2865-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Łapiński, M., Stojałowski, S. (2003). Occurrence and genetic identity of male sterility-inducing cytoplasm in rye (Secale spp.). Plant Breed. Seed. Sci. 48, 7–23.

Google Scholar

Legarra, A. (2016). Comparing estimates of genetic variance across different relationship models. Theor. Population. Biol. 107, 26–30. doi: 10.1016/j.tpb.2015.08.005

CrossRef Full Text | Google Scholar

Longin, C. F., Muhleisen, J., Maurer, H. P., Zhang, H., Gowda, M., Reif, J. C. (2012). Hybrid breeding in autogamous cereals. Theor. Appl. Genet. 125 (6), 1087–1096. doi: 10.1007/s00122-012-1967-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Lorenzana, R. E., Bernardo, R. (2009). Accuracy of genotypic value predictions for marker-based selection in biparental plant populations. Theor. Appl. Genet. 120 (1), 151–161. doi: 10.1007/s00122-009-1166-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Lüttringhaus, S., Gornott, C., Wittkop, B., Noleppa, S., Lotze-Campen, H. (2020). The economic impact of exchanging breeding material: assessing winter wheat production in Germany. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.601013

PubMed Abstract | CrossRef Full Text | Google Scholar

Madsen, P., Jensen, J. (2013) DMU: A User’s Guide. A Package for Analysing Multivariate Mixed Models”. 6, release 5.2 ed. Available at: http://dmu.agrsci.dk.

Google Scholar

Melz, G., Melz, G., Hartmann, F. (2003). Genetics of a male-sterile rye of ´G-type´with results of the first F1-hybrids. Plant Breed. Seed. Sci. 47, 47–55.

Google Scholar

Meuwissen, T. H. E., Hayes, B. J., Goddard, M. E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829. doi: 10.1093/genetics/157.4.1819

PubMed Abstract | CrossRef Full Text | Google Scholar

Meuwissen, T. H. E., Hayes, B., Goddard, M. (2016). Genomic selection: A paradigm shift in animal breeding. Anim. Front. 6 (1), 6–6. doi: 10.2527/af.2016-0002

CrossRef Full Text | Google Scholar

Moritani, M., Taguchi, K., Kitazaki, K., Matsuhira, H., Katsuyama, T., Mikami, T., et al. (2013). Identification of the predominant nonrestoring allele for Owen-type cytoplasmic male sterility in sugar beet (Beta vulgaris L.): development of molecular markers for the maintainer genotype. Mol. Breed. 32 (1), 91–100. doi: 10.1007/s11032-013-9854-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Nielsen, N. H., Jahoor, A., Jensen, J. D., Orabi, J., Cericola, F., Edriss, V., et al. (2016). Genomic prediction of seed quality traits using advanced barley breeding lines. PloS One 11 (10), 1–18. doi: 10.1371/journal.pone.0164494

CrossRef Full Text | Google Scholar

Nishio, M., Satoh, M. (2014). Including dominance effects in the genomic BLUP method for genomic evaluation. PloS One 9 (1), e85792. doi: 10.1371/journal.pone.0085792

PubMed Abstract | CrossRef Full Text | Google Scholar

Raffo, M. A., Sarup, P., Guo, X., Liu, H., Andersen, J. R., Orabi, J., et al. (2022). Improvement of genomic prediction in advanced wheat breeding lines by including additive-by-additive epistasis. Theor. Appl. Genet. 135 (3), 965–978. doi: 10.1007/s00122-021-04009-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Ramstein, G. P., Larsson, S. J., Cook, J. P., Edwards, J. W., Ersoz, E. S., Flint-Garcia, S., et al. (2020). Dominance effects and functional enrichments improve prediction of agronomic traits in hybrid maize. Genetics 215 (1), 215–230. doi: 10.1534/genetics.120.303025

PubMed Abstract | CrossRef Full Text | Google Scholar

Reif, J. C., Gumpert, F. M., Fischer, S., Melchinger, A. E. (2007). Impact of interpopulation divergence on additive and dominance variance in hybrid populations. Genetics 176 (3), 1931–1934. doi: 10.1534/genetics.107.074146

PubMed Abstract | CrossRef Full Text | Google Scholar

Roth, M., Beugnot, A., Mary-Huard, T., Moreau, L., Charcosset, A., Fiévet, J. B., et al. (2022). Improving genomic predictions with inbreeding and nonadditive effects in two admixed maize hybrid populations in single and multienvironment contexts. Genetics 220 (4), 1–18. doi: 10.1093/genetics/iyac018

CrossRef Full Text | Google Scholar

Schrauf, M. F., Martini, J. W. R., Simianer, H., de Los Campos, G., Cantet, R., Freudenthal, J., et al. (2020). Phantom epistasis in genomic selection: on the predictive ability of epistatic models. G3 (Bethesda). 10 (9), 3137–3145. doi: 10.1534/g3.120.401300

PubMed Abstract | CrossRef Full Text | Google Scholar

Schulthess, A. W., Wang, Y., Miedaner, T., Wilde, P., Reif, J. C., Zhao, Y. (2016). Multiple-trait- and selection indices-genomic predictions for grain yield and protein content in rye for feeding purposes. Theor. Appl. Genet. 129 (2), 273–287. doi: 10.1007/s00122-015-2626-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Siekmann, D., Jansen, G., Zaar, A., Kilian, A., Fromme, F. J., Hackauf, B. (2021). A genome-wide association study pinpoints quantitative trait genes for plant height, heading date, grain quality, and yield in rye (Secale cereale L.). Front. Plant Sci. 12. doi: 10.3389/fpls.2021.718081

PubMed Abstract | CrossRef Full Text | Google Scholar

Technow, F., Riedelsheimer, C., Schrag, T. A., Melchinger, A. E. (2012). Genomic prediction of hybrid performance in maize with models incorporating dominance and population specific marker effects. Theor. Appl. Genet. 125 (6), 1181–1194. doi: 10.1007/s00122-012-1905-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Technow, F., Schrag, T. A., Schipprack, W., Bauer, E., Simianer, H., Melchinger, A. E. (2014). Genome properties and prospects of genomic prediction of hybrid performance in a breeding program of maize. Genetics 197 (4), 1343–1355. doi: 10.1534/genetics.114.165860

PubMed Abstract | CrossRef Full Text | Google Scholar

Vendelbo, N. M., Mahmood, K., Sarup, P., Hovmøller, M. S., Justesen, A. F., Kristensen, P. S., et al. (2021a). Discovery of a Novel Leaf Rust (Puccinia recondita) Resistance Gene in Rye (Secale cereale L.) Using Association Genomics. Cells 11 (1), 1–19. doi: 10.3390/cells11010064

PubMed Abstract | CrossRef Full Text | Google Scholar

Vendelbo, N. M., Mahmood, K., Sarup, P., Kristensen, P. S., Orabi, J., Jahoor, A. (2021b). Genomic scan of male fertility restoration genes in a ‘Gulzow’ Type hybrid breeding system of rye (Secale cereale L.). Int. J. Mol. Sci. 22 (17), 1–16. doi: 10.3390/ijms22179277

CrossRef Full Text | Google Scholar

Vendelbo, N. M., Sarup, P., Orabi, J., Kristensen, P. S., Jahoor, A. (2020). Genetic structure of a germplasm for hybrid breeding in rye (Secale cereale L.). PloS One 15 (10), 1–23. doi: 10.1371/journal.pone.0239541

CrossRef Full Text | Google Scholar

Vitezica, Z. G., Legarra, A., Toro, M. A., Varona, L. (2017). Orthogonal estimates of variances for additive, dominance, and epistatic effects in populations. Genetics 206 (3), 1297–1307. doi: 10.1534/genetics.116.199406

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Li, L., Yang, Z., Zheng, X., Yu, S., Xu, C., et al. (2017). Predicting rice hybrid performance using univariate and multivariate GBLUP models based on North Carolina mating design II. Heredity. (Edinb). 118 (3), 302–310. doi: 10.1038/hdy.2016.87

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y., Mette, M. F., Miedaner, T., Gottwald, M., Wilde, P., Reif, J. C., et al. (2014). The accuracy of prediction of genomic selection in elite hybrid rye populations surpasses the accuracy of marker-assisted selection and is equally augmented by multiple field evaluation locations and test years. BMC Genomics 15 (1), 1–12. doi: 10.1186/1471-2164-15-556

PubMed Abstract | CrossRef Full Text | Google Scholar

Werner, C. R., Qian, L., Voss-Fels, K. P., Abbadi, A., Leckband, G., Frisch, M., et al. (2018). Genome-wide regression models considering general and specific combining ability predict hybrid performance in oilseed rape with similar accuracy regardless of trait architecture. Theor. Appl. Genet. 131 (2), 299–317. doi: 10.1007/s00122-017-3002-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Würschum, T., Maurer, H. P., Kraft, T., Janssen, G., Nilsson, C., Reif, J. C. (2011). Genome-wide association mapping of agronomic traits in sugar beet. Theor. Appl. Genet. 123 (7), 1121–1131. doi: 10.1007/s00122-011-1653-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Würschum, T., Reif, J. C., Kraft, T., Janssen, G., Zhao, Y. S. (2013). Genomic selection in sugar beet breeding populations. BMC Genet. 14, 1–8. doi: 10.1186/1471-2156-14-85

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, Y., Mette, M. F., Gowda, M., Longin, C. F. H., Reif, J. C. (2014). Bridging the gap between marker-assisted and genomic selection of heading time and plant height in hybrid wheat. Heredity 112 (6), 638–645. doi: 10.1038/hdy.2014.1

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: genomic selection, non-additive genetic effects, Gca and sca, hybrid breeding, heterotic groups, inbreeding, grain yield, root yield

Citation: Kristensen PS, Sarup P, Fé D, Orabi J, Snell P, Ripa L, Mohlfeld M, Chu TT, Herrström J, Jahoor A and Jensen J (2023) Prediction of additive, epistatic, and dominance effects using models accounting for incomplete inbreeding in parental lines of hybrid rye and sugar beet. Front. Plant Sci. 14:1193433. doi: 10.3389/fpls.2023.1193433

Received: 24 March 2023; Accepted: 16 October 2023;
Published: 02 November 2023.

Edited by:

Jinyan Zhu, Yangzhou University, China

Reviewed by:

Bernd Hackauf, Julius Kühn-Institut, Germany
Habtamu Ayalew, Inari Agriculture, United States

Copyright © 2023 Kristensen, Sarup, Fé, Orabi, Snell, Ripa, Mohlfeld, Chu, Herrström, Jahoor and Jensen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Peter Skov Kristensen, cHNrckBxZ2cuYXUuZGs=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Prediction of additive, epistatic, and dominance effects using models accounting for incomplete inbreeding in parental lines of hybrid rye and sugar beet

1 Introduction

2 Materials and methods

2.1 Phenotypic and genotypic data

2.2 Genomic prediction models

2.3 Estimation of variance components and heritabilities

2.4 Cross-validation strategies and predictive abilities

3 Results

3.1 Phenotyping and genotyping

3.2 Genetic variances and heritabilities

3.3 Genomic predictions

4 Discussion

4.1 Genetic variances and heritabilities

4.2 Partitioning of genetic variance

4.3 Dominance and epistatic deviations

4.4 Cross-validation strategies and predictive abilities

4.5 Models accounting for incomplete inbreeding

5 Conclusion

Data availability statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good