Non-additive Effects in Genomic Selection

Varona, Luis; Legarra, Andres; Toro, Miguel A.; Vitezica, Zulma G.

doi:10.3389/fgene.2018.00078

REVIEW article

Front. Genet., 06 March 2018

Sec. Livestock Genomics

Volume 9 - 2018 | https://doi.org/10.3389/fgene.2018.00078

Non-additive Effects in Genomic Selection

1. Departamento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, Zaragoza, Spain
2. Instituto Agroalimentario de Aragón (IA2), Zaragoza, Spain
3. Génétique Physiologie et Systèmes d'Elevage (GenPhySE), Institut National de la Recherche Agronomique de Toulouse, Castanet-Tolosan, France
4. Departamento Producción Agraria, ETS Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, Madrid, Spain
5. Génétique Physiologie et Systèmes d'Elevage (GenPhySE), Université de Toulouse, Castanet-Tolosan, France

Abstract

In the last decade, genomic selection has become a standard in the genetic evaluation of livestock populations. However, most procedures for the implementation of genomic selection only consider the additive effects associated with SNP (Single Nucleotide Polymorphism) markers used to calculate the prediction of the breeding values of candidates for selection. Nevertheless, the availability of estimates of non-additive effects is of interest because: (i) they contribute to an increase in the accuracy of the prediction of breeding values and the genetic response; (ii) they allow the definition of mate allocation procedures between candidates for selection; and (iii) they can be used to enhance non-additive genetic variation through the definition of appropriate crossbreeding or purebred breeding schemes. This study presents a review of methods for the incorporation of non-additive genetic effects into genomic selection procedures and their potential applications in the prediction of future performance, mate allocation, crossbreeding, and purebred selection. The work concludes with a brief outline of some ideas for future lines of that may help the standard inclusion of non-additive effects in genomic selection.

Introduction

Through his experiments on pea plants, Gregor Mendel (1866) realized that some traits are dominant over others (for example “round peas” were dominant over “wrinkled peas”). In Mendel's own words: “As a rule, hybrids do not represent the form exactly intermediate between the parental strains… Those traits that pass into hybrid association entirely or almost entirely unchanged, thus themselves representing the traits of the hybrid, are termed “dominating,” and those that become latent in the association, “recessive””. Shortly after the rediscovery of Mendel's rules, it was observed that, in some cases, the addition of the individual action of genes could not explain the mode of inheritance, and Bateson (1909) coined the term “epistasis” to describe the cases in which the actions of two or more genes interact. A distinction must be drawn between biological (functional) genetic effects that correspond to the Mendelian definition (i.e., dominance means that the heterozygote value is higher or lower than the mean of homozygous genotypes) and statistical (population or weighted) effects which depend on allelic frequencies. In the latter, the relevant issue is the contribution of non-additive effects to genetic variance. Some authors argue that non-additive genetic effects may be a general phenomenon whose understanding is important for gaining more knowledge on the nature of quantitative traits, but whose contribution to variance is negligible (Crow, 2010).

From the perspective of quantitative genetics, Fisher (1918) conceived the infinitesimal model which postulates that a very large number of unlinked genes control the genetic variation of quantitative traits. He described the resemblance between relatives in a pure additive model which was quickly extended to incorporate dominance (Fisher, 1918; Wright, 1921). Resemblance between relatives including epistatic effects of second and higher order was also described (Cockerham, 1954; Kempthorne, 1954). However, whilst the formulation of the infinitesimal model in the additive context is evident, its interpretation is not clear when non-additive effects are included (Barton et al., 2017).

The main goal of animal or plant breeding is to identify, select and mate the best individuals of a breeding stock in order to maximize performance in future generations (Falconer and McKay, 1996; Bernardo, 2010). The procedure for computing the breeding values (genetic evaluation) of candidates for selection plays a crucial role. Traditionally, these methods use phenotypic and genealogical information, such as the selection index (Hazel, 1943) or the Best Linear Unbiased Predictor (Henderson, 1973) and rely on the foundations of the infinitesimal model (Fisher, 1918).

Nevertheless, non-additive genetic effects have been ignored in the genetic evaluation of livestock for several reasons: (i) the lack of informative pedigrees, such as large full-sib families; (ii) the calculations involved are more complex; (iii) the fact that statistical additive variance captures biological dominance or higher order interaction effects (Hill, 2010); and, (iv) the difficulty in using dominant values in practice (mate allocation). As a consequence, estimates of non-additive genetic variances are scarce in livestock populations (Misztal et al., 1998; Nguyen and Nagyné-Kiszlinger, 2016).

Genomic selection

Since the late 80s and 90s, developments in molecular genetics resulted in a set of neutral molecular markers, such as microsatellites, that were commonly used to detect QTL (Quantitative Trait Loci) in almost all livestock populations. The objective of those studies was to identify polymorphic markers or genes associated with phenotypic variation of traits of interest (www.animalgenome.org/QTL), with the ultimate goal of using them in Marker or Gene Assisted Selection (Dekkers, 2004). However, these strategies became obsolete with the advent of dense genotyping devices (Gunderson et al., 2005) that provided a very large amount of SNP (Single Nucleotide Polymorphism) and that allowed the development of genomic selection (GS) models (Meuwissen et al., 2001).

Genomic selection has become a very successful strategy for the prediction of the breeding values of candidates for selection and has revolutionized the field of animal breeding over the past decade. The basic idea of GS is to develop the following linear model:

The model explains the phenotypic data of m individuals (y_i) with i = 1 …m (or transformations of data, such as daughter yield deviations) by the effects associated with a very large number (n) of SNP (a_j) with j = 1 …n. Moreover, t_ij is the genotypic configuration (coded additively, e.g., Falconer and McKay, 1996) of the ith individual and for the jth SNP (0, 1, and 2 for A₁A₁, A₁A₂, and A₂A₂ genotypes, respectively), and e_i is the residual. Furthermore, the prediction of individual breeding values () of the candidates for selection can be calculated a posteriori from marker effect estimates as .

A significant limitation for implementation is that most genomic evaluation models suffer the statistical problem of a larger number of parameters (n) that must be estimated from a smaller number of data (m). The most common method employed for resolving this problem is the use of some type of regularization of SNP marker effects (Gianola, 2013). Several approaches have been suggested, ranging from a simple Gaussian regularization (Meuwissen et al., 2001) to more complex models that involve t shaped (Meuwissen et al., 2001), double exponential (De los Campos et al., 2009b), mixtures of distributions (Meuwissen et al., 2001; Habier et al., 2011; Erbe et al., 2012), or non-parametric or semi-parametric approaches (Gonzalez-Recio et al., 2014). The predictive ability of all these approaches depends on the genetic architecture of the traits being analyzed (Daetwyler et al., 2010), although for polygenic traits, all approaches offer similar results (Wang et al., 2015).

An interesting property of the assumption of a Gaussian prior distribution for marker effects (Random Regression BLUP—RR-BLUP) is that the GS model can be reformulated in terms of individual (animal) effects, using the equations of the Henderson's classic Mixed Model that provide breeding values for all individuals, including candidates for selection (Genomic BLUP or GBLUP). The only difference with standard mixed model equations is that the numerator relationship matrix (A) is replaced by the genomic relationship matrix (G), as defined by VanRaden (2008). In addition, this approach can be extended for the genetic evaluation of non-genotyped individuals in the Single-Step approach (Aguilar et al., 2010), facilitating the integration of GS procedures in the genetic evaluation of candidates for selection in most livestock breeding programmes. More recently, Fernando et al. (2014) described a Bayesian procedure that can also simultaneously evaluate genotyped and non-genotyped individuals and allows the use of alternative regularization procedures. Nevertheless, computational costs are markedly higher with the Bayesian model than with the Single-Step approach.

Despite the regularization procedure, the genomic evaluation methods are based on the evaluation of marker substitution effects through the construction of the covariates (t_ij) or the G matrix (above). The additive (or breeding) values capture a large part of dominant and higher-order interaction effects (Hill et al., 2008; Crow, 2010; Hill, 2010). Substitution effects that capture dominance and epistatic functional effects are not necessarily stable across generations or populations due to changes in allelic frequencies. In any case, only additive values (substitution effects) contribute to breeding values and are therefore expressed in the next generation. However, estimates of non-additive genetic effects may be of relevance because: (i) they may contribute to increasing the accuracy of prediction of breeding values and the response to selection (Toro and Varona, 2010; Aliloo et al., 2016; Duenk et al., 2017); (ii) they allow the definition of mate allocation procedures between candidates for selection (Maki-Tanila, 2007; Toro and Varona, 2010; Aliloo et al., 2017); and (iii) they can be used to benefit from non-additive genetic variation through the definition of appropriate crossbreeding or purebred breeding schemes (Maki-Tanila, 2007; Zeng et al., 2013).

Genomic selection models with dominance

The simplest approach for the inclusion of dominance in genomic selection models is to extend the basic model with the inclusion of a dominance effect (Toro and Varona, 2010; Su et al., 2012) associated to each SNP marker:

where y_i is the phenotypic value of the ith individual and μ is the population mean. For each of the n SNP markers, a_j and d_j are the additive and dominance effects for the jth marker, respectively. The covariates t_ij and c_ij are 2, 1, and 0 (coded additively) and 0, 1, and 0 (coded in a “biological dominant” manner) for the genotypes A₁A₁, A₁A₂, A₂A₂ of each marker, respectively. In some ways, pedigree-based models for dominance were based on “expected” dominant relationships. Thus, genomic models are based on “observed” heterozygotes. However, when using this model it should be noted that that a_j is no longer the marker substitution effect, but the “biological” additive genotypic effect and individual breeding values are not predicted. In fact, the partition of variance in statistical components due to additivity, dominance, and epistasis does not reflect the “biological” (or “functional”) effect of the genes although it is useful for prediction and selection (Huang and Mackay, 2016). The model was reformulated in terms of breeding values and dominance deviations (Falconer and Mackay, 1996) by Vitezica et al. (2013) after the assumption of a Hardy-Weinberg equilibrium within each:

where

and α_j = a_j + d_j (q_j − p_j) is now the allelic substitution effect and p_j and q_j are the allelic frequencies for A₁ and A₂ for the jth SNP marker. The genetic variance due to a single locus is:

where the additive variance is and the dominance variance is and the multilocus variances, under linkage equilibrium (LE), are , . In fact, “biological” (in terms of genotypic additive and dominant values) and “statistical” (in terms of breeding values and dominance deviations) models are equivalent parameterisations of the same model (Vitezica et al., 2013), and the following expressions:

that can be used to switch variance components estimates between “biological” ( and ) and “statistical” ( and ) models. It can be verified that . In addition, if p = q = 0.5, all variances are identical and if d = 0, . A further generalization can be also achieved to avoid the requirements of the Hardy-Weinberg equilibrium (Vitezica et al., 2017), by following the NOIA model (Alvarez-Castro and Carlborg, 2007) by replacing w_ij and g_ij with:

where, p_11j, p_12j, and p_22j are the genotypic frequencies for A₁A₁, A₁A₂, and A₂A₂ at the jth SNP marker, respectively.

Note that all these models require a regularization process for additive and dominance effects. The simplest approach is to expand the RR-BLUP by the assumption of a prior Gaussian distribution for the additive and dominance effects. It is feasible to assume any other kind of prior distribution for the dominance (as described above) and the additive effects (Acevedo et al., 2015). However, a major advantage of using a Gaussian prior distribution is that the model can be easily transformed into Henderson's Mixed Model equations by using the definition of additive (G) and dominance covariance matrices (D), as suggested by Vitezica et al. (2013).

Genomic selection models with dominance have been tested in several populations, including dairy cattle (Ertl et al., 2014; Aliloo et al., 2016; Jiang et al., 2017), pigs (Esfandyari et al., 2016; Xiang et al., 2016), sheep (Moghaddar and van der Werf, 2017), and layers (Heidaritabar et al., 2016) with ambiguous results. Jiang et al. (2017) found a negligible percentage of variation explained by dominance effects for productive life in a Holstein cattle population, although Ertl et al. (2014) suggested that dominance may suppose up to 39% of the total genetic variation for Somatic Cell Score in a population of Fleckvieh cattle. In general, the increase in the accuracy of additive breeding values by including dominance was scarce, with the exception of Aliloo et al. (2016).

Dominance and inbreeding depression (or heterosis)

The classical theory of quantitative genetics (Falconer and Mackay, 1996) postulates that inbreeding depression (or heterosis) occurs due to directional dominance. However, the presence of directional dominance (i.e., a higher percentage of positive than negative dominant effects) is in sharp contrast to the assumptions of the procedures described above that use symmetric prior distributions. This drawback can be overcome by the assumption of a mean of dominant effects that is different from zero, e.g., E(d) = μ_d, as proposed by Xiang et al. (2016). The standard model can be reformulated as:

where , then E(d^*) = 0. It should be noted the term is an average of dominance effects for the ith individual, because c_ij has a value of 1 for heterozygous loci and 0 for homozygous. Inbreeding (or full homozygosity) coefficients f_i can be calculated as:

So, . The first term nμ_d is absorbed in the overall mean of the model (μ), and the second (−f_inμ_d) corresponds to a covariate b = −nμ_d associated with inbreeding (f_i). This covariate can be seen as inbreeding depression (if it has a detrimental effect) caused by genomic inbreeding. In addition, it can be also implemented in the GBLUP models described above with the introduction of a covariate within the mixed model equations.

Nonetheless, it assumes that the expected mean of the dominance effects is the same for all markers. In the literature, there are signs that the decrease in performance is associated heterogeneously within the genomic regions (Pryce et al., 2014; Howard et al., 2015; Saura et al., 2015). Models that consider alternative means of dominance effects within genomic regions may be useful to model inbreeding depression in a more appropriate way.

An alternative approach to explain the phenomenon of inbreeding depression (or heterosis) is the consideration of a possible relationship between additive and dominance biological effects (Wellmann and Bennewitz, 2011). There is theoretical proofs (Caballero and Keightley, 1994) and empirical evidence (Bennewitz and Meuwissen, 2010) that supports this argument. Wellmann and Bennewitz (2012) expanded the “biological” model described above with regularization procedures that allows for this dependence. They defined up to four models (Bayes D0 to D3) based on the Bayes C approach (Verbyla et al., 2009). The last two models (Bayes D2 and D3) included dependencies between genotypic additive and dominance effects. In the first (D2), the dependence was modeled through the prior variance of the dominance effects (Var (d||a|)) and in the second (D3), they further expanded it to the prior mean (E (d||a|)), where |a| is the absolute value of the additive effect. Implementation of these models is extremely complex and they have not been thoroughly tested (Bennewitz et al., 2017).

Imprinting

Another source of non-additive genetic variation is genomic imprinting (Reik and Walter, 2001). This involves total or partial inactivation of paternal and maternal alleles. Following the quantitative model established by Spencer (2002), Nishio and Satoh (2015) put forward two alternative genomic selection models to include imprinting effects. The first extends the “statistical” model with dominance (in terms of breeding values and dominance deviations) as:

where

and i_j is the imprinting effects associated with jth marker. The second alternative proposed the distribution of the genetic effects into paternal (p_j) and maternal (m_j) gametic effects and a dominance deviation.

where

These models have been implemented in some studies with livestock data: (Hu et al., 2016) did not find an increase in predictive ability when imprinting effects were included in the model. In addition, estimates of the percentage of phenotypic variation caused by imprinting were small and ranged between 1.3 and 1.4% in pigs (Guo et al., 2016) and from 0.2 to 2.1% in dairy cattle (Jiang et al., 2017). However, this latter study reported that imprinting effects supposed more than 20% of the total genetic variance in some reproductive traits, like pregnancy or conception rate.

Epistasis

The last and most complex source of non-additive genetic variation is the epistatic interactions between two or more genes. An immediate approach for genomic evaluation including epistatic interactions is to define an explicit model by including pairwise or higher order epistatic effects:

where aa_jk, ad_jk, and dd_jk are second order additive x additive, additive x dominant and dominant x dominant epistatic effects between the jth and kth SNP effects and aaa_jkl, aad_jkl, add_jkl and ddd_jk are third order additive x additive x additive, additive x additive x dominant, additive x dominant x dominant and dominant x dominant x dominant epistatic effects. Despite the method of regularization used, the number of parameters to estimate is extremely large. Consequently, the computational requirements are enormous and the amount of information available, in the statistical sense, for the estimation of each epistatic effect is very small. Therefore, the most efficient (at least from a computational point of view) method for including epistatic interactions in genomic selection models is to define appropriate covariance matrices between individual effects, in the same way that the standard GBLUP model uses the genomic relationship matrix, but, in this case, taking into account the interactive nature of the genetic effects. There are two main approaches in the published literature: (1) the definition of genomic relationship matrices that consider epistatic interactions (Varona et al., 2014; Martini et al., 2016; Vitezica et al., 2017), and (2) the application of Kernel-based statistical methods (Gianola et al., 2006; de los Campos et al., 2009a; Morota and Gianola, 2014).

This simplest method for defining genomic relationship matrices is the extended GBLUP model (EGBLUP), described by Jiang and Reif (2015) and Martini et al. (2016). These authors start from a reduced version of the “biological” model:

and they define an equivalent model:

where μ is the general mean, y is the vector of phenotypic data and e is the vector of the residuals. In addition, the model includes one “biologically” additive (g₁) and one epistatic (g₂) multivariate Gaussian term with the following distributions:

Where G₁ = TT′ and G₂ = G₁° G₁ being:

and the Hadamard product. Moreover, n is the number of SNP markers and k the number of individuals. However, with this model the additive and epistatic effects are not orthogonal and dominant effects are not included. Therefore, it can only be used for prediction of the phenotypes and not for the estimation of variance components (Martini et al., 2016). To avoid this inconvenience, Varona et al. (2014) and Vitezica et al. (2017) developed a full orthogonal model. They start with the expansion of the individual genotypic effect into additive, dominance and epistatic effects:

Where g is the vector of the individual genotypic effects, g_A is the vector of additive effects, g_D the vector of individual dominance effects, g_ij is the second order epistatic effects, g_ijk the third order epistatic effects and so on. For simplicity, each individual effect is defined by the sum of SNP (or combination of SNP) effects h with equal prior Gaussian variability and weighted by an incidence matrix (H). So, for the additive and dominant effects, g_A=H_Aa and g_D=H_Dd: :

Where each h vector is composed by n (number of SNP markers) elements (h_Ai = {h_Ai1, h_Ai2, …, h_Ain} and h_Di = {h_Di1, h_Di2, …, h_Din}) and a and d are the vectors of the SNP additive and dominant effects. These h_Ai and h_Divectors can be defined in several ways, depending of the reference point or the assumption of the Hardy-Weinberg equilibrium, among others. However, orthogonal partitioning of variances must follow the NOIA approach (Alvarez-Castro and Carlborg, 2007):

Therefore, and under the assumption that SNP additive or dominant effects follow a Gaussian distribution, the additive and dominant “genomic” (co) variance relationship matrices can be computed as:

where the division by traces standardizes the variance components to an ideal infinite “unrelated” population. For second order epistatic effects (g_AA, g_AD, and g_DD), Alvarez-Castro and Carlborg (2007) proved that:

and, as a consequence, the matrices H_AA, H_AD and H_DD can be written as:

and, as before, under the assumption of Gaussian distribution of second-order epistatic effects, the covariance between them can be calculated as:

and the covariance between any higher order epistatic effects must be:

However, H matrices are extremely large and calculation of HH′ cross-products is computationally expensive; each H matrix has as many columns as marker interactions and as many rows as individuals. Nevertheless, Vitezica et al. (2017) provided an algebraic shortcut that allows calculation from the additive and dominance matrices, described above, as:

For higher order interactions the results are equivalent. As an example, the covariance matrix for the AAD epistatic interaction can be calculated as:

It should be noted that G∘G… products tend to I and higher order epistatic effects tend to be confused with residuals. Nevertheless, this orthogonal approach assumes linkage equilibrium between SNP molecular markers. Linkage disequilibrium (LD) modifies the distribution of the variance into additive, dominance and epistatic components, and orthogonal partition is not possible (Hill and Maki-Tanila, 2015). In outbred populations, substantial LD is present only between polymorphisms in tight linkage (Hill and Maki-Tanila, 2015). However, whilst the distribution of epistatic effects is still unclear (Wei et al., 2015, there is evidence of epistatic interactions between linked loci (Lynch, 1991). Alternative approaches, such as those of Akdemir and Jannick (2015) and Akdemir et al. (2017) have been developed to define locally epistatic relationship matrices. These studies used a RKHS (Reproducing Kernel Hilbert Space) to define these matrices and average them.

The RKHS approach to model epistatic interactions relies on the idea that the relationship between phenotypes and genotypes may not be linear (Gianola et al., 2006; de los Campos et al., 2009a). The main objective is to predict the performance of each individual given its marker genotype through a function that maps the genotypes into phenotypic responses. One of the simplest methods is to consider that this function is linear and, consequently, the results are equivalent to the GBLUP approach. Nevertheless, the power of the Kernel concept relies on the possibility of using alternative functions of marker genotypes. In short, RKHS procedures result in some non-parametric functions g() of a SNP markers set (X):

and define a cost function to minimize

where the term is a norm under a Hilbert space. Kimeldorf and Wahba (1971) found that g(X) can be reformulated as:

where K is a positive semi-definite matrix that meets the requisites of a Kernel Matrix. It defines the similarity between individuals and meets the distance requirements in a Hilbert space (Wootters, 1981). The performance of the method depends on an adequate choice of K that can be chosen from among a very large number of options. The easiest RKHS option is to use the genealogical (A) or genomic (G) relationship matrices as kernel matrices (Rodríguez-Ramilo et al., 2014), this leads to the standard BLUP or the GBLUP as particular cases of RKHS. However, they only are able to capture the additive genetic variation and if the model tries to accommodate dominance or epistatic interactions, an alternative Kernel matrix has to be implemented for a pair of SNP vectors of two individuals (x and x′). Most kernels proposed so far (Gianola et al., 2006; Piepho, 2009; Morota et al., 2013; Tusell et al., 2014) consider the similarity across individuals within loci (i.e., similarities within loci are summed). Using Taylor series expansions, it can be shown that kernels of this type are a weighted sum of the additive (G) and dominance covariance matrices (D), and therefore implicitly account for dominance (Piepho, 2009). However, these kernels do not consider joint similarity across loci. A kernel that includes epistasis should measure similarities simultaneously between pairs, triplets etc., of loci across individuals, as described in Jiang and Reif (2015) and Martini et al. (2016).

Applications of genomic selection with non-additive genetic effects

Predictive performance

The most direct application of the genomic prediction models is to predict the performance of an individual for continuous or categorical phenotypes. Here the introduction of non-additive genetic effects in the procedures of prediction becomes relevant, as the main objective is to predict performance conditioned on the genotype of the individual, despite the additive, dominant or epistatic gene action. In fact, simulation studies show up to 17% more accurate predictions based on the sum of additive and dominance effects compared to prediction based on only additive effects (Wellmann and Bennewitz, 2012; Da et al., 2014). However, the performance of semi-parametric or non-parametric approaches such as RKHS methods seems to be appropriate because they are designed to maximize predicting ability over a given individual and not to predict the future performance of the progeny; they are also designed to capture complex and non-explicit interactions. Moreover, some new research fields have merged with genomic evaluation for predicting future performance, examples include: microbiomics (Ramayo-Caldas et al., 2016; Yang et al., 2017), metabolomics (Fontanesi, 2016) and precision farming (Banhazi et al., 2012). Over time they will provide a global picture of the genetic and environmental circumstances that affect the future performance of individuals and they will contribute to the development of more accurate prediction models.

Mate allocation

In the past, there was a strong belief in “nicking”: pairs of individuals that, wisely selected, would give rise to very efficient offspring (Lush, 1943). In terms of quantitative genetics, the existence of “nicking” would imply that there is large variance of dominant deviations (or epistasis) compared to the variance of breeding values, something that finally turned out to be generally false. Even so, there is room for mate allocation within a population (Toro and Varona, 2010). Under models that include dominance effects, the output of the genomic selection procedure can be used to calculate the prediction of performance of future mating (G_ij) between the ith and jth individual as:

where pr_ijk(A₁A₁), pr_ijk(A₁A₂), and pr_ijk(A₂A₂) are the probabilities of the genotypes A₁A₁, A₁A₂, and A₂A₂ for the combination of the ith and jth individual and the kth marker, â_k and are the estimates of the additive and dominance effects for the same marker and n is the number of markers. Later, optimisation procedures like linear programming (Jansen and Wilton, 1985) or heuristic approximations (simulated annealing, Kirkpatrick et al., 1983) can be used to define a set of mates that maximize performance in the future generation. In a simulated example, Toro and Varona (2010) compared random mating vs. mate selection with a model including dominance and found advantages that ranged between 6 and 22% of the expected response. Sun et al. (2013), Ertl et al. (2014), and Aliloo et al. (2017) have confirmed these improvements with dairy cattle data. However, its implementation in livestock populations is limited because it must be taken into account that the accuracy of the prediction of a potential mate will be low and the advantage will be only relevant when traits have a large amount of non-additive genetic variance. In addition, it requires the genotyping of male and females in the population that is not always available. Moreover, the use of models that include more complex interactions, such as models with epistatic effects or non-parametric approaches, is not so immediate. In fact, the predicted performance of a mate should be calculated after integrating the predictive performance over all possible future genotypic configurations of the expected progeny. For epistasis (but not for dominance) these genotypic configurations also depend on recombination fractions across the genome.

Selection for crossbreeding

There is consensus that profit from non-additive genetic effects in a selection program can be obtained when commercial animals are the product of mating with those that do not participate in the maintenance of a breeding population. The typical way to proceed is to produce two-way or three-way crosses between populations maintained and selected separately (i.e., in pigs). Selection is carried out within lines to benefit from additivity and, in addition, the value of the cross may increase due to the heterosis. Some of the most popular livestock production systems, including pig, poultry, and rabbit production, involve regular crossbreeding schemes, with the aim of capturing the complementarity between the performance of the purebred populations and heterosis. The breeding goal within pure lines is to select individuals to maximize the response in the crossbred population. The traditional approach for this objective was Reciprocal Recurrent Selection—RRS—(Comstock et al., 1949). RRS postulates the selection of individuals in purebred populations based on the performance of their crossbred progeny. If the source of information is the performance of these crossbred progeny, the main drawback of the practical application of RRS is the increase of generation intervals that reduce overall genetic response. In practical terms, the performance of the pure lines is used, and a high genetic purebred/crossbred correlation is sought in order to warrant correct genetic progress (Wei and van der Werf, 1994), however, this may not be the case because of non-additive effects or genotype x environment (G x E) interactions.

The use of genomic information can provide a very useful tool to improve the ability of prediction of breeding values in purebred populations based on crossbred performance without the need to wait for recording crossbred progeny. Ibánez-Escriche et al. (2009) designed a first approach of the use of GS for crossbred performance under a purely additive model. This study defined a breed specific genomic selection model as:

where is the SNP allele at the jth locus from breed k and received from the sire of the ith individual that can take values 0 or 1, and is the breed-specific substitution effect for the jth locus and the kth breed. Similarly, and were defined for the alleles received from the dam of the lth breed. The objective of this approach was to estimate allele substitution effects within breed. Even under the assumption of absence of G x E interactions, SNP allele substitution effects may differ between populations due to: (1) Specific population patterns of linkage disequilibrium with the QTL, or (2) The presence of genotypic dominance effects. The allelic substitution effects of the A (or B) population (α_A or α_B) on performance of A x B depends on the biological additive (a) and dominance (d) effects, and the allelic frequencies of B–p_B- (or A–p_A -) as α_A = a + (1 − 2p_B) d or α_B = a + (1 − 2p_A) d). Under dominance, Kinghorn et al. (2010) demonstrated a clear advantage of this approach, assuming the estimation of SNP effects was perfect. This model has been expanded by Sevillano et al. (2017) to a three-way crossbreeding scheme, after the evaluation of a procedure to trace the breed-of-origin of alleles in three-way crossbred animals (Sevillano et al., 2016). This is an example of the “partial genetic” approach (substitution effects defined within populations). Stuber and Cockerham (1966) showed that gene substitution effects can be defined within populations or across populations, and, if all the (non-additive) effects are accounted for, both approaches are equivalent. Christensen et al. (2015) proposed an alternative model called the “common genetic” approach. Both models were compared by Xiang et al. (2016, 2017) in the same data set with very similar results, but more research is still needed.

Crossbreeding implies mating between individuals of parental populations and a formal description of the additive and dominance variance in the crossbred population is required to evaluate the relevance of mate allocation when the crossbreds are generated. Toosi et al. (2010) and Zeng et al. (2013) extended the aforementioned model to include additive and dominance effects and proved (in both cases with simulated data) its superiority over the strictly additive model if dominance variance is present. These results were confirmed by Esfandyari et al. (2015), who proved that the response to selection for crossbreeding performance is increased by training on crossbred genotypes and phenotypes, and by tracking the allele line origin when pure lines are not closely related. Later, Vitezica et al. (2016) described the substitution effects and dominance deviations within the scope of an F1 population and showed that the additive and dominant variance in a crossbred population is:

where are the additive variance generated by the purebred populations A and B, respectively, is the dominance variance, p_A, q_A, p_Band q_B are the allelic frequencies in purebred populations, and a and d are the additive and dominance effects.

However, all these approaches assume that the additive and dominance effects have the same magnitude in pure and crossbred populations and this implies an absence of G x E interaction. To avoid this restriction, Vitezica et al. (2016) and Xiang et al. (2016) proposed a multivariate genomic BLUP that is capable of considering different additive and dominance effects and their correlations between pure and crossbred populations.

Selection in purebred populations

The response to selection in purebred populations depends on the magnitude of the additive variance and on the prediction of the additive breeding values for the candidates for reproduction. It is usually assumed that it is not worth selecting individuals with the highest dominance values because they will go back to zero as a result of random mating. However, Toro (1993, 1998) proposed two mating strategies that can be used to take advantage of dominance in a closed population. The first (Toro, 1993), was a method that basically consists of performing two types of mating: (a) minimum coancestry mating in order to obtain the progenies that will constitute the commercial population and will also be utilized for testing, and (b) maximum coancestry mating from which the breeding population will be maintained. Toro's second strategy (Toro, 1998) advocates the use of the selection of grandparental combinations. Both strategies are analogous with reciprocal-recurrent selection (Comstock et al., 1949) in that they rely on the crucial distinction between commercial and breeding populations. Nevertheless, they have been exclusively tested by simulation and with a reduced set of genes with known additive and dominance effect. Their efficiency has yet to be verified using a large number of SNP markers.

Final remarks

Despite huge efforts in the development of statistical models for the implementation of genomic selection with non-additive effects, there are still some issues that have to be dealt with before the use of these models in genomic evaluation becomes standard. A major obstacle is the lack of serious testing as this requires extensive data sets with genotypes and phenotypes, and these data sets are rare. In fact, non-additive genetic variance is expected to be low for most traits (Crow, 2010; Hill et al., 2010), with the exception of fitness related traits. Therefore, the inclusion of non-additive effects in genomic selection models will provide very low (or negligible) improvement in the genetic response or the ability of prediction.

Non-additive effects are easily incorporated into GBLUP procedures (Vitezica et al., 2013, 2017) but efforts must be made to define a single-step approach (Aguilar et al., 2010) that is able to use phenotypic data from non-genotyped individuals and the complete genealogical information of breeding schemes. The major limitation of the GBLUP or single-step approaches is the calculation of the inverse of the genomic relationship matrices (G), the introduction of non-additive effects will involve the calculation of the inverse of additional matrices related with dominance or epistatic effects. Nevertheless, this is really a constraint in populations with a large number of genotyped individual (i.e., Holstein), while most of the livestock populations do not suffer for any limitations. In fact, the computational cost for inverting additive and non-additive genomic relationship matrices is equivalent. On the other hand, using current pedigree-based BLUP models based on dominance (de Boer and Hoeschele, 1993) seems futile because the models are computationally complicated.

Recent studies (Xiang et al., 2016) have shown that inbreeding depression can be modeled and included in GS approaches through a covariate with the average individual heterozygosity. Nevertheless, this approach only considers the effects of the dominance in inbreeding depression and the role of epistatic interactions in inbreeding depression (Minvielle, 1987) has not yet been studied. However, directional dominance is not necessary requisite for having a substantial dominance variance. In fact it would be interesting to know if there are traits with substantial dominance variance and without inbreeding depression, because they would be good candidates for successful strategies of using dominance. In addition, it should be mentioned that the genetic architecture of non-additive genetic effects and its relationship with inbreeding depression and heterosis is a relevant subject of future research.

The presence of dominance with inbreeding implies the existence of up to five variance components in pedigree-based analysis (Smith and Maki-Tanila, 1990; de Boer and Hoeschele, 1993): additive; dominance between non-inbred; dominance between inbred; covariance between additive; and, inbred dominance values and inbreeding depression. As far as we know, this model has only been used twice with real data in animal breeding (Shaw and Woolliams, 1999; Fernández et al., 2017); their equivalence with the variance components captured by SNP marker effects has to be clarified.

Finally, the parametric approach for the estimation of epistatic effects (Vitezica et al., 2017) fails when linkage disequilibrium is present. A full description of the effect of the genes and their interactions in populations under linkage disequilibrium and the definition of predictive effects has not been reformulated within the scope of genomic selection. It is unclear what we mean by genetic variances when there is linkage disequilibrium, particularly because linkage disequilibrium is population specific and unstable across generations or subpopulations. Nevertheless, Mäki-Tanila and Hill (2014) showed that when the number of loci increases, epistatic variance disappears. At the same time, the proportion of dominance variance stays the same. Thus, dominance variance is the main non-additive component even with linkage disequilibrium (Hill and Maki-Tanila, 2015).

Statements

Author contributions

LV prepare the initial draft of the review and it was corrected and improved by AL, MT, and ZV. The final manuscript was read and approved by all the authors.

Acknowledgments

This work was financed by the INRA SELGEN metaprogram—project EpiSel (ZV, AL), CGL2016-80155 (LV) and CGL2016-75904-C2-2-P (MT) of Ministerio de Economía y Competitividad of Spain.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1
AcevedoC. F.de ResendeM. D. V.SilvaF. F.VianaJ. M. S.ValenteM. S. F.ResendeM. F. R.et al. (2015). Ridge, Lasso and Bayesian additive-dominance genomic models. BMC Genetics16:105. 10.1186/s12863-015-0264-2
2
AguilarI.MisztalI.JohnsonD. L.LegarraA.TsurutaS.LawlorT. J. (2010). Hot topic: a unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score. J. Dairy Sci. 93, 743–752. 10.3168/jds.2009-2730
3
AkdemirD.JannickJ. (2015). Locally epistatic genomic relationships matrices for genomic association and prediction. Genetics199, 857–871. 10.1534/genetics.114.173658
4
AkdemirD.JannickJ.Isidro-SanchezJ. (2017). Locally epistatic models for genome-wide prediction and association by importance sampling. Genet. Sel. Evol. 49:74. 10.1186/s12711-017-0348-8
5
AlilooH.PryceJ. E.González-RecioO.CocksB. G.HayesB. J. (2016). Accounting for dominance to improve genomic evaluations of dairy cows for fertility and milk production traits. Genet. Sel. Evol. 48:186. 10.1186/s12711-016-0186-0
6
AlilooH.PryceJ. E.González-RecioO.CocksB. G.GoddardM. E.HayesB. J. (2017). Including non-additive genetic effects in mating programs to maximize dairy farm profitability. J. Dairy Sci. 100, 1203–1222. 10.3168/jds.2016-11261
7
Alvarez-CastroJ. M.CarlborgO. (2007). A unified model for functional and statistical epistasis and its application in quantitative trait loci analysis. Genetics176, 1151–1167. 10.1534/genetics.106.067348
8
BanhaziT. M.LehrH.BlackJ. L.CrabtreeH.SchofieldP.TscharkeM.et al. (2012). Precision livestock farming: an international review of scientific and commercial aspects. Int. J. Agric. Biol. Eng.5, 1–9. 10.3965/j.ijabe.20120503.001
- CrossRef
- Google Scholar
9
BartonN. H.EtheridgeA. M.VéberA. (2017). The infinitesimal model: definition, derivation and implications. Theor. Pop. Biol. 118, 50–73. 10.1016/j.tpb.2017.06.001
10
BatesonW. (1909). Mendel's Principles of Heredity. Cambridge, UK: Cambridge University Press Warehouse. 10.5962/bhl.title.44575
- CrossRef
- Google Scholar
11
BennewitzJ.MeuwissenT. H. E. (2010). The distribution of QTL additive and dominance effects in porcine F2 crosses. J. Anim. Breed. Genet. 127, 171–179. 10.1111/j.1439-0388.2009.00847.x
12
BennewitzJ.EdelC.FriesR.MeuwissenT. H. E.WellmannR. (2017). Application of a Bayesian dominance model improves power in quantitative trait genome-wide association analysis. Genet. Sel. Evol.49:7. 10.1186/s12711-017-0284-7
13
BernardoR. (2010). Breeding for Quantitative Traits in Plants, 2nd Edn. Woodsbury, MN: Stemma Press.
- Google Scholar
14
CaballeroA.KeightleyP. D. (1994). A pleiotropic nonadditive model of variation in quantitative traits. Genetics138, 883–900.
- Pubmed Abstract
- Google Scholar
15
ChristensenO. F.LegarraA.LundM. S.SuG. (2015). Genetic evaluation for three-way crossbreeding. Genet. Sel. Evol, 47:177. 10.1186/s12711-015-0177-6
16
CockerhamC. C. (1954). An extension of the concept of partitioning hereditary variance for analysis of covariances among relatives when epistasis is present. Genetics39, 859–882.
- Pubmed Abstract
- Google Scholar
17
ComstockR. E.RobinsonH. F.HarveyP. H. (1949). A breeding procedure designed to make maximum use of both general and specific combining ability. Agron. J.41, 360–367. 10.2134/agronj1949.00021962004100080006x
- CrossRef
- Google Scholar
18
CrowJ. F. (2010). On epistasis: why it is unimportant in polygenic directional selection. Philos. Trans. R. Soc. Lond. B Biol. Sci. 365, 1241–1244. 10.1098/rstb.2009.0275
19
DaY.WangC.WangS.HuG. (2014). Mixed model methods for genomic prediction and variance component estimation of additive and dominance effects using SNP markers. PLoS ONE9:e87666. 10.1371/journal.pone.0087666
20
DaetwylerH. D.Pong-WongR.VillanuevaB.WoolliamsJ. A. (2010). The impact of genetic architecture on genome-wide evaluation methods. Genetics185, 1021–1031. 10.1534/genetics.110.116855
21
de BoerI.HoescheleI. (1993). Genetic evaluation methods for populations with dominance and inbreeding. Theor. Appl. Genet. 86, 245–258. 10.1007/BF00222086
22
de los CamposG.GianolaD.RosaG. J. (2009a). Reproducing kernel Hilbert spaces regression: a general framework for genetic evaluation. J. Anim. Sci. 87, 1883–1887. 10.2527/jas.2008-1259
23
De los CamposG.NayaH.GianolaD.CrossaJ.LegarraA.ManfrediE.et al. (2009b). Prediction quantitative traits with regression models for dense molecular markers and pedigree. Genetics182, 375–385. 10.1534/genetics.109.101501
24
DekkersJ. C. (2004). Commercial application of marker- and gene-assisted selection in livestock: strategies and lessons. J. Anim. Sci. 82, E313–E328. 10.2527/2004.8213_supplE313x
25
DuenkP.CalusM. P. L.WientjesY. C. J.BijmaP. (2017). Benefits of dominance over additive models for the estimation of average effects in the presence of dominance. G3 Genes Genomes Genetics7, 3405–3414. 10.1534/g3.117.300113
26
ErbeM.HayesB. J.MatukumalliL. K.GoswaniS.BowmanP. J.ReichC. M.et al. (2012). Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels. J. Dairy Sci. 95, 4114–4129. 10.3168/jds.2011-5019
27
ErtlJ.LegarraA.VitezicaZ. G.VaronaL.EdelC.EmmerlingR.et al. (2014). Genomic analysis of dominance effects on milk production and conformation traits in Fleckvieh cattle. Genet. Sel. Evol. 46:40. 10.1186/1297-9686-46-40
28
EsfandyariH.BijmaP.HenryonM.ChristensenO. F.SorensenA. C. (2016). Genomic prediction of crossbred performance based on purebred Landrace and Yorkshire data using a dominance model. Genet. Sel. Evol. 48:40. 10.1186/s12711-016-0220-2
29
EsfandyariH.SorensenA. C.BijmaP. (2015). A crossbred reference population can improve the response to genomic selection for crossbred performance. Genet. Sel. Evol. 47:76. 10.1186/s12711-015-0155-z
30
FalconerD. S.McKayT. (1996). Introduction to Quantitative Genetics. Harlow: Pearson Education Limited.
- Google Scholar
31
FernándezE. N.LegarraA.MartínezR.SánchezJ. P.BaselgaM. (2017). Pedigree-based estimation of covariance between dominance deviations and additive genetic effects in closed rabbit lines considering inbreeding and using a computationally simpler equivalent model. J. Anim. Breed. Genet. 134, 184–195. 10.1111/jbg.12267
32
FernandoR. L.DekkersJ. C.GarrickD. J. (2014). A class of Bayesian methods to combine large numbers of genotyped and non-genotyped animals for whole-genome analysis. Genet. Sel. Evol. 46:50. 10.1186/1297-9686-46-50
33
FisherR. A. (1918). The correlation between relatives on the supposition of Mendelian Inheritance. Trans. R. Soc. Edinburgh52, 399–433. 10.1017/S0080456800012163
- CrossRef
- Google Scholar
34
FontanesiL. (2016). Metabolomics and livestock genomics: insights into a phenotyping frontier and its application in animal breeding. Anim. Front.6, 73–79. 10.2527/af.2016-0011
- CrossRef
- Google Scholar
35
GianolaD. (2013). Priors in whole-genome regression: the Bayesian alphabet returns. Genetics194, 573–596. 10.1534/genetics.113.151753
36
GianolaD.FernandoR. L.StellaA. (2006). Genomic-assisted prediction of genetic value with semiparametric procedures. Genetics173, 1761–1776. 10.1534/genetics.105.049510
37
Gonzalez-RecioO.RosaG. J. M.GianolaD. (2014). Machine learning methods and predictive ability metrics for genome-wide predictin of complex traits. Livest. Sci. 166, 217–231.
- Google Scholar
38
GundersonK. L.SteemersF. J.LeeG.MendozaL. G.CheeM. S. (2005). A genome-wide scalable SNP genotyping asay using microarray technology. Nat. Genet. 37, 549–554. 10.1038/ng1547
39
GuoX.ChristensenO. F.OstersenT.WangY.LundM. S.SuG. (2016). Genomic prediction using models with dominance and imprinting effects for backfat thickness and average daily gain in Danish Duroc pigs. Genet. Sel. Evol. 48:67. 10.1186/s12711-016-0245-6
40
HabierD.FernandoR. L.KizilkayaK.GarrickD. J. (2011). Extension of the bayesian alphabet for genomic selection. BMC Bioinformatics12:186. 10.1186/1471-2105-12-186
41
HazelL. N. (1943). The genetic basis for constructing selection indexes. Genetics28, 476–490.
- Pubmed Abstract
- Google Scholar
42
HeidaritabarM.WolcA.ArangoJ.ZengJ.SettarP.FultonJ. E.et al. (2016). Impact of fitting dominance and additive effects on accuracy of genomic prediction of breeding values in layers. J. Anim. Breed. Genet. 133, 334–346. 10.1111/jbg.12225
43
HendersonC. R. (1973). Sire evaluation and genetic trends, in Proceedings of the Animal Breeding and Genetics Symposium in Honour of Dr. Jay L. Lush 10-41 (Champaing, IL: ASAS and ADSA).
- Google Scholar
44
HillW. G. (2010). Understanding and using quantitative genetic variation. Philos. Trans. R. Soc. Lond. B Sci. 365, 73–85. 10.1098/rstb.2009.0203
45
HillW. G.Maki-TanilaA. (2015). Expected influence of linkage disequilibrium on genetic variance caused by dominance and epistasis on quantitative traits. J. Anim. Breed. Genet. 132, 176–186. 10.1111/jbg.12140
46
HillW. G.GoddardM. E.VisscherP. M. (2008). Data and theory point to mainly additive genetic variance for complex traits. PLoS Genet.4:e1000008. 10.1371/journal.pgen.1000008
47
HowardJ. T.Haile-MariamM.PryceJ. E.MalteccaC. (2015). Investigation of regions impacting inbreeding depression and their association with the additive genetic effect for United States and Australia Jersey dairy cattle. BMC Genomics16:813. 10.1186/s12864-015-2001-7
48
HuY.RosaG. J. M.GianolaD. (2016). Incorporating parent-of-origin effects in whole-genome prediction of complex traits. Genet. Sel. Evol. 48:34. 10.1186/s12711-016-0213-1
49
HuangW.MackayT. F. C. (2016). The genetic architecture of quantitative traits cannot be inferred from variance component analysis. PLoS Genet.10:e1006421. 10.1371/journal.pgen.1006421
- CrossRef
- Google Scholar
50
Ibánez-EscricheN.FernandoR. L.ToosiA.DekkersJ. C. (2009). Genomic selection of purebreds for crossbred performance. Genet. Sel. Evol. 41:12. 10.1186/1297-9686-41-12
51
JansenG. B.WiltonJ. W. (1985). Selecting mating pairs with linear programming techniques. J. Dairy Sci. 68, 1302–1305. 10.3168/jds.S0022-0302(85)80961-9
52
JiangJ.ShenB.O' ConnellJ. R.VanRadenP. M.ColeJ. B.MaL. (2017). Dissection of additive, dominance, and imprinting effects for production and reproduction traits in Holstein cattle. BMC Genomics18:425. 10.1186/s12864-017-3821-4
53
JiangY.ReifJ. C. (2015). Modeling epistasis in genomic selection. Genetics201, 759–768. 10.1534/genetics.115.177907
54
KempthorneO. (1954). The correlation between relatives in a random mating population. Proc. R. Soc. Lond. B Biol. Sci. 143, 102–113. 10.1098/rspb.1954.0056
55
KimeldorfG.WahbaG. (1971). Some results on Tchebycheffian spline functions. J. Math. Anal. Appl. 33, 82–95. 10.1016/0022-247X(71)90184-3
- CrossRef
- Google Scholar
56
KinghornB. P.HickeyJ. M.van der WerfJ. H. J. (2010). Reciprocal recurrent genomic selection for total genetic merit in crossbred individuals, in Proceedings of the 9th World Congress on Genetics Applied to Livestock Production (Leipzig), 36.
- Google Scholar
57
KirkpatrickS.GelattC. D.VecchiM. P. (1983). Optimization by simulated annealing. Science220, 671–680. 10.1126/science.220.4598.671
58
LushJ. L. (1943). Animal Breeding Plans, 2nd Edn. Ames, IA: The Collegiate Press Inc., Iowa.
- Google Scholar
59
LynchM. (1991). The genetic interpretation of inbreeding depression and outbreeding depression. Evolution45, 622–629. 10.1111/j.1558-5646.1991.tb04333.x
60
Maki-TanilaA. (2007). An overview on quantitative and genomic tools for utilising dominance genetic variation in improving animal production. Agric. Food Sci. 16, 188–198. 10.2137/145960607782219337
- CrossRef
- Google Scholar
61
Mäki-TanilaA.HillW. G. (2014). Influence of gene interaction on complex trait variation with multilocus models. Genetics198, 355–367. 10.1534/genetics.114.165282
62
MartiniJ. W.WimmerV.ErbeM.SimianerH. (2016). Epistasis and covariance: how gene interaction translates into genomic relationship. Theor. Appl. Genet.129, 963–976. 10.1007/s00122-016-2675-5
63
MendelG. (1866). Versuche über Pflanzen-Hybriden. – Verhandlungen des Naturforschenden Vereines, Abhandlungern, Brünn, 4, 3–47. Editions in different languages published by Matlová (1973). 10.5962/bhl.title.61004
- CrossRef
- Google Scholar
64
MeuwissenT. H.HayesB.GoddardM. E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics157, 1819–1829.
- Pubmed Abstract
- Google Scholar
65
MinvielleF. (1987). Dominance is not necessary for heterosis: a two-locus model. Genet. Res. 49, 245–247. 10.1017/S0016672300027142
- CrossRef
- Google Scholar
66
MisztalI.VaronaL.CulbertsonM.GenglerN.BertrandJ. K.MabryJ.et al. (1998). Studies on the value of incorporating the effect of dominance in genetic evaluations of dairy cattle, beef cattle and swine. Biotechnol. Agron. Soc. Environ. 2, 227–233.
- Google Scholar
67
MoghaddarN.van der WerfJ. H. J. (2017). Genomic estimation of additive and dominance effects and impact of accounting for dominance on accuracy of genomic evaluation in sheep populations. J. Anim. Breed. Genet. 134, 453–462. 10.1111/jbg.12287
68
MorotaG.GianolaD. (2014). Kernel-based whole-genome prediction of complex traits: a review. Front. Genet. 5:363. 10.3389/fgene.2014.00363
69
MorotaG.KoyamaM.RosaG. J. M.WeigelK. A.GianolaD. (2013). Predicting complex traits using a diffusion kernel on genetic markers with an application to dairy cattle and wheat data. Genet. Sel. Evol. 45:17. 10.1186/1297-9686-45-17
70
NguyenT. N.Nagyné-KiszlingerH. (2016). Dominance effects in domestic populations. Acta Agraria Kaposvariensis20, 1–20.
- Google Scholar
71
NishioM.SatohM. (2015). Genomic best linear unbiased prediction method including imprinting effects for genomic evaluation. Genet. Sel. Evol. 47:32. 10.1186/s12711-015-0091-y
72
PiephoH. P. (2009). Ridge regression and extensions for genomewide selection in maize. Crop Sci. 49, 1165–1176. 10.2135/cropsci2008.10.0595
- CrossRef
- Google Scholar
73
PryceJ. E.Haile-MariamM.GoddardM. E.HayesB. J. (2014). Identification of genomic regions associated with inbreeding depression in Holstein and Jersey dairy cattle. Genet. Sel. Evol. 46:71. 10.1186/s12711-014-0071-7
74
Ramayo-CaldasY.MachN.LepageP.LevenezF.DenisC.LemonnierG.et al. (2016). Phylogenetic network analysis applied to pig gut microbiota identifies an ecosystem structure linked with growth traits. ISME J.10, 2973–2977. 10.1038/ismej.2016.77
75
ReikW.WalterJ. (2001). Genomic imprinting, parental influence on the genome. Nat. Rev. Genet. 2, 21–32. 10.1038/35047554
76
Rodríguez-RamiloS. T.García-CortésL. A.González-RecioO. (2014). Combining genomic and genealogical information in a reproducing kernel hilbert spaces regression model for genome-enabled predictions in dairy cattle. PLoS ONE9:e93424. 10.1371/journal.pone.0093424
77
SauraM.FernándezA.VaronaL.FernándezA. I.De CaraM. A. R.BarragánC.et al. (2015). Detecting inbreeding depression for reproductive traits in Iberian pigs using genome wide data. Genet. Sel. Evol. 47:12. 10.1186/s12711-014-0081-5
78
SevillanoC. A.VandenplasJ.BastiaansenJ. W. M.CalusM. P. L. (2016). Empirical determination of breed-of-origin of alleles in three-way crossbred pigs. Genet. Sel. Evol. 48:55. 10.1186/s12711-016-0234-9
79
SevillanoC. A.VandenplasJ.BastiaansenJ. W. M.BergsmaR.CalusM. P. L. (2017). Genomic evaluation for a three-way crossbreeding system considering breed-of-origin of alleles. Genet. Sel. Evol. 49:75. 10.1186/s12711-017-0350-1
80
ShawF. H.WoolliamsJ. A. (1999). Variance component analysis of skin and weight data for sheep subjected to rapid inbreeding. Genet. Sel. Evol. 31, 43–59. 10.1186/1297-9686-31-1-43
- CrossRef
- Google Scholar
81
SmithS. P.Maki-TanilaA. (1990). Genotypic covariance matrices and their inverses for models allowing dominance and inbreeding. Genet. Sel. Evol. 22, 65–91. 10.1186/1297-9686-22-1-65
- CrossRef
- Google Scholar
82
SpencerH. G. (2002). The correlation between relatives on the supposition of genomic imprinting. Genetics161, 411–417.
- Pubmed Abstract
- Google Scholar
83
StuberC. W.CockerhamC. C. (1966). Gene effects and variances in hybrid populations. Genetics64, 1279–1286
- Google Scholar
84
SuG.ChristensenO. F.OstersenT.HenryonM.LundM. S. (2012). Estimating additive and non-additive genetic variances and predicting genetic merits using genome-wide dense single nucleotide polymorphism markers. PLoS ONE7:e45293. 10.1371/journal.pone.0045293
85
SunC.VanRadenP. M.O'ConnellJ. R.WeigelK. A.GianolaD. (2013). Mating programs including genomic relationships and dominance effects. J. Dairy Sci. 96, 8014–8023. 10.3168/jds.2013-6969
86
ToosiA.FernandoR. L.DekkersJ. C. (2010). Genomic selection in admixed and crossbred populations. J. Anim. Sci. 88, 32–46. 10.2527/jas.2009-1975
87
ToroM. A. (1993). A new method aimed at using the dominance variance in closed breeding populations. Genet. Sel. Evol. 25, 63–74. 10.1186/1297-9686-25-1-63
- CrossRef
- Google Scholar
88
ToroM. A. (1998). Selection of grandparental combinations as a procedure designed to make use of dominance genetic effects. Genet. Sel. Evol. 30, 339–349. 10.1186/1297-9686-30-4-339
- CrossRef
- Google Scholar
89
ToroM. A.VaronaL. (2010). A note on mate allocation for dominance handling in genomic selection. Genet. Sel. Evol. 42:33. 10.1186/1297-9686-42-33
90
TusellL.Pérez-RodríguezP.ForniS.GianolaD. (2014). Model averaging for genome-enabled prediction with reproducing kernel Hilbert spaces: a case study with pig litter size and wheat yield. J. Anim. Breed Genet. 131, 105–115. 10.1111/jbg.12070
91
VanRadenP. M. (2008). Efficient methods to compute genomic predictions. J. Dairy Sci. 91, 4414–4123. 10.3168/jds.2007-0980
92
VaronaL.VitezicaZ. G.MunillaS.LegarraA.- (2014). A general approach for calculation of genomic relationship matrices for Epistatic effects, in Proceedings from the 10th World Congress on Genetics Applied to Livestock Production (Vancouver, BC), 11–22.
- Google Scholar
93
VerbylaK. L.HayesB. J.BowmanP. J.GoddardM. E. (2009). Accuracy of genomic selection using stochastic search variable selection in Australian Holstein Friesian dairy cattle. Genet. Res.91, 307–311. 10.1017/S0016672309990243
94
VitezicaZ. G.LegarraA.ToroM. A.VaronaL. (2017). Orthogonal estimates of variances for additive, dominance and epistatic effects in populations. Genetics206, 1297–1307. 10.1534/genetics.116.199406
95
VitezicaZ. G.VaronaL.LegarraA. (2013). On the additive and dominant variance and covariance of individuals within the genomic selection scope. Genetics195, 1223–1230. 10.1534/genetics.113.155176
96
VitezicaZ. G.VaronaL.ElsenJ. M.MisztalI.HerringW.LegarraA. (2016). Genomic BLUP including additive and dominant variation in purebreds and F1 crossbreds, with an application in pigs. Genet. Sel. Evol. 48:6. 10.1186/s12711-016-0185-1
97
WangX.YangZ.XuC. (2015). A comparison of genomic selection methods for breeding value prediction. Sci. Bull.60, 925–935. 10.1007/s11434-015-0791-2
- CrossRef
- Google Scholar
98
WeiM.van der WerfJ. H. J. (1994). Maximizing genetic response in crossbreds using both purebred and crossbred information. Anim. Prod.59, 401–413. 10.1017/S0003356100007923
- CrossRef
- Google Scholar
99
WeiW. H.HemaniG.HaleyC. S. (2015). Detecting epistasis in human complex traits. Nat. Rev. Genet. 15, 722–733. 10.1038/nrg3747
100
WellmannR.BennewitzJ. (2011). The contribution of dominance to the understanding of quantitative genetic variation. Genet. Res.92, 139–154. 10.1017/S0016672310000649
- CrossRef
- Google Scholar
101
WellmannR.BennewitzJ. (2012). Bayesian models with dominance effects for genomic evaluation of quantitative traits. Genet. Res.94, 21–37. 10.1017/S0016672312000018
102
WoottersW. K. (1981). Statistical distance and Hilbert space. Phys. Rev. D23, 357–363. 10.1103/PhysRevD.23.357
- CrossRef
- Google Scholar
103
WrightS. (1921). Systems of mating. I. The biometric relations between parent and offspring. Genetics6, 111–123.
- Pubmed Abstract
- Google Scholar
104
XiangT.ChristensenO. F.LegarraA. (2017). Technical note: genomic evaluation for crossbred performance in a single-step approach with metafounders. J. Anim. Sci. 95, 1472–1480. 10.2527/jas2016.1155
105
XiangT.ChristensenO. F.VitezicaZ. G.LegarraA. (2016). Genomic evaluation by including dominance effects and inbreeding depression for purebred and crossbred performance with an application in pigs. Genet. Sel. Evol. 48:92. 10.1186/s12711-016-0271-4
106
YangH.HuangX.FangS.HeM.ZhaoY.WuZ.et al. (2017). Unraveling the fecal microbiota and metagenomic functional capacity associated with feed efficiency in pigs. Front. Microbiol.8:1555. 10.3389/fmicb.2017.01555
107
ZengJ.ToosiA.FernandoR. L.DekkersJ. C. M.GarrickD. J. (2013). Genomic selection of purebred animals for crossbred performance in the presence of dominant gene action. Genet. Sel. Evol. 45:11. 10.1186/1297-9686-45-11

Summary

Keywords

genomic selection, dominance, epistasis, crossbreeding, genetic evaluation

Citation

Varona L, Legarra A, Toro MA and Vitezica ZG (2018) Non-additive Effects in Genomic Selection. Front. Genet. 9:78. doi: 10.3389/fgene.2018.00078

Received

15 December 2017

Accepted

19 February 2018

Published

06 March 2018

Volume

9 - 2018

Edited by

Rohan Luigi Fernando, Iowa State University, United States

Reviewed by

Romdhane Rekaya, University of Georgia, United States; Piter Bijma, Wageningen University & Research, Netherlands

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Luis Varona lvarona@unizar.es

This article was submitted to Livestock Genomics, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Livestock Genomics

REVIEW article

Non-additive Effects in Genomic Selection

Abstract

Introduction

Genomic selection

Genomic selection models with dominance

Dominance and inbreeding depression (or heterosis)

Imprinting

Epistasis

Applications of genomic selection with non-additive genetic effects

Predictive performance

Mate allocation

Selection for crossbreeding

Selection in purebred populations

Final remarks

Statements

Author contributions

Acknowledgments

Conflict of interest

References

Summary

Outline

Cite article

Article metrics

REVIEW article

Non-additive Effects in Genomic Selection

Abstract

Introduction

Genomic selection

Genomic selection models with dominance

Dominance and inbreeding depression (or heterosis)

Imprinting

Epistasis

Applications of genomic selection with non-additive genetic effects

Predictive performance

Mate allocation

Selection for crossbreeding

Selection in purebred populations

Final remarks

Statements

Author contributions

Acknowledgments

Conflict of interest

References

Summary

Outline

Cite article

Share article

Article metrics