- 1Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
- 2Wageningen University & Research, Animal Breeding and Genomics, Wageningen, Netherlands
- 3Danish Pig Research Centre, SEGES, Copenhagen, Denmark
- 4School of Agriculture and Environment, The University of Western Australia, Crawley, WA, Australia
- 5VikingGenetics, Randers, Denmark
- 6SEGES, Aarhus, Denmark
Genotype × environment interaction (G × E) is of increasing importance for dairy cattle breeders due to international multiple-environment selection of animals as well as the differentiation of production environments within countries. This theoretical simulation study tested the hypothesis that genomic selection (GS) breeding programs realize larger genetic benefits by cooperation in the presence of G × E than conventional pedigree-based selection (PS) breeding programs. We simulated two breeding programs each with their own cattle population and environment. Two populations had either equal or unequal population sizes. Selection of sires was done either across environments (cooperative) or within their own environment (independent). Four scenarios, (GS/PS) × (cooperative/independent), were performed. The genetic correlation (rg) between the single breeding goal trait expressed in two environments was varied between 0.5 and 0.9. We compared scenarios for genetic gain, rate of inbreeding, proportion of selected external sires, and the split-point rg that is the lowest value of rg for long-term cooperation. Between two equal-sized populations, cooperative GS breeding programs achieved a maximum increase of 19.3% in genetic gain and a maximum reduction of 24.4% in rate of inbreeding compared to independent GS breeding programs. The increase in genetic gain and the reduction in rate of inbreeding realized by GS breeding programs with cooperation were respectively at maximum 9.7% and 24.7% higher than those realized by PS breeding programs with cooperation. Secondly, cooperative GS breeding programs allowed a slightly lower split-point rg than cooperative PS breeding programs (0.85∼0.875 vs ≥ 0.9). Between two unequal-sized populations, cooperative GS breeding programs realized higher increase in genetic gain and showed greater probability for long-term cooperation than cooperative PS breeding programs. Secondly, cooperation using GS were more beneficial to the small population while also beneficial but much less to the large population. In summary, by cooperation in the presence of G × E, GS breeding programs realize larger improvements in terms of the genetic gain and rate of inbreeding, and have greater possibility of long-term cooperation than conventional PS breeding programs. Therefore, we recommend cooperative GS breeding programs in situations with mild to moderate G × E, depending on the sizes of two populations.
Introduction
Genotype × environment interaction (G × E) is present in situations where a specific environmental variation has different effects on different genotypes (Falconer and Mackay, 1996b). Environmental variation can be physical, chemical, biological, behavior patterns, or life events (Ottman, 1996). G × E is often illustrated by the unparallel norms of reaction (Lynch and Walsh, 1998). It can lead to scaling effect including different variances, and reranking effect of genotypes between environments. Reranking is often measured by the genetic correlation (rg) between a trait expressed in two different environments (Falconer, 1952). An rg less than unity indicates the presence of G × E. For instance, G × E exists for milk production in Holstein dairy cattle with rg ranging from 0.6 to 1 between environments within and between countries (Konig et al., 2005; Nauta et al., 2006; Haile-Mariam et al., 2008; Huquet et al., 2012; Hugo et al., 2017). The rg of functional traits, such as fertility, tends to be lower than that of production traits (rg = 0.33 ∼ 0.74) (Haile-Mariam et al., 2008; Hugo et al., 2017). When G × E exists, it makes good sense to utilize the best combinations of genotypes and environments for more efficient animal production because G × E implies that genotypes differ in genetic potential of adapting to variable environments (Falconer and Mackay, 1996c; Montaldo, 2001). Therefore, from the perspective of breeding programs to achieve more efficient animal production, it is essential to investigate the mechanisms that contribute to improving genetic gain and simultaneously decreasing inbreeding when G × E exists between different environments.
In many cases, different breeding programs are operated in different environments because of G × E. Moreover, cooperation of breeding programs, e.g., by using jointly each other’s sires, can result in larger genetic gain and lower rate of inbreeding. The cooperation discussed here refers to the combination of joint-evaluation and across-environment selection. Cooperation of breeding programs in the presence of G × E depends on the rg between environments. In the era before genomic selection (GS), Banos and Smith (1991) proposed that selection of Holstein sires across countries would achieve the maximum genetic response in each country when rg was above 0.7 or 0.8. Smith and Banos (1991) added that small populations benefited from combined selection, whereas large populations also benefited but less when the initial genetic means were the same. Mulder and Bijma (2006) found that cooperation between two equal-sized breeding programs using conventional evaluation and selection strategies was possible in the first generations with rg as low as 0.4 to 0.6, but only possible in the long term with rg as high as 0.8 to 0.9, which was called the split-point rg. The split-point rg is the lowest value of rg for long-term cooperation. In other words, if rg is lower than split-point rg then each breeding program selects animals from the external population only in the short-term, while if rg is higher than split-point rg then each breeding program selects animals from the external population in the short and also long term. Thus, when the split-point rg is lower, it indicates greater probability of long-term cooperation. It is evident that the level of G × E influences the decision of running cooperative or independent breeding programs. The above-mentioned studies using pedigree-based selection (PS) found improvement in genetic gain by cooperation but none of them investigated the impact of cooperation on the rate of inbreeding. While our study attempted to fill this gap of rate of inbreeding and investigated the mechanisms of benefiting from cooperation using GS in the presence of G × E.
Genomic selection increases genetic gain and reduces rate of inbreeding compared to the conventional methods (Schaeffer, 2006; Daetwyler et al., 2007; Buch et al., 2012). We hypothesized that this also applies to GS implemented in situations where across-environment genetic evaluation and selection of external breeding animals takes place. Genotypes for single nucleotide polymorphisms (SNP) are more likely to be well replicated across environments than individuals or their close relatives (Hayes et al., 2016). This enables GS to increase the accuracies of domestic animals and external animals and reduce the difference in accuracy between them compared to PS. Consequently, GS exploits larger variation in the distributions of estimated breeding values (EBV) of both domestic candidates and external candidates and brings two distributions closer to each other compared to conventional PS evaluation, thus increases genetic gain and pulls the split-point rg to a lower numeric value, i.e., has a greater probability of long-term cooperation. Inbreeding is expected to be reduced by the increased reference population that contains more genetic diversity and by the selection of external animals across environments with slightly different selection. In addition, at the split-point rg it is expected to observe the minimum rate of inbreeding. Because at the split-point rg it is an open breeding system with the greatest number of ancestors who have made long-term genetic contributions to the current generation among all levels of rg. The greatest number of ancestors results in a minimum rate of inbreeding, according to the relationship between long-term genetic contributions from a generation of ancestors and rate of inbreeding of the current population: , where ΔF is the rate of inbreeding of the current generation, α is the constant departure from Hardy-Weinberg proportions, ri is the long-term genetic contribution of ancestor i (Wray and Thompson, 1990; Bijma and Woolliams, 2000; Woolliams et al., 2015). In view of these rationale, we hypothesized that GS breeding programs would realize larger improvements in genetic gain and rate of inbreeding by cooperation, and have greater probability for long-term cooperation in the presence of G × E than conventional PS breeding programs. To test this hypothesis, we stochastically simulated two breeding programs that resembled two dairy cattle breeding programs carried out in two different environments each with their own cattle population. Two breeding programs had either equal or unequal population sizes. Selection was for a single trait, which could also be regarded as a multi-trait selection index. We also varied the rg of the selected trait expressed in two environments and the heritability (h2) of that trait.
Materials and Methods
Design
We used stochastic simulation to estimate improvements in genetic gain and inbreeding and the probability of long-term cooperation made by two cooperative GS or PS breeding programs in the presence of G × E. For this purpose, we designed four scenarios each with two interacting breeding programs that resembled two dairy cattle breeding programs carried out in two different environments.
Across-Environment Genomic Evaluation and Across-Environment Selection (AG-AS)
Both breeding programs adopted across-environment single-step genomic best linear unbiased prediction (ssGBLUP) for all animals and across-environment selection for sires in generations t = 6…20. This scenario simulated two cooperative GS breeding programs. “Cooperation” in this study was defined as the situation that one population eventually selected sire(s) from the other population in generations t = 6…20 based on across-environment genetic evaluation.
Within-Environment Genomic Evaluation and Within-Environment Selection (WG-WS)
Both breeding programs adopted within-environment ssGBLUP for all animals and within-environment selection for sires in generations t = 6…20. This scenario simulated two independent GS breeding programs. It provided the reference to estimate the benefits of cooperation made by AG-AS compared to the independent GS breeding programs.
Across-Environment Pedigree-Based Evaluation and Across-Environment Selection (AP-AS)
Both breeding programs adopted across-environment pedigree-based BLUP (PBLUP) for all animals and across-environment selection for sires in generations t = 6…20. This scenario simulated two cooperative conventional breeding programs. It was compared to AG-AS for the capability of benefiting from cooperation in the presence of G × E either with PBLUP or ssGBLUP.
Within-Environment Pedigree-Based Evaluation and Within-Environment Selection (WP-WS)
Both breeding programs adopted within-environment PBLUP for all animals and within-environment selection for sires in generations t = 6…20. This scenario simulated two independent conventional breeding programs. It provided the reference to estimate the benefits of cooperation made by AP-AS compared to independent PS breeding programs.
For all four scenarios, two breeding programs — having their own environment (E1/E2) and cattle population — had either equal (1000:1000) population sizes or unequal (400:1600) population sizes. The population size was determined by the number of selected reproductive females. The selected trait was controlled by 2,000 biallelic quantitative trait loci (QTL). Each breeding program aimed to improve the breeding goal trait in its own environment (Trait 1/Trait 2) and selected sires and dams by truncation selection on the EBV of its own environment. Trait 1 and Trait 2 in the base population had equal initial genetic means of 0, additive genetic variances () of 1.0 and h2 ranging from 0.1 to 0.5. The rg between Trait 1 and Trait 2 varied from 0.5 to 0.9 representing different levels of G × E between E1 and E2. Phenotypes were only observed for females, which resembled milk production. Each scenario was replicated 50 times and each replicate was simulated for 20 discrete generations (t = 0, 1…20). Each generation took one unit of time that consisted of a series of related events, namely culling, update of reproductive cycles, selection, mating and sampling offspring.
Simulation Procedure
Generation −3000 to −1: Founder Population
A common founder population was simulated using QMSim software (Sargolzaei and Schenkel, 2009) for all replicates of all scenarios, to establish linkage disequilibrium (LD) information between markers and QTL. It was based on the LD profile of Danish Red Dairy Cattle population in order to have a genetic architecture that resembles a real population. This reference genome-wide LD profile refers to 6,581 animals with 43,621 markers (unpublished). The founder population was simulated for 3,000 discrete generations, and the numbers of both sexes always remained the same, and it was always random mating with replacement between males and females. The detailed simulation set-up for founder population applied in this study was same as the “diverse” breeding program tailed for Red dairy cattle in Thomasen et al. (2020), which intensively elaborates information such as the number and the length of chromosomes, the initial numbers and distribution of QTL and SNP markers, set-up regarding expansion and bottleneck of the population size, the rule of descendants inheriting alleles, the selection of the final QTL and SNP markers, and so on. At generation t = −1, 2,000 QTL and 40,000 SNP loci were obtained; chromosomes from 300 male and 300 female founders were gathered by sex to form 30 paternal and 30 maternal pools each with 600 chromosomes. Chromosomes were subsequently sampled from these pools to establish genotypes of base animals. To sum up, the effective population size of the founder population is 600 (Oldenbroek, 2007).
Generation 0: Base Population
Simulation specifically for each scenario was initiated at generation t = 0 by sampling genotypes from those above-mentioned chromosome pools to form the base population consisting of 200 males and 4,000 females. For each base animal, two chromosomes for each pair of chromosomes were randomly sampled without replacement from the corresponding chromosome pool of its sex. Then the pair of sampled chromosomes were replaced before sampling for the next base animal. Base animals were designed to be non-inbred and unrelated to each other by assigning each of them unique alleles at 6,000 IBD (identical by descent) marker loci, which included 200 IBD markers equidistantly distributed within each chromosome and never involved in selection. Each base animal’s contribution to their descendant generations was traced and true inbreeding of descendants relative to the base population was inferred based on these 6,000 IBD markers.
Base animals were randomly appointed to E1 and E2, and the numbers of appointed males and females to either environment were determined by the population size. For the ith base animal in each environment, the phenotype of the breeding goal trait, pi, was calculated as pi = ai + ei, where ai was that base animal’s true breeding value (TBV) and ei was the residual environmental value. In particular, TBV for the breeding goal trait (ai) as well as TBV for the “unexpressed” trait (aiu) that was only expressed in the other environment was simulated simultaneously, both of which were calculated as the sum of 2,000 QTL effects. The effects of each QTL on the two traits followed a bivariate normal distribution, , and all effects were subsequently scaled to achieve an initial total genetic variance-covariance matrix of (i.e., for each trait , the additive QTL variance explained all genetic variance). ei was obtained by sampling from e∼N(0, 1/h2−1). As a result, the phenotypes of the breeding goal trait of base animals in each environment had a mean of 0 and a standard deviation of 1/h.
Generation 1: Random Selection From Base Population
Parents in generation t = 1 were randomly selected from base animals within environments with the selected proportion of 50%. In generations t = 1…5, only selected sires were genotyped.
In generations t = 1…20, selected males were evenly and randomly mated with selected females. QTL transmitted from parents to offspring as per Mendelian rules without newly generated mutation after the founder population was established. QTL effects on both traits were the same as for base animals, but allele frequency at each QTL could change due to selection and drift. TBV for each trait for each animal born in generations t = 1…20 was the sum of two allelic effects over all QTL. The residual environmental values and phenotypes were obtained as for base animals. To maintain population sizes, each selected female produced three offspring with a probability of 0.5 for both male and female offspring.
Generation 2 to 5: Separate Breeding Using Within-Environment Evaluation and Selection
In generations t = 2…5, each breeding program was carried out independently in their own environment using within-environment truncation selection for both sexes based on within-environment PBLUP.
Generation 6 to 20: “Scenarios”
In generations t = 6…20, four scenarios were carried out with different genetic evaluation strategies or different selection strategies of sires as above-mentioned. But dams were always selected within-environments irrespective of scenarios. All animals were genotyped prior to selection. Table 1 lists the simulation information of two breeding programs in scenario AG-AS in full details. Except for the genetic evaluation strategies and the selection strategies of sires in generations t = 6…20, other all scenarios shared the same details as AG-AS.
Genetic Evaluation
Genetic evaluation was executed by DMU version 6 (Madsen and Jensen, 2014) in generation t = 2…20. The models used in all scenarios can be summarized as follows:
where yi was the vector of observations of ith trait, i = 1, 2; βi was the vector of generation as the fixed effect in the ith environment, the number of levels of which increased by one every generation; ai was the vector of additive genetic effects of ith trait; and ei was the vector of residual effects of ith trait. Xi and Zi were the incidence matrices connecting βi and ai to yi, respectively. It was assumed that and , where H was the matrix of additive genetic relationships among individuals in the pedigree if it was PBLUP, or the unified genetic relationship matrix of all genotyped and non-genotyped animals in the pedigree if it was ssGBLUP (Legarra et al., 2014); I was the identity matrix; ⊗ was the symbol for the Kronecker product of two matrices; was the additive genetic variance of ith trait; rgσa1σa2 was the additive genetic covariance between two traits; was the residual variance of ith trait (Henderson and Quaas, 1976). For ssGBLUP, the H-matrix requires the genomic covariance matrix of breeding values, G, for genotyped animals, which was obtained by the second method proposed by VanRaden (2008). Residual covariance between two traits was zero, because each animal stayed in one environment and therefore only expressed the breeding goal trait in that environment. When the evaluation was within environment, the rg was set to 0, equivalent to running simultaneously two univariate models.
Genetic Gain, Rate of Inbreeding, and Split-Point Correlation
The indicators used to evaluate each breeding program were genetic gain per generation, rate of inbreeding per generation and the split-point rg.
Genetic gain per generation was computed for each replicate as the regression coefficient of mean TBV of animals on generation (t = 10…20). The presented genetic gain was the average over 50 replicates. The unit of genetic gain was the genetic standard deviation of the breeding goal trait in the base population.
Rate of inbreeding per generation was calculated for each replicate as ΔF = 1 − exp(bln(1−Ft), t), where ΔF was rate of inbreeding, Ft was the mean inbreeding coefficient at generation t, bln(1−Ft), t was the regression coefficient of the natural logarithm of (1 − Ft) on generation (t = 10…20). Because the inbreeding coefficient at any generation t referred to the base population is
Therefore,
which deduces that ln(1 − ΔF) is the regression coefficient of ln(1 − Ft) on generation t. Ft was derived based on the average inbreeding coefficient of those 6,000 fore-mentioned IBD loci. Inbreeding at each IBD locus was defined as the probability that two alleles at that locus for a randomly selected animal out of the population are IBD. The presented rate of inbreeding was the average over 50 replicates. In addition, the ancestors who had made long-term genetic contributions to the current generation were traced and counted to help interpret the occurrence or absence of the minimum rate of inbreeding.
We evaluated genetic gain and rate of inbreeding per generation instead of per unit of time. Because the biological risks of the negative influence of inbreeding, namely inbreeding depression and deleterious alleles, as well as its balancing processes, such as mutation, are more relevant in the context of generation being the time scale (Daetwyler et al., 2007). Although the rates per generation numerically equalized the rates per unit of time in this study. Moreover, results regarding genetic gain and rate of inbreeding were based on generation 10 to 20 to avoid the unsteadiness caused by build-up of pedigree information (Dekkers, 1992) and reference information for genomic prediction, and reduction of genetic variance due to selection (Bulmer, 1971). Hereinafter improvements in genetic gain and rate of inbreeding by cooperation will be expressed as percentage of increment of cooperative scenarios over their corresponding independent scenarios. In details, the improvement in genetic gain and rate of inbreeding by AG-AS will be respectively calculated as [(ΔGAG−AS − ΔGWG−WS)/ΔGWG−WS] and [(ΔFAG−AS − ΔFWG−WS)/ΔFWG−WS] in percentage terms. While by AP-AS, they will be respectively [(ΔGAP−AS − ΔGWP−WS)/ΔGWP−WS] and [(ΔFAP−AS − ΔFWP−WS)/ΔFWP−WS] in percentage terms.
Split-point rg was determined for scenario AG-AS and AP-AS after computing the proportion of sires from the external environment in all sires selected by each environment in generations t = 6…20. The split-point rg is the lowest value of rg for long-term cooperation. In other words, above the split-point rg, one breeding program kept selecting sires from the external environment without decrease in the proportion of external sires till generation 20.
Software
The simulations were run using ADAM (Pedersen et al., 2009), a software to stochastically simulate breeding programs for animals and plants.
Results
Improvements in Genetic Gain and Rate of Inbreeding
The following sections presented findings for the situation that two environments had equal population sizes. These results were only presented for E1; results from E2 were similar to E1.
Genomic selection breeding programs realized larger increase in genetic gain by cooperation than conventional PS breeding programs (Table 2). Scenario AG-AS realized genetic gain ranging from 0.50 ± 0.04 to 0.79 ± 0.04, while AP-AS realized genetic gain between 0.31 ± 0.04 and 0.51 ± 0.04. It was almost always AG-AS > WG-WS > AP-AS > WP-WS in terms of the ranking of genetic gain. AG-AS increased up to 19.3% in genetic gain relative to WG-WS, while AP-AS increased up to 9.6% relative to WP-WS. AG-AS had significantly larger (P < 0.0001) increase in genetic gain than AP-AS given the same h2 and the same rg. At most AG-AS realized 9.7% higher increase than AP-AS at h2 = 0.1 and rg = 0.9. Also note that, at h2 = 0.3 and rg = 0.8, AG-AS obtained improvements in genetic gain with an increase of 1.2% and in rate of inbreeding with a reduction of 7.1% relative to WG-WS, without selecting any external sire in generations t = 10…20 (Figure 1). Following this, we ran an additional scenario, AG-WS (across-environment genomic evaluation and within-environment selection for sires) for all levels of rg combined with h2 of 0.3. It turned out to be intermediate between AG-AS and WG-WS in terms of genetic gain and rate of inbreeding (not shown).
Table 2. Genetic gain per generation (ΔG1) and rate of inbreeding per generation (ΔF) realized in E12 for EPS3 situation 4.
Figure 1. Proportions of sires from E21 in sires selected by E11 for EPS2 situation for AG-AS3 and AP-AS3. 1E1/E2, environment 1/environment 2. 2EPS, situation that two environments had equal population sizes (1000:1000). 3AG-AS, scenario of across-environment genomic evaluation and across-environment selection for sires; AP-AS, scenario of across-environment pedigree-based evaluation and across-environment selection for sires. The red horizontal line at Y = 0.02 acts as the baseline, below which it implies that E1 selected no sire from E2.
Genomic selection breeding programs realized larger reduction in rate of inbreeding by cooperation than conventional PS breeding programs (Table 2). Scenario AG-AS realized rate of inbreeding ranging from 0.0068 ± 0.0016 to 0.0106 ± 0.0025, while AP-AS realized rate of inbreeding between 0.0088 ± 0.0027 and 0.0138 ± 0.0036. In most cases, it was AG-AS < WG-WS < AP-AS < WP-WS in terms of the numeric ranking of rate of inbreeding. Throughout all combinations of h2 and rg, AG-AS decreased the rate of inbreeding up to 24.4% relative to WG-WS, while AP-AS decreased the rate of inbreeding up to 16.1% relative to WP-WS. By comparison, AG-AS had significantly larger (P < 0.00001) reduction in rate of inbreeding than AP-AS given the same h2 and the same rg. At most, AG-AS realized 24.7% larger reduction in rate of inbreeding than AP-AS at h2 = 0.1 and rg = 0.875, where AG-AS favorably realized a reduction in inbreeding rate while AP-AS unfavorably realized an increase in inbreeding rate relative to their respective independent scenarios.
Special attention was focused on AG-AS because it displayed the strongest capacity of increasing genetic gain and reducing rate of inbreeding. Within AG-AS genetic gain increased as rg increased independent of the heritability, with a few exceptions (Table 2). For all preset levels of h2, the minimum rate of inbreeding appeared at rg of 0.85∼0.875 that will be proven as the split-point rg of AG-AS in the next paragraph. At this split-point rg of 0.85∼0.875, we found that the greatest number of ancestors had made long-term genetic contributions to the current generation. For instance, the numbers of male and female ancestors reached a maximum at rg = 0.875, the split-point rg for h2 = 0.3, in generations t = 10…18 for E1 (Figure 2). In fact, the minimum rate of inbreeding of AP-AS appeared at rg of 0.9 (Table 2), which will also be proven as the split-point rg of AP-AS subsequently.
Figure 2. Numbers of ancestors who had made long-term genetic contributions to 3,000 offspring born at generation 20 in E11 for a randomly selected replicate for EPS2 situation for AG-AS3 (h2 = 0.3). 1E1, environment 1. 2EPS, the situation that two environments had equal population sizes (1000:1000). 3AG-AS, scenario of across-environment genomic evaluation and across-environment selection for sires.
Split-Point Correlation Between Short-Term and Long-Term Cooperation
Genomic selection breeding programs showed a slightly lower split-point rg between short-term cooperation and long-term cooperation than conventional PS breeding programs (0.85∼0.875 vs. ≥0.9, Figure 1). For AG-AS, E1 begun cooperation by using sires from E2 for breeding when rg≥0.8. It selected 0 to 42.12% of sires from E2 over generations with h2 of the breeding goal trait varing from 0.1 to 0.5. The proportion of sires from E2 increased as rg increased, regardless of h2. However, the trend of selecting sires from E2 in the long term, i.e., long-term cooperation, differed among different rg. When rg was less than 0.85∼0.875 selection of sires from E2 diminished and even vanished quickly in the first 5 generations. While when rg was larger than 0.85∼0.875 sustainable long-term cooperation became possible and sires from E2 were always selected by E1 to maximize genetic gain. Therefore, 0.85∼0.875 was identified as rg that split up short-term cooperation and long-term cooperation between two cooperating GS breeding programs with equal population sizes. In contrast for AP-AS, the split-point rg between two cooperating PS breeding programs with equal population sizes was ≥ 0.9 and therefore slightly higher than in AG-AS. The trend of proportion of selected external sires was determined by the genetic levels and variances of EBV in two populations over generations (Figure 3). In comparison, given the same rg, say rg = 0.875, GS retained larger variances of EBV of both domestic and external candidates thus resulted in a larger portion of external candidates in the top ranking (the proportion increased from 14% in generation 6 to 22% in generation 20 using AG-AS, while decreased from 10% to 2% using AP-AS). It can be concluded that genomic selection decreases the split-point rg from 0.9 to 0.85–0.875 compared to pedigreed-based selection, because of gaining larger variances in EBV of domestic and external sires.
Figure 3. Distributions of EBV of male candidates from two sources1 at generation 6 and 20 for E12 based on one single randomly selected replicate for EPS3 situation for scenario AG-AS4 and AP-AS4 (h2 = 0.3). 1Candidates from E1 and E2 are in blue and in yellow, respectively. 2E1/E2 refers to environment 1/environment 2. 3EPS refers to the situation that two environments had equal population sizes (1000:1000). 4AG-AS, scenario of across-environment genomic evaluation and across-environment selection for sires; AP-AS, scenario of across-environment pedigree-based evaluation and across-environment selection for sires. Vertical dashed lines mark distribution means.
Unequal Population Sizes
Genomic selection breeding programs realized larger increase in genetic gain by cooperation than conventional PS breeding programs (Table 3). It was obviously AG-AS ≥ WG-WS > AP-AS or WP-WS in terms of the ranking of the genetic gain for both E1 and E2. For E1 that had the small population size, AG-AS increased up to 51.7% in genetic gain relative to WG-WS, and AP-AS increased up to 22% relative to WP-WS. For E2 with the large population size, AG-AS increased up to 4.3% relative to WG-WS, and AP-AS increased up to 0.5% relative to WP-WS. AG-AS had significantly larger increase in the genetic gain than AP-AS given the same h2 and the same rg for both E1 (P = 0.0006) and E2 (P = 0.002), with the maximum difference between the increases realized by AG-AS and AP-AS being 29.7% for E1 and 4.8% for E2 both at rg = 0.9. These results revealed that cooperation was more beneficial for the small population in terms of genetic gain.
Table 3. Genetic gain per generation (ΔG1) and rate of inbreeding per generation (ΔF) realized in E12 and E22 for UPS3 situation (h2 = 0.3) 4.
Genomic selection breeding programs allowed for a lower split-point rg between short-term cooperation and long-term cooperation than PS breeding programs, with 0.6∼0.7 vs. 0.7∼0.8 for the small population and 0.7∼0.8 vs. 0.8∼0.9 for the large population (Figure 4). Using either AG-AS or AP-AS, E1 started to use sires from E2 at a lower rg than E2 did from E1, and E1 selected a larger proportion of sires from E2 than E2 did from E1. Both together suggested the small population was more inclined for cooperation than the large population in order to maximize genetic gain. Moreover, GS breeding programs lowering the split-point rg especially for small population also showed that the small population benefited from cooperation already at substantial degrees of GxE.
Figure 4. Proportions of sires from the other environment in sires selected by E11 and E21 for UPS2 situation for AG-AS3 and AP-AS3 (h2 = 0.3). 1 E1/E2, environment 1/environment 2. 2 UPS, situation that two environments had unequal population sizes (400:1600). 3 AG-AS, scenario of across-environment genomic evaluation and across-environment selection for sires; AP-AS, scenario of across-environment pedigree-based evaluation and across-environment selection for sires. The red horizontal line at Y = 0.02 acts as the baseline, below which it implies that one environment selected no sire from the other environment.
However, based on existing experimental design and results, we found no pattern that GS breeding programs realized larger reduction in rate of inbreeding by cooperation than conventional PS breeding programs (Table 3). It showed that AG-AS ≤ WG-WS < AP-AS < WP-WS in terms of the numeric ranking of rate of inbreeding was only apparent for E2. Even so, the reduction in inbreeding rate realized by AG-AS relative to WG-WS was not always greater than that realized by AP-AS relative to WP-WS given the same h2 and the same rg. Nevertheless, other patterns of interest were found. First, the inbreeding rates realized in E1 by implementing AG-AS and AP-AS both decreased first and then increased as rg increased. At the level of rg where the minimum inbreeding rate appeared, we also observed the largest numbers of male and female ancestors who made long-term contributions to the current generation (not shown). Secondly, E1 displayed greater variation of inbreeding rates across all levels of rg than E2 by implementing either AG-AS or AP-AS. Then note the generally much higher rate of inbreeding in E2 than in E1 when using either WP-WS or AP-AS, while rather similar rates of inbreeding were observed in E2 and E1 when using either WG-WS or AG-AS. Also note the sharp increased rates of inbreeding at rg = 0.9 in E1 by AP-AS (0.0129) and AG-AS (0.0094).
Discussion
Our findings supported our hypothesis that GS breeding programs benefit more from cooperation regarding genetic gain and rate of inbreeding and have greater possibility of long-term cooperation in the presence of G × E than conventional PS breeding programs. Cooperation here referred to the combination of joint genetic evaluation and selection of sires across two environments. Otherwise no cooperation meant two populations remained independent by executing a completely separate breeding program within its own environment. GS increases accuracy therefore increasing genetic gain and decreasing rate of inbreeding. Cooperation with using external bulls from the other environment as candidates, increased the selection intensity thus increased genetic gain and also introduced more genetic diversity therefore decreased rate of inbreeding. Genomic selection increased the benefits of cooperation so that achieving larger increases in genetic gain, larger reductions in rate of inbreeding and a lower split-point genetic correlation. In short, genomic selection shows larger probability of long-term cooperation compared to conventional pedigree-based breeding programs in the presence of G × E. Hence, we recommend cooperative GS breeding programs with mild to moderate G × E depending on the sizes of two populations.
Improvements in Genetic Gain and Rate of Inbreeding
Genomic selection breeding programs realize larger improvement in genetic gain by cooperation than PS breeding programs (Täubert et al., 2011; Thomasen et al., 2020) because of increased selection intensity and accuracy. The increased selection intensity due to incorporating external animals as candidates (Banos and Smith, 1991; Smith and Banos, 1991; Mulder and Bijma, 2006) also applies to PS breeding programs. The increased accuracy is, however, specific to GS breeding programs and mainly attributes to joint genetic evaluation of animals, which increases the reference population and therefore the accuracy. To check the contribution of joint genomic evaluation, we simulated AG-WS in which the cooperation included only joint genomic evaluation, but selection was within environment. Results showed that even such a “single-layer” cooperation strategy was also beneficial for genetic gain and rate of inbreeding, as reported by Slagboom et al. (2019). Subsequently when introducing the second level of cooperation, i.e. across-environment selection of sires, there emerges an additional aspect of increased accuracy: the variances of EBV of both domestic and external candidates increase more under GS breeding strategy than those under PS breeding strategy (Figure 3). This enables GS breeeding strategy not only to result in a higher probability for external sires to be top-ranked and selected, but also highly likely to increase selection intensity in turn due to the larger selection differential. Therefore, in the presence of mild G × E (rg ≥ 0.8) between two equally sized populations from two different environments, cooperative GS breeding programs are superior with a greater ability of increasing genetic gain than cooperative conventional PS breeding programs, due to increased accuracy of both domestic and external sires and increased selection intensity.
Genomic selection breeding programs realizing larger improvement in rate of inbreeding by cooperation is directly supported by the finding that AG-AS had significantly larger reduction in rate of inbreeding than AP-AS. This supportive proof is backed up by two underlying findings. First, GS breeding programs realized lower rate of inbreeding than conventional breeding programs either with cooperative breeding strategy or independent breeding strategy (Thomasen et al., 2020), because GS increases estimation accuracy of the Mendelian sampling term, thus allows for better differentiation within families and leads to lower coselection of sibs. More precisely, GS captures partly the Mendelian sampling term for non-phenotyped individuals and allows more balanced selection (utilizing between- and within-family variation simultaneously) than PS, which does not capture the Mendelian sampling term at all for non-phenotyped individuals (Daetwyler et al., 2007). Secondly, cooperative breeding programs realized lower rate of inbreeding than independent breeding programs using either GS or PS due to the import of genetic material from the external environment that introduces more genetic diversity. Because of the larger proportion of external sires selected, GS keeps its advantage in rate of inbreeding with cooperation and achieves a larger reduction in rate of inbreeding than cooperative PS breeding programs. So in the presence of mild G × E (rg ≥ 0.8) between two equally sized populations, cooperative GS breeding programs have greater capacity of slowing down rate of inbreeding than conventional cooperative PS breeding programs, because of more families selected and more genetic diversity introduced.
AG-AS did not only always have the highest genetic gain and lowest rate of inbreeding than other strategies, irrespective of rg and h2. Meanwhile AG-AS showed two obvious patterns within itself. First, the genetic gain increased as rg increased due to more genetically related traits providing more valuable information and resulting in higher estimation accuracy. Second, the rate of inbreeding first increased then decreased as rg increased with the minimum rate of inbreeding appearing at rg of 0.85∼0.875 — the split-point rg for two equally sized populations, which was well explained by the number of ancestors of long-term contribution. The rate of inbreeding did not trend up all the way like the genetic gain, suggesting that the weaker G × E is not necessarily the better if the balance between inbreeding and genetic gain is desired.
Improvement in Possibility of Long-Term Cooperation
Genomic selection breeding programs have greater probability of long-term cooperation compared to conventional breeding programs, which was supported by the lower split-point rg of AG-AS than that of AP-AS (0.85∼0.875 vs ≥0.9). Given a level of rg, a trade-off exists between the increased difference of genetic levels of two populations and the increased variation of EBVs (increased accuracy) of candidates from these two populations over time, as shown by Figure 3. As long as there is a net-benefit, cooperation will last, otherwise selection of external sires will reduce over time to zero. The result of this trade-off depends on rg and determines how long the cooperation would last (Figure 1). Initiated with the same genetic levels for the breeding goal trait and the “unexpressed” trait in each environment, the genetic level of the breeding goal trait gradually surpassed the “unexpressed” trait with a superiority proportional to 1-rg, according to the different efficiency of direct selection and indirect selection (Falconer, 1952; Mulder and Bijma, 2006). The alleged “unexpressed” trait in one environment was just the breeding goal trait of the other environment. Thus, a higher rg made two environments less distinct regarding the genetic level of the goal trait desired by one environment, which alleviated the difficulty for external sires to squeeze into the top ranking when across-environment selection was allowed. Meanwhile, higher rg added more to the estimation accuracy that was already increased generation by generation due to increasingly accumulated pedigree-phenotypes-genotypes information over generations. This resulted in greater variations in the distributions of EBV of candidates from two sources, which indicated more likelihood for the overlap of two distributions. Therefore, a higher rg performed better in the balancing act between the increased difference of genetic levels and the increased likelihood for the overlap between distributions of EBV of candidates from two sources. In comparison, GS is capable of retaining larger variations of EBV of domestic and external candidates than PS given the same rg, thus displays greater possibility of long-term cooperation. In previous studies, the similar case for short-term cooperation as the present study was also observed by Banos and Smith (1991), which reported that if rg was less than 0.8 it would take at most 4 generations of selection until each population relied exclusively on its own bulls. Mulder and Bijma (2006) concluded that long-term cooperation between two equally sized environments was possible when rg was higher than 0.80 to 0.90 for conventional BLUP evaluation using progeny testing. Smith and Banos (1991) reported the same finding as Mulder and Bijma (2006). The split-point rg is the lowest value of rg for long-term cooperation. The lower split-point rg observed in GS breeding programs implies that, given G × E between two equally sized populations, genomic selection is more likely to keep a long-term sustainable cooperative breeding relationship than conventional selection strategies.
Impacts of Unequal Population Sizes
The benefits of cooperation with GS compared to with PS were larger in the situation of unequal population sizes. The small population obtained much larger benefits in genetic gain by cooperation than the large population. These benefits for the small population resulted from a much larger selection intensity because of a much larger pool of selection candidates with cooperation than with independent breeding programs. Furthermore in the context of GS, the selection accuracy increased as well by cooperation because of a larger reference population and interchange of sires that created sufficient links between both populations. These benefits were smaller for the large population than for the small population, because the increase in selection intensity and accuracy were smaller (Schöpke and Swalve, 2016). As the small population improved, it gradually contributed genetic resources to the overall pool and attracted the large population to select animals from them (Banos and Smith, 1991). Because of the much larger benefits for the small population than for the large population, the small population started to use external sires at a lower rg and therefore had a lower split-point rg compared with the large population. Thus, when G × E is present between two unequally sized populations, GS breeding programs have greater ability to benefit from cooperation in terms genetic gain and more likelihood to keep long-term cooperation than conventional PS breeding programs, especially for the small population.
The inbreeding rates realized in the small environment E1 by AG-AS and AP-AS both decreased first and then increased as rg increased. This pattern could be well explained using the number of ancestors with long-term contributions, in the same way as depicted for the situation with equal population sizes. The large environment E2 displayed smaller variation in inbreeding rates across all levels of rg than E1 by implementing either AG-AS or AP-AS due to selecting very few external sires (Figure 4), therefore being less affected by the genetic resource from the other environment. Daetwyler et al. (2007) reported that increased selection intensity substantially increases rate of inbreeding with BLUP but had no impact on the rate of inbreeding with genomic selection. This explained why we observed much higher rate of inbreeding in E2 than in E1 when using WP-WS or AP-AS, while quite similar rates of inbreeding in E2 and in E1 when using WG-WS or AG-AS. Due to our experimental design, two environments selecting same numbers of sires in generation t = 6…20, E2 had a higher selection intensity and a larger number of matings per sire than E1. E2 having a higher selection intensity than E1 was somewhat obscure for cooperative scenarios because of across-environment selection. But it was true since E2 had larger selection differential than E1 given the same level of rg, therefore effectively resulted in a higher selection intensity. With high rg, the rate of inbreeding of the large environment E2 determined that of the small environment E1, because E1 substantially relied on the importation of external sires from E2. This explained the sharply increased rate of inbreeding at rg = 0.9 in E1 by either AP-AS or AG-AS. Nevertheless, if the effect of current experimental design was removed, we believe that GS breeding programs have greater ability to benefit from cooperation than conventional PS breeding programs in terms of the inbreeding rate given G × E present between two unequally sized populations.
Generalization to Multiple Environments
We studied the situation with only two G × E environments. However, we think the above mechanisms revealed in this study can be generalized to the situation including three or more environments in the presence of G × E within each of the pairs of environments. No matter how many external populations there are or how complicated the G × E structure is among all pairs of environments, the population in operation of selection can always treat the other environments as a whole. Then the seemingly more sophisticated multi-environmental question becomes the dual-environmental question that we investigated in this study. Based on results in this study, the operating population can focus primarily on those environments that have moderate to high genetic correlations with the operating environment and treat those environments as one external environment. From an operational point of view, multiple-across country evaluation (MACE) of bulls (Schaeffer, 1994) enables worldwide selection of bulls and each breeding program can rank all available bulls according to the total merit index for the country of interest. VanRaden and Sullivan (2010) extended MACE to genomic information and Vandenplas et al. (2018) pointed out that summary statistics per SNP are required for joint evaluations, without the need of exchange of genomic and phenotypic data across countries. The broad concepts observed in this study are clearly applicable to multiple environments; from an operational point of view selection across environments can be done when using for instance MACE. There is no point in judging very detailed for every situation whether the genetic correlation is above or below the split-point genetic correlation, because the rankings of bulls will automatically indicate whether external bulls will be selected or not.
Data Availability Statement
The datasets analyzed in this manuscript are not publicly available. Requests to access the datasets should be directed to cao.lu@mbg.au.dk.
Author Contributions
AS and LC conceived this research, designed the scenarios and populations. LC, HL, JT, MK, and AS participated in the preliminary discussion of designed scenarios. LC performed the simulation and analyzed data with great help from HL. All authors involved in the interpretation of results. LC prepared the first draft. All authors contributed to revising the manuscript, especially HM and MH.
Funding
GenSAP (Genomic Selection in Animals and Plants) project provided academically financial supports to this study, while CSC (China Scholarship Council) covered the living stipend for the author’s study abroad.
Conflict of Interest
JT was employed by company VikingGenetics. MK was employed by company SEGES, Denmark.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We are grateful to Emre Karaman, Aoxing Liu, for their theory instructions on across-environment genomic evaluation. We would also like to acknowledge the editor and reviewers for their valuable suggestion and comments.
References
Banos, G., and Smith, C. (1991). Selecting bulls across countries to maximize genetic improvement in dairy cattle. J. Anim. Breed. Genet. 108, 174–181. doi: 10.1111/j.1439-0388.1991.tb00172.x
Bijma, P., and Woolliams, J. A. (2000). Prediction of rates of inbreeding in populations selected on best linear unbiased prediction of breeding value. Genetics 156, 361–373.
Buch, L. H., Sorensen, M. K., Berg, P., Pedersen, L. D., and Sorensen, A. C. (2012). Genomic selection strategies in dairy cattle: strong positive interaction between use of genotypic information and intensive use of young bulls on genetic gain. J. Anim. Breed. Genet. 129, 138–151. doi: 10.1111/j.1439-0388.2011.00947.x
Bulmer, M. G. (1971). The effect of selection on genetic variability. Am. Nat. 105, 201–211. doi: 10.1086/282718
Daetwyler, H. D., Villanueva, B., Bijma, P., and Woolliams, J. A. (2007). Inbreeding in genome-wide selection. J. Anim. Breed. Genet. 124, 369–376. doi: 10.1111/j.1439-0388.2007.00693.x
Dekkers, J. C. M. (1992). Asymptotic response to selection on best linear unbiased predictors of breeding values. Anim. Sci. 54, 351–360. doi: 10.1007/s001220050765
Falconer, D. S. (1952). The problem of environment and selection. Am. Nat. 86, 293–298. doi: 10.1086/281736
Falconer, D. S., and Mackay, T. F. C. (1996a). 19. Correlated characters in Introduction to Quantitative Genetics, 4 Edn. Essex: Longman Group.
Falconer, D. S., and Mackay, T. F. C. (1996b). 4. Small Populations: II. Less Simplified Conditions in Introduction to Quantitative Genetics, 4 Edn. Essex: Longman Group.
Falconer, D. S., and Mackay, T. F. C. (1996c). 8. Variance in Introduction to Quantitative Genetics, 4 Edn. Essex: Longman Group.
Haile-Mariam, M., Carrick, M. J., and Goddard, M. E. (2008). Genotype by environment interaction for fertility, survival, and milk production traits in Australian dairy cattle. J. Dairy Sci. 91, 4840–4853. doi: 10.3168/jds.2008-1084
Hayes, B. J., Daetwyler, H. D., and Goddard, M. E. (2016). Models for genome × environment interaction: examples in livestock. Crop. Sci. 56, 2251–2259. doi: 10.2135/cropsci2015.07.0451
Henderson, C. R., and Quaas, R. L. (1976). Multiple trait evaluation using relatives’ records. J. Anim. Sci. 43, 1188–1197. doi: 10.2527/jas1976.4361188x
Hugo, H. M., Pelcastre-Cruz, A., Castillo-Juárez, H., Ruiz, F. J., and Miglior, F. (2017). Genotype × environment interaction for fertility and milk yield traits in Canadian, Mexican and US Holstein cattle. Span. J. Agric. Res. 15, 1–9. doi: 10.1016/j.livsci.2014.08.010
Huquet, B., Leclerc, H., and Ducrocq, V. (2012). Modelling and estimation of genotype by environment interactions for production traits in French dairy cattle. Genet. Sel. Evol. 44, 35–48. doi: 10.1186/1297-9686-44-35
Konig, S., Dietl, G., Raeder, I., and Swalve, H. H. (2005). Genetic relationships for dairy performance between large-scale and small-scale farm conditions. J. Dairy Sci. 88, 4087–4096. doi: 10.3168/jds.s0022-0302(05)73093-9
Legarra, A., Christensen, O., Aguilar, I., and Misztal, I. (2014). Single step, a general approach for genomic selection. Livest. Sci. 166, 54–65. doi: 10.1016/j.livsci.2014.04.029
Lynch, M., and Walsh, B. (1998). Genetics and Analysis of Quantitative Traits. Sunderland, MA: Sinauer Associates.
Madsen, P., Jensen, J., Labouriau, R., Christensen, O. F., and Sahana, G. (2014). “A Package for Analyzing Multivariate Mixed Models in quantitative Genetics and Genomics,” in Proceedings 10th World Congress on Genetics Applied to Livestock Production, Vancouver.
Montaldo, H. H. (2001). Genotype by environment interactions in livestock breeding programs: a review. Interciencia 26, 229–235.
Mulder, H. A., and Bijma, P. (2006). Benefits of cooperation between breeding programs in the presence of genotype by environment interaction. J. Dairy Sci. 89, 1727–1739. doi: 10.3168/jds.s0022-0302(06)72241-x
Nauta, W. J., Veerkamp, R. F., Brascamp, E. W., and Bovenhuis, H. (2006). Genotype by environment interaction for milk production traits between organic and conventional dairy cattle production in The Netherlands. J. Dairy Sci. 89, 2729–2737. doi: 10.3168/jds.s0022-0302(06)72349-9
Oldenbroek, K. (2007). Utilisation and Conservation of Farm Animal Genetic Resources. The Netherlands: Wageningen Academic Publishers. doi: 10.3920/978-90-8686-592-5
Ottman, R. (1996). Gene–environment interaction: definitions and study designs. Prev. Med. 25, 764–770. doi: 10.1006/pmed.1996.0117
Pedersen, L. D., Sorensen, A. C., Henryon, M., Ansari-Mahyari, S., and Berg, P. (2009). ADAM: a computer program to simulate selective breeding schemes for animals. Livest. Sci. 121, 343–344. doi: 10.1016/j.livsci.2008.06.028
Sargolzaei, M., and Schenkel, F. S. (2009). QMSim: a large-scale genome simulator for livestock. Bioinformatics 25, 680–681. doi: 10.1093/bioinformatics/btp045
Schaeffer, L. R. (1994). Multiple-country comparison of dairy sires. J. Dairy Sci. 77, 2671–2678. doi: 10.3168/jds.s0022-0302(94)77209-x
Schaeffer, L. R. (2006). Strategy for applying genome-wide selection in dairy cattle. J. Anim. Breed. Genet. 123, 218–223. doi: 10.1111/j.1439-0388.2006.00595.x
Schöpke, K., and Swalve, H. H. (2016). Review: opportunities and challenges for small populations of dairy cattle in the era of genomics. Animal 10, 1050–1060. doi: 10.1017/S1751731116000410
Slagboom, M., Kargo, M., Sørensen, A. C., Thomasen, J. R., and Mulder, H. A. (2019). Genomic selection improves the possibility of applying multiple breeding programs in different environments. J. Dairy Sci. 102, 8197–8209. doi: 10.3168/jds.2018-15939
Smith, C., and Banos, G. (1991). Selection within and across populations in livestock improvement. J. Anim. Sci. 69, 2387–2394.
Täubert, H., Rensing, S., and Reinhardt, F. (2011). “Comparing coventional and genomic breeding programs with ZPLAN+,” in Proceedings of the 2011 Interbull Meeting, Stavanger.
Thomasen, J. R., Liu, H., and Sørensen, A. C. (2020). Genotyping more cows increases genetic gain and reduces rate of true inbreeding in a dairy cattle breeding scheme using female reproductive technologies. J. Dairy Sci. 103, 597–606. doi: 10.3168/jds.2019-16974
Vandenplas, J., Calus, M. P. L., and Gorjanc, G. (2018). Genomic prediction using individual-level data and summary statistics from multiple populations. Genetics 210, 53–69. doi: 10.1534/genetics.118.301109
VanRaden, P. M. (2008). Efficient methods to compute genomic predictions. J. Dairy Sci. 91, 4414–4423. doi: 10.3168/jds.2007-0980
VanRaden, P. M., and Sullivan, P. G. (2010). International genomic evaluation methods for dairy cattle. Genet. Sel. Evol. 42:7.
Woolliams, J. A., Berg, P., Dagnachew, B. S., and Meuwissen, T. H. E. (2015). Genetic contributions and their optimization. J. Anim. Breed. Genet. 132, 89–99. doi: 10.1111/jbg.12148
Keywords: joint genetic evaluation, across-environment selection of sires, stochastic simulation, genetic gain, rate of inbreeding, long-term cooperation
Citation: Cao L, Liu H, Mulder HA, Henryon M, Thomasen JR, Kargo M and Sørensen AC (2020) Genomic Breeding Programs Realize Larger Benefits by Cooperation in the Presence of Genotype × Environment Interaction Than Conventional Breeding Programs. Front. Genet. 11:251. doi: 10.3389/fgene.2020.00251
Received: 12 November 2019; Accepted: 02 March 2020;
Published: 21 April 2020.
Edited by:
S. Hong Lee, University of South Australia, AustraliaReviewed by:
Hans D. Daetwyler, La Trobe University, Melbourne, AustraliaGregor Gorjanc, The University of Edinburgh, United Kingdom
Copyright © 2020 Cao, Liu, Mulder, Henryon, Thomasen, Kargo and Sørensen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lu Cao, cao.lu@mbg.au.dk