Impacts of reference population size and methods on the accuracy of genomic prediction for fleece traits in Inner Mongolia Cashmere Goats

Yan, Xiaochun; Zhang, Jiaxin; Li, Jinquan; Wang, Na; Su, Rui; Wang, Zhiying

doi:10.3389/fvets.2024.1325831

ORIGINAL RESEARCH article

Front. Vet. Sci., 05 February 2024

Sec. Livestock Genomics

Volume 11 - 2024 | https://doi.org/10.3389/fvets.2024.1325831

Xiaochun Yan¹

Jiaxin Zhang¹

Jinquan Li¹,²,³,⁴

Na Wang⁵

Rui Su¹*

Zhiying Wang¹*

¹College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
²Inner Mongolia Key Laboratory of Sheep and Goat Genetics Breeding and Reproduction, Hohhot, China
³Key Laboratory of Mutton Sheep and Goat Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hohhot, China
⁴Engineering Research Centre for Goat Genetics and Breeding, Inner Mongolia Autonomous Region, Hohhot, China
⁵Inner Mongolia Yiwei White Cashmere Goat Co., Ltd., Hohhot, China

Introduction: Inner Mongolia Cashmere Goats (IMCGs) are famous for their cashmere quality and are a unique genetic resource in China. It is therefore necessary to use genomic selection to improve the accuracy of selection for fleece traits in IMCGs. The aim of this study was to determine the effect of the prediction method (GBLUP, BayesA, BayesB, Bayesian LASSO, Bayesian Ridge Regression) and the reference population size on the accuracy of genomic selection in IMCGs.

Methods: This study used the pedigree and phenotype records of fleece traits for 2,255 individuals, genotypes at 50,794 SNPs after quality control, and environmental data to perform genomic selection for fleece traits. GBLUP and Bayesian methods (BayesA, BayesB, Bayesian LASSO, Bayesian Ridge Regression) were used to estimate genetic parameters and genomic breeding values. The accuracy of the genomic estimated breeding value (GEBV) was evaluated using five-fold cross-validation. Analysis of variance and multiple comparisons were used to determine the best method for genomic selection of fleece traits in IMCGs. Reference populations of different sizes (500, 1,000, 1,500, and 2,000) were then set, and the best method was applied to estimate genomic breeding values and to evaluate the impact of reference population size on the accuracy of genomic selection for fleece traits in IMCGs.

Results: The genomic prediction accuracy for each fleece trait in IMCGs was highest with the GBLUP method, and it was significantly higher than that obtained with the Bayesian methods. With GBLUP, the accuracy of breeding value estimation was 58.52%–68.49%. The size of the reference population also had a significant impact on the accuracy of genomic prediction for fleece traits. With a reference population of 2,000, the accuracy of genomic prediction for each fleece trait was significantly higher than at the other levels, with accuracies of 55.47%–67.87%. These results provide a theoretical basis for designing a reasonable genomic selection plan for Inner Mongolia Cashmere Goats at a later stage.

1 Introduction

China is a major country in terms of cashmere goat numbers and cashmere production. By the end of 2022, the number of goats in China was 92.0 million, and the cashmere production was 15,243.64 tons (http://www.stats.gov.cn), which accounts for 80% of the world’s goat population (https://www.fao.org/). Inner Mongolia Cashmere Goats (IMCGs) are a major cashmere goat breed in China, famous for high cashmere production and excellent cashmere quality. According to geographical distribution, the breed is divided into three types: Arbas, Erlangshan, and Alxa (1). Reducing cashmere diameter (CD) and increasing cashmere production (CP) are important goals of IMCG breeding. In previous studies, genetic evaluation for fleece traits in IMCGs was performed by the BLUP method (2), and the fleece traits improved to a certain degree. With the development of quantitative genetics and molecular biology, the breeding methods of livestock have improved (3). To improve breeding efficiency and achieve early selection, the breeding methods for goats should be updated, and genomic selection therefore needs to be implemented. The idea of genomic selection was proposed by Meuwissen et al. (4). It has been reported that genomic selection has significant advantages for traits that have low heritability or are difficult to measure (5). Genomic selection can improve the accuracy of estimated breeding values, increase genetic progress, and reduce breeding costs (6–8). The factors that affect the accuracy of genomic selection include the prediction method (9), reference population size (10), heritability (11), and marker density (12).

With the development of genetics and statistics, a large number of methods for estimating genomic breeding values have been proposed. According to their statistical models, these methods can be divided into three categories: genomic best linear unbiased prediction (GBLUP), ridge regression best linear unbiased prediction (RRBLUP), and Bayesian methods (BayesA, BayesB, BayesCπ, Bayesian LASSO, and BayesRR). The GBLUP and RRBLUP models assume that every SNP explains the same variance, and the advantage of this assumption is that only one variance needs to be estimated. In reality, SNP effects have different variance structures. Peters et al. used different BayesB models to compare the accuracy of GEBV for milk traits of 695 Canadian Holstein cows (13). The prediction accuracy with the BayesB method was significantly higher than that with the GBLUP method for milk traits. Lopes et al. used five methods, including BayesA, BayesB, BayesCπ, BLUP, and ssGBLUP, to evaluate the accuracy of genomic prediction for meat and carcass traits in Nelore cattle. The accuracy of GEBV did not differ significantly among the five methods (14).

Generally, the larger the reference population, the richer the genotype and phenotype information, and the higher the accuracy of the GEBV obtained (15). Takeda et al. compared the estimated breeding values for five carcass traits of Japanese Black cattle under different reference population sizes (16). The accuracy of GEBV increased as the reference population size expanded. Lillehammer et al. used simulated data to perform genomic selection of maternal traits in pigs. They reported that the genetic progress obtained with a reference population size of 1,000 was significantly higher than that with a reference population of 5,000 (17).

Genomic selection for cashmere goats was implemented relatively late in China. Previous studies have identified factors that affect the accuracy of GEBV in goats using simulated data. This study is the first to perform genomic selection for the fleece traits of Inner Mongolia Cashmere Goats. Five different methods were used to estimate the genomic breeding values of fleece traits in IMCGs, and the impact of these methods on the accuracy of GEBV was compared. The best method was then used to determine the impact of reference population size on the accuracy of GEBV, providing a theoretical basis for designing the breeding plan for fleece traits in Inner Mongolia Cashmere Goats.

2 Materials and methods

2.1 Genotype data

The individuals were genotyped using the Illumina GGP_Goat_70K BeadChip (Illumina, San Diego, CA). Markers on the X chromosome were discarded. Quality control of SNPs was based on minor allele frequency (MAF > 0.05), proportion of missing genotypes (missing < 0.05), and Hardy–Weinberg equilibrium (HWE P > 10⁻⁶); SNPs failing any criterion were removed. Moreover, individuals with more than 10% missing genotypes were excluded. In total, 44 individuals and 16,294 SNPs were deleted from the raw genotype data, leaving 2,255 individuals and 50,794 SNPs for the subsequent analysis.
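The MAF and missing-rate thresholds above can be illustrated with a minimal sketch. It is not the authors' pipeline (the paper does not name the QC software); the function names, the 0/1/2 genotype coding with `None` for a missing call, and the omission of the Hardy–Weinberg test are all assumptions made for illustration.

```python
from typing import List, Optional

# Assumed coding: 0, 1, or 2 copies of the counted allele; None = missing call.
Genotype = Optional[int]


def snp_passes_qc(calls: List[Genotype],
                  min_maf: float = 0.05,
                  max_missing: float = 0.05) -> bool:
    """Return True if one SNP passes the MAF and missing-rate filters.

    The Hardy-Weinberg test used in the paper is deliberately left out.
    """
    observed = [g for g in calls if g is not None]
    missing_rate = 1.0 - len(observed) / len(calls)
    if not observed or missing_rate >= max_missing:
        return False
    allele_freq = sum(observed) / (2 * len(observed))
    maf = min(allele_freq, 1.0 - allele_freq)
    return maf > min_maf


def sample_passes_qc(calls: List[Genotype], max_missing: float = 0.10) -> bool:
    """Return True if one animal has no more than 10% missing genotypes."""
    missing_rate = sum(g is None for g in calls) / len(calls)
    return missing_rate <= max_missing
```

Applying `snp_passes_qc` to each marker column and `sample_passes_qc` to each animal row reproduces the two filters described in the text; the order in which the filters are applied is not stated in the paper and can change the final counts slightly.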

2.2 Phenotypic data

The phenotypic data were collected from Inner Mongolia Yiwei White Cashmere Goat Limited Liability Company, Wulan Town, Etuoke Banner, Ordos City, Inner Mongolia Autonomous Region, China (39°12′N; 107°97′E). The production performance records of fleece traits for 2,255 individuals (372 males and 1,883 females) aged 1 to 3 years were collected from 2018 to 2021. Four fleece traits were considered: cashmere production (CP), cashmere diameter (CD), cashmere length (CL), and fiber length (FL). The basic statistics of the phenotype data were computed using Microsoft Excel and R software.

2.3 Estimation of genomic breeding value

In this study, the fixed effects considered were sex, year of production, herd, and individual age. They were chosen based on the previous results of our research team (2, 18–20). A linear mixed model was used to estimate the genomic breeding values for fleece traits in IMCGs with the BayesA, BayesB, Bayesian LASSO, Bayesian Ridge Regression, and GBLUP methods. All methods were run in the BGLR software (21).

2.3.1 GBLUP method

VanRaden (22) proposed the GBLUP method, which replaces the traditional pedigree-based kinship matrix A with the additive relationship matrix G constructed from genetic markers and then estimates the genomic breeding values of individuals. The model for the GBLUP method is as follows (Eq. 1):

$$ y = \mu + X b + Z a + e \tag{1} $$

where $y$ is the vector of observations, $\mu$ is the overall mean of the observations, $b$ is the vector of fixed effects, $a$ is the vector of additive genetic effects, which follows the normal distribution $a \sim N(0, G\sigma_{a}^{2})$, in which $\sigma_{a}^{2}$ is the additive genetic variance, and $e$ is the vector of residuals. $X$ is the incidence matrix for the fixed effects and $Z$ is the incidence matrix for the additive genetic effects.
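To make the marker-based matrix G concrete, the sketch below builds a genomic relationship matrix from allele counts. The paper does not state which formula was used, so this assumes VanRaden's first method, in which genotypes are centred by twice the allele frequency and the cross-product is scaled by 2Σp(1 − p); the function name `vanraden_g` and the plain-list representation are illustrative choices only.

```python
from typing import List


def vanraden_g(genotypes: List[List[int]]) -> List[List[float]]:
    """Genomic relationship matrix G = W W' / (2 * sum_j p_j * (1 - p_j)).

    genotypes[i][j] is the allele count (0, 1, or 2) of animal i at SNP j.
    Assumes no missing calls and at least one polymorphic SNP.
    """
    n_animals = len(genotypes)
    n_snps = len(genotypes[0])
    # Allele frequency of each SNP, estimated from the genotyped animals.
    freqs = [sum(row[j] for row in genotypes) / (2 * n_animals)
             for j in range(n_snps)]
    # Centre every genotype by twice the allele frequency of its SNP.
    centred = [[row[j] - 2 * freqs[j] for j in range(n_snps)]
               for row in genotypes]
    scale = 2 * sum(p * (1 - p) for p in freqs)
    return [[sum(a * b for a, b in zip(centred[i], centred[k])) / scale
             for k in range(n_animals)]
            for i in range(n_animals)]
```

Because the allele frequencies are estimated from the same animals, each row of the resulting matrix sums to zero. For 2,255 animals and 50,794 SNPs a matrix library would be used in practice; the nested lists here only keep the example dependency-free.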

2.3.2 Bayesian series methods

The BayesA method assumes that marker effects follow a t-distribution, so that a large number of markers have small effects on the target trait while a small number have larger effects. The BayesB method also assumes that some SNP effects follow a t-distribution, but a large proportion of effects are zero and only some QTLs have larger effects. Bayesian LASSO is similar to BayesA, except that it assumes that marker effects follow a double exponential distribution, which changes the posterior distribution of the marker effects accordingly. The Bayesian Ridge Regression (BayesRR) method assumes that the variance of each locus effect is specified by a certain percentage of the total genetic variance; the locus effects in BayesRR follow normal distributions. The assumed distribution of marker effects in each Bayesian method and the corresponding formulas are shown in Table 1 (6, 23–25). In this study, the model for the Bayesian methods is as follows (Eq. 2):

$$ y = \mu + X b + \sum_{j=1}^{n} Z_{ij} a_{j} + e \tag{2} $$

Table 1. Basic description of Bayesian methods.

Here, $y$ is the vector of observations, $\mu$ is the overall mean of the observations, $X$ is the incidence matrix for the fixed effects, and $b$ is the vector of fixed effects. $Z_{ij}$ is the genotype of individual $i$ at site $j$, $a_{j}$ is the effect of site $j$, and $n$ is the number of sites, so $\sum_{j=1}^{n} Z_{ij} a_{j}$ is the breeding value of individual $i$; $e$ is the vector of residual effects.
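The summation term in the marker-effect model can be written out directly. The sketch below only evaluates the breeding value of each individual from marker effects that have already been estimated; it does not implement any of the Bayesian samplers, and the function name `gebv_from_marker_effects` is hypothetical.

```python
from typing import List


def gebv_from_marker_effects(genotypes: List[List[float]],
                             effects: List[float]) -> List[float]:
    """Return sum_j Z_ij * a_j for every individual i.

    genotypes[i][j] is the genotype code of individual i at site j and
    effects[j] is the estimated effect of site j.
    """
    if any(len(row) != len(effects) for row in genotypes):
        raise ValueError("each genotype row needs one value per marker effect")
    return [sum(z * a for z, a in zip(row, effects)) for row in genotypes]
```

The Bayesian methods differ only in the prior placed on the effects, so whichever method produces the estimates, the GEBV is assembled in this same way.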

2.4 Accuracy of predicted genomic breeding value

In this study, 5-fold cross-validation was used to evaluate the accuracy of genomic prediction. The 2,255 individuals were randomly divided into five groups; each time, one group (451 individuals) was selected as the validation population and the other four groups (1,804 individuals) were used as the training population. Five repetitions were performed. The accuracy of genomic prediction was evaluated as the correlation coefficient between the GEBV and the corrected phenotype value in the validation population.
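The cross-validation scheme can be sketched as follows. This is an illustration under assumptions rather than the authors' script: the helper names `k_fold_splits` and `pearson`, the fixed random seed, and the interleaved fold assignment are all hypothetical choices, and the GEBV prediction step itself is left out.

```python
import random
from math import sqrt
from typing import List, Sequence, Tuple


def k_fold_splits(n: int, k: int = 5,
                  seed: int = 0) -> List[Tuple[List[int], List[int]]]:
    """Randomly split indices 0..n-1 into k folds.

    Returns one (training, validation) pair of index lists per fold.
    """
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    folds = [indices[f::k] for f in range(k)]
    splits = []
    for f in range(k):
        validation = folds[f]
        training = [i for g in range(k) if g != f for i in folds[g]]
        splits.append((training, validation))
    return splits


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation, used here as the prediction accuracy."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)
```

With `n = 2255` and `k = 5`, every validation fold has 451 animals and every training set 1,804, matching the group sizes given in the text. In each fold, the model would be fitted on the training indices and `pearson` applied to the GEBV and corrected phenotypes of the validation indices.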

Finally, we used a one-way analysis of variance and multiple comparison methods to determine the best method for genomic selection of the fleece traits of IMCGs. Furthermore, different reference population sizes (500, 1,000, 1,500, and 2,000) were set, and then the best method was used to estimate GEBV and to evaluate the impact of reference population sizes on the accuracy of genomic prediction for fleece traits in IMCGs.
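The one-way analysis of variance can be sketched as an F statistic computed over groups of accuracy values. The paper does not say which observations entered the ANOVA; the sketch assumes that each method (or each reference population size) contributes one group of per-fold accuracies. The function name `one_way_anova_f` is hypothetical, and the multiple-comparison step is not shown.

```python
from typing import List, Sequence


def one_way_anova_f(groups: Sequence[Sequence[float]]) -> float:
    """F statistic of a one-way ANOVA: between-group over within-group mean square."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means: List[float] = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, group_means))
    ss_within = sum((x - m) ** 2
                    for g, m in zip(groups, group_means) for x in g)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    return ms_between / ms_within
```

The resulting value is compared against the F distribution with (k − 1, N − k) degrees of freedom to obtain the significance level; a statistics package would normally handle that step together with the post hoc comparisons.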

3 Results

3.1 Genotypic characteristics and phenotypic statistics

After quality control, the SNPs were evenly distributed across the 29 goat autosomes (Figure 1). A total of 50,794 SNPs were retained for the subsequent analysis. Four fleece traits were recorded, and the descriptive statistics of the phenotype data for each trait are presented in Table 2, including the abbreviation of each trait, the number of records (N), the maximum (Max), minimum (Min), mean, standard deviation (SD), and coefficient of variation (CV). In males, the mean values of fiber length, cashmere diameter, cashmere length, and cashmere production were 20.67 cm, 14.91 μm, 6.68 cm, and 1022.26 g, and the corresponding coefficients of variation were 20.46%, 6.44%, 17.66%, and 37.27%, respectively. In females, the mean values of fiber length, cashmere diameter, cashmere length, and cashmere production were 19.27 cm, 15.20 μm, 6.43 cm, and 762.84 g, and the corresponding coefficients of variation were 24.08%, 4.87%, 16.49%, and 23.58%, respectively.

Figure 1. Distribution of SNP density on each chromosome. The figure shows the number of SNPs within 1 Mb window size. As the color changes from green to red, the number of SNPs increases.

Table 2. Descriptive statistics of phenotypic values of fleece traits in IMCGs.

3.2 Effect of GBLUP and Bayesian methods on the accuracy of GEBV

First, the BayesA, BayesB, Bayesian LASSO, BayesRR, and GBLUP methods were used to estimate the genomic breeding values of fleece traits in Inner Mongolia Cashmere Goats. Analysis of variance and multiple comparisons were then used to determine the best method for genomic selection of fleece traits in IMCGs. The results of the variance analysis are presented in Table 3. The method had a significant effect on the accuracy of genomic prediction for cashmere length and cashmere production but no significant effect on the accuracy for fiber length or cashmere diameter. The multiple comparison results for the accuracy of genomic prediction of fleece traits under the five methods are shown in Table 4 and Figure 2. The ranges of genomic prediction accuracy for the fleece traits with the GBLUP, BayesA, BayesB, Bayesian LASSO, and BayesRR methods were 58.52%–68.49%, 52.97%–64.89%, 53.00%–65.04%, 54.01%–61.43%, and 51.95%–61.56%, respectively. The genomic prediction accuracy with the GBLUP method was better than that with the BayesA, BayesB, Bayesian LASSO, and BayesRR methods. There was no significant difference in prediction accuracy among the Bayesian methods for the fleece traits in Inner Mongolia Cashmere Goats.

Table 3. Variance analysis of the impact of methods on the accuracy of GEBV for fleece traits in Inner Mongolia Cashmere Goats.

Table 4. Accuracy of GEBV in each fleece trait under different methods.

Figure 2. Comparison of the accuracy of GEBV for fleece traits with different methods. The x-axis represents the methods used in this study to estimate the genomic breeding values of fleece traits in Inner Mongolia Cashmere Goats, and the y-axis represents the accuracy of the estimated genomic breeding values under each method. Different letters on the graph indicate significant differences, while the same letters indicate no significant difference.

3.3 Effect of reference population size on the accuracy of GEBV

This study also compared the impact of different reference population sizes on the accuracy of estimated genomic breeding values for fleece traits in Inner Mongolia Cashmere Goats. Based on the above results, GBLUP was the best method for genomic prediction of fleece traits in Inner Mongolia Cashmere Goats. Reference populations with sizes of 500, 1,000, 1,500, and 2,000 were set to perform genomic selection of fleece traits in IMCGs. The results of the variance analysis of reference population sizes are presented in Table 5. Reference population size had a significant effect on the accuracy of genomic prediction for fleece traits in IMCGs. The multiple comparison results for the accuracy of genomic prediction of fleece traits under different reference population sizes are shown in Table 6 and Figure 3. For CL, there was no significant difference in the accuracy of GEBV between reference population sizes of 1,500 and 2,000. However, the accuracy of GEBV with reference population sizes of 1,500 and 2,000 was significantly higher than that with 500 and 1,000; at these two sizes, the accuracy of GEBV for CL was 56.91%–58.39%. For FL, CP, and CD, there was a significant difference between the reference population size of 2,000 and the other three levels (500, 1,000, and 1,500). With a reference population of 2,000, the accuracy of GEBV was 55.47%, 67.87%, and 60.11% for FL, CP, and CD, respectively. Therefore, the reference population size needs to be expanded to perform genomic selection in IMCGs.

Table 5. Variance analysis of the impact of reference population size on the accuracy of GEBV for fleece traits in Inner Mongolia Cashmere Goats.

Table 6. Accuracy of GEBV in each fleece trait under different reference population size levels.

Figure 3. Comparison of the accuracy of GEBV for fleece traits with different reference population sizes. The x-axis represents the reference population sizes used in this study to estimate the genomic breeding values of fleece traits in Inner Mongolia Cashmere Goats, and the y-axis represents the accuracy of the estimated genomic breeding values at each reference population size. Different letters on the graph indicate significant differences, while the same letters indicate no significant difference.

4 Discussion

In order to effectively apply genomic selection to design the breeding plan for Inner Mongolia Cashmere Goats, it is necessary to determine the factors affecting prediction accuracy. Therefore, we collected the cashmere performance records of 2,255 individuals to investigate the influence of methods and reference population size on the accuracy of genomic prediction.

This study compared the genomic prediction ability for fleece traits in IMCGs using the GBLUP and Bayesian methods (BayesA, BayesB, Bayesian LASSO, and Bayesian Ridge Regression). The method had a significant effect on the accuracy of genomic prediction for cashmere length and cashmere production, and the genomic prediction accuracy with the GBLUP method was better than that with the Bayesian methods. This result is consistent with many previous studies. Baby et al. used the GBLUP and BayesB methods to evaluate the genomic estimated breeding values for 16 meat quality traits in a Berkshire population (n = 1,191) (26). With the GBLUP method, the GEBV accuracy ranged from 0.42 for collagen to 0.75 for water-holding capacity. Under the BayesB model, the GEBV accuracy ranged from 0.10 for the National Pork Producers Council marbling score to 0.76 for drip loss. Zhu et al. (27) used the GBLUP and Bayesian alphabet models to estimate the genomic breeding values of six wool traits in Alpine Merino sheep. The accuracy of the GBLUP method was slightly higher than that of the Bayesian methods. For the low-density SNP genotype datasets, the genomic prediction accuracy of wool traits was 0.34–0.60 for GBLUP. For the high-density SNP genotype datasets, it was 0.35–0.57 for GBLUP. Silva et al. reported the genomic prediction ability for carcass composition indicator traits in Nellore cattle using the BLUP, GBLUP, ssGBLUP, and Bayesian methods (BayesA, BayesB, BayesC, and Bayesian LASSO) (28). For the visual score trait, predictive ability and bias were identical between the Bayesian and GBLUP methods. However, the accuracy of GEBV with the GBLUP method was higher than that with the BayesB method for carcass traits. Vu et al. evaluated the impact of different prediction methods (BayesA, BayesCπ, and GBLUP) on the accuracy of GEBV in the Portuguese oyster (Crassostrea angulata) (29). The accuracy with GBLUP was slightly higher than that with the Bayesian methods, but there was no significant difference among the methods. The genomic prediction accuracy for the traits was 0.240–0.794. With the continuous progress of breeding work, more efficient and simpler models will be developed and optimized. Applying these methods to the genomic selection of important traits in livestock and poultry will accelerate the breeding process of the population.

The size of the reference population is an important factor affecting the accuracy of genomic selection, so constructing a reasonable reference population for genomic selection in IMCGs is important. In this study, different reference population sizes (500, 1,000, 1,500, and 2,000) were set to evaluate the accuracy of genomic selection for fleece traits in IMCGs. The size of the reference population had a significant impact on the accuracy of genomic prediction for fleece traits. Baby et al. reported that the GEBV accuracy increased with the size of the training data. In general, the GEBV accuracy with the BayesB model was lower than that with the GBLUP model, especially for small training sample sizes (26). Uemoto et al. (30) used simulated phenotype data under different scenarios to assess how population size affects the prediction accuracy of GEBV using a reference-test validation design. A large population size was needed to increase the accuracy of GEBV. Nwogwugwu et al. assessed genomic prediction ability using reference populations of 1,000, 2,000, 3,000, and 5,000 animals randomly selected from generations 7, 8, and 9 in a simulated Korean beef cattle population (31). According to the simulation results, the accuracy of genomic selection gradually increased as the size of the reference population increased. Kabanov et al. used three methods to assess breeding value and predictability for five main traits of Large White pigs (32). Their results showed that the accuracy of genomic selection also gradually increased with the size of the reference population. These studies indicate that the size of the reference population affects the accuracy of genomic selection. Once the reference population reaches a certain size, however, the accuracy of genomic selection no longer improves significantly. This is what we observed for the cashmere length trait: the accuracy of genomic selection in IMCGs did not differ significantly between reference population sizes of 1,500 and 2,000. Therefore, it is important to choose a reasonable reference population size, which can ensure the accuracy of genomic selection while saving costs.

5 Conclusion

To summarize, this study used GBLUP and Bayesian methods (BayesA, BayesB, Bayesian LASSO, and Bayesian Ridge Regression) to perform genomic prediction, and 5-fold cross-validation was used to evaluate the accuracy of GEBV. The prediction accuracy for fleece traits in IMCGs was highest with the GBLUP method, which indicates that GBLUP should be used for the genomic selection of Inner Mongolia Cashmere Goats. The accuracy of genomic prediction for fleece traits with a reference population of 2,000 was significantly higher than that with smaller reference populations. Therefore, the reference population needs to be expanded further to increase the accuracy of GEBV for fleece traits in Inner Mongolia Cashmere Goats.

Data availability statement

The original contributions presented in the study are publicly available. These data can be found here: https://db.cngb.org/; CNP0005155.

Ethics statement

The animal studies were reviewed and approved by the Laboratory Animal Welfare and Animal Experiment Ethics Inspection Committee of Inner Mongolia Agricultural University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent was obtained from the owners for the participation of their animals in this study.

Author contributions

XY: Data curation, Formal analysis, Software, Writing – original draft, Writing – review & editing. JZ: Software, Writing – original draft. JL: Writing – review & editing. NW: Data curation, Writing – original draft. RS: Writing – original draft, Writing – review & editing. ZW: Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. The authors are grateful for the grants from the National Key Research and Development Program of China [2022YFE0113300, 2022YFD1300201, and 2022YFD1300204], the Science and Technology Research Project of Inner Mongolia Autonomous Region [2021GG0086], the China Agriculture Research System of MOF and MARA [No. CARS-39], the “Youth Science and Technology Talent Support Plan” of colleges and universities in Inner Mongolia Autonomous Region [NJYT22038], the Program for Innovative Research Team in Universities of Inner Mongolia Autonomous Region [NMGIRT2322], the Inner Mongolia Agricultural University Outstanding Youth Science Fund Cultivation Project [BR230304], the High-level Achievement Cultivation Special Project of School of Animal Science, Inner Mongolia Agricultural University [GZL202204], and the Natural Science Foundation of Inner Mongolia [2021MS03093].

Conflict of interest

NW was employed by the company Inner Mongolia Yiwei White Cashmere Goat Co. Ltd.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Li, YR, Li, JQ, Gao, DP, Zhang, LL, Zhou, HM, An, YJ, et al. Estimates of breeding value of Inner Mongolia cashmere goats using animal model BLUP method. Yi Chuan Xue Bao. (2000) 27:777–86.

2. Wang, Z, Wang, R, Zhang, W, Wang, Z, Wang, P, Liu, H, et al. Estimation of genetic parameters for fleece traits in yearling Inner Mongolia cashmere goat. Small Rumin Res. (2013) 109:15–21. doi: 10.1016/j.smallrumres.2012.07.016

3. Rabier, CE, Barre, P, Asp, T, Charmet, G, and Mangin, B. On the accuracy of genomic selection. PLoS One. (2016) 11:e0156086. doi: 10.1371/journal.pone.0156086

4. Meuwissen, TH, Hayes, BJ, and Goddard, ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. (2001) 157:1819–29. doi: 10.1093/genetics/157.4.1819

5. Zhang, J, Wang, J, Li, Q, Wang, Q, Wen, J, and Zhao, G. Comparison of the efficiency of BLUP and GBLUP in genomic prediction of immune traits in chickens. Animals. (2020) 10:419. doi: 10.3390/ani10030419

6. Kim, EH, Kang, HC, Sun, DW, Myung, CH, Kim, JY, Lee, DH, et al. Estimation of breeding value and accuracy using pedigree and genotype of Hanwoo cows (Korean cattle). J Anim Breed Genet. (2022) 139:281–91. doi: 10.1111/jbg.12661

7. Scott, B, Haile-Mariam, M, Cocks, B, and Pryce, J. How genomic selection has increased rates of genetic gain and inbreeding in the Australian national herd, genomic information nucleus, and bulls. J Dairy Sci. (2021) 104:11832–49. doi: 10.3168/jds.2021-20326

8. Song, H, and Hu, H. Strategies to improve the accuracy and reduce costs of genomic prediction in aquaculture species. Evol Appl. (2021)

9. Colombani, C, Legarra, A, Fritz, S, Guillaume, F, Croiseau, P, Ducrocq, V, et al. Application of Bayesian least absolute shrinkage and selection operator (LASSO) and bayes Cπ methods for genomic selection in French Holstein and Montbéliarde breeds. J Dairy Sci. (2013) 96:575–91. doi: 10.3168/jds.2011-5225

10. Esfandyari, H, Sørensen, A, and Bijma, P. A crossbred reference population can improve the response to genomic selection for crossbred performance. Gen Select Evol. (2015) 47:76. doi: 10.1186/s12711-015-0155-z

11. Villumsen, TM, Janss, L, and Lund, MS. The importance of haplotype length and heritability using genomic selection in dairy cattle. J Animal Breed Gen. (2015) 126. doi: 10.1111/j.1439-0388.2008.00747.x

12. Ma, P, Lund, MS, Aamand, GP, and Su, G. Use of a Bayesian model including QTL markers increases prediction reliability when test animals are distant from the reference population. J Dairy Sci. (2019) 102:7237–47. doi: 10.3168/jds.2018-15815

13. Peters, SO, Kızılkaya, K, Ibeagha-Awemu, EM, Sinecen, M, and Zhao, X. Comparative accuracies of genetic values predicted for economically important milk traits, genome-wide association, and linkage disequilibrium patterns of Canadian Holstein cows. J Dairy Sci. (2020) 104:1900–16. doi: 10.3168/jds.2020-18489

14. Lopes, FB, Baldi, F, Passafaro, TL, Brunes, LC, Costa, MFO, Eifert, EC, et al. Genome-enabled prediction of meat and carcass traits using Bayesian regression, single-step genomic best linear unbiased prediction and blending methods in Nelore cattle. Animal. (2021) 15:100006. doi: 10.1016/j.animal.2020.100006

15. Yan, X, Zhang, T, Liu, L, Yu, Y, Yang, G, Han, Y, et al. Accuracy of genomic selection for important economic traits of cashmere and meat goats assessed by simulation study. Front Vet Sci. (2022) 9. doi: 10.3389/fvets.2022.770539

16. Takeda, M, Inoue, K, Oyama, H, Uchiyama, K, Yoshinari, K, Sasago, N, et al. Exploring the size of reference population for expected accuracy of genomic prediction using simulated and real data in Japanese black cattle. BMC Genomics. (2021) 22:799. doi: 10.1186/s12864-021-08121-z

17. Lillehammer, M, Sonesson, AK, and Meuwissen, THE. Use of field data in pig genomic selection schemes: a simulation study. Animal. (2016) 10:1025–32. doi: 10.1017/S1751731115002669

18. Li, J. Study on breeding methods in Inner Mongolia cashmere goats. PhD. China Agricultural University (2005).

19. Wang, F. Design of goat SNP chip with applications in genome-wide association study and genomic selection of important economic traits in Inner Mongolia cashmere goat. PhD. Inner Mongolia Agricultural University (2022).

20. Wang, Z. Study on principle and method of early selection of fleece traits in Inner Mongolia cashmere goats. PhD. Inner Mongolia Agricultural University (2017).

21. Pérez, P, and de los Campos, G. BGLR: a statistical package for whole genome regression and prediction. (2013).

22. VanRaden, PM. Efficient methods to compute genomic predictions. J Dairy Sci. (2008) 91:4414–23. doi: 10.3168/jds.2007-0980

23. Meuwissen, TH, Solberg, TR, Shepherd, R, and Woolliams, JA. A fast algorithm for BayesB type of prediction of genome-wide estimates of genetic value. Genet Sel Evol. (2009) 41:2. doi: 10.1186/1297-9686-41-2

24. Park, T, and Casella, G. The Bayesian Lasso. J Am Stat Assoc. (2008) 103:681–6. doi: 10.1198/016214508000000337

25. Brøndum, RF, Su, G, Lund, MS, Bowman, PJ, Goddard, ME, and Hayes, BJ. Genome position specific priors for genomic prediction. BMC Genomics. (2012) 13:543. doi: 10.1186/1471-2164-13-543

26. Baby, S, Hyeong, KE, Lee, YM, Jung, JH, Oh, DY, Nam, KC, et al. Evaluation of genome based estimated breeding values for meat quality in a Berkshire population using high density single nucleotide polymorphism chips. Asian Australas J Anim Sci. (2014) 27:1540–7. doi: 10.5713/ajas.2014.14371

27. Zhu, S, Guo, T, Yuan, C, Liu, J, Li, J, Han, M, et al. Evaluation of Bayesian alphabet and GBLUP based on different marker density for genomic prediction in Alpine Merino sheep. G3. (2021) 11:jkab206. doi: 10.1093/g3journal/jkab206

28. Silva, RP, Espigolan, R, Berton, MP, Lôbo, RB, and Baldi, F. Genomic prediction ability for carcass composition indicator traits in Nellore cattle. Livest Sci. (2021) 245:104421. doi: 10.1016/j.livsci.2021.104421

29. Vu, SV, Gondro, C, Nguyen, NTH, Gilmour, AR, and O'Connor, W. Prediction accuracies of genomic selection for nine commercially important traits in the Portuguese oyster (Crassostrea angulata) using DArT-Seq technology. Genes. (2021) 12:210. doi: 10.3390/genes12020210

30. Uemoto, Y, Sasaki, S, Kojima, T, Sugimoto, Y, and Watanabe, T. Impact of QTL minor allele frequency on genomic evaluation using real genotype data and simulated phenotypes in Japanese Black cattle. BMC Genet. (2015) 16:134. doi: 10.1186/s12863-015-0287-8

31. Nwogwugwu, CP, Choi, Y, Lee, H, Heon, J, and Lee, S-H. Assessment of genomic prediction accuracy using different selection and evaluation approaches in a simulated Korean beef cattle population. Asian Australas J Anim Sci. (2020) 33:1912–21. doi: 10.5713/ajas.20.0217

32. Kabanov, A, Melnikova, E, Nikitin, S, Somova, M, Fomenko, O, Volkova, V, et al. Weighted single-step genomic best linear unbiased prediction method application for assessing pigs on meat productivity and reproduction traits. Animals. (2022) 12:1693. doi: 10.3390/ani12131693

Keywords: genomic selection, Inner Mongolia Cashmere Goats, GBLUP method, Bayesian method, reference population sizes

Citation: Yan X, Zhang J, Li J, Wang N, Su R and Wang Z (2024) Impacts of reference population size and methods on the accuracy of genomic prediction for fleece traits in Inner Mongolia Cashmere Goats. Front. Vet. Sci. 11:1325831. doi: 10.3389/fvets.2024.1325831

Received: 22 October 2023; Accepted: 08 January 2024;
Published: 05 February 2024.

Edited by:

Filippo Biscarini, National Research Council (CNR), Italy

Reviewed by:

Hugo Oswaldo Toledo-Alvarado, National Autonomous University of Mexico, Mexico
Zeying Wang, Shenyang Agricultural University, China
Ran Di, Chinese Academy of Agricultural Sciences, China

Copyright © 2024 Yan, Zhang, Li, Wang, Su and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rui Su, suruiyu@126.com; Zhiying Wang, wzhy0321@126.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
