- 1Faculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC, Australia
- 2Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
- 3MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, United Kingdom
- 4INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas, France
- 5Department of Animal Science, University of California, Davis, Davis, CA, United States
- 6College of Animal Science, South China Agricultural University, Guangzhou, China
Editorial on the Research Topic
Multi-Layered Genome-Wide Association/Prediction in Animals
DNA mutations are the fundamental source of genomic variations that lead to phenotypic differences between individuals. Genomic variations in a population are usually assayed by single nucleotide polymorphism (SNP) arrays or whole-genome sequencing (WGS) to obtain genotype counts. If phenotypic measurements are also available on genotyped individuals in this population, genotype counts can be statistically linked to phenotypic measurements, i.e., genome-wide association studies (GWAS). Decades of GWAS in humans (Visscher et al., 2017) and animals (Hayes and Daetwyler, 2019) have shown that causal variants for complex traits are largely located at non-coding regions of the genome. This has been further supported by recent human studies of genetic variations with roles in gene regulation, e.g., those that are gene expression quantitative trait loci (eQTL) (Consortium, 2020) are enriched in causal variants of complex traits. Due to the vast availability of data in humans, such as proteomics and metabolomics, great efforts have been invested in the integration of multi-omics information and GWAS results (Hasin et al., 2017). The effort of functional annotation of animal genomes only started recently (Clark et al., 2020), although the size of multi-omics data has been increasing (Liu et al., 2021).
Unlike genomics research in humans, GWAS in animals is usually carried out amongst related individuals with small effective population sizes. This results in many SNPs in high linkage disequilibrium (LD) from a locus being associated with a trait, and it is difficult to distinguish which ones are causal. This is particularly difficult when the GWAS used imputed sequence variants (Hayes and Daetwyler, 2019) where a large number of variants are in very strong LD. Therefore, external information, such as multi-omics datasets independent of GWAS, is needed to pinpoint causal signals. Apart from the use of multi-omics data, multi-trait meta-analyses of GWAS (Xiang et al., 2020; Xiang et al., 2021) and large-scale GWAS of intermediate traits like milk composition (Sanchez et al., 2021) also improve the detection of causal variants. In addition, multi-breed meta-analyses can help to pinpoint causal mutations as LD is conserved over shorter distances across breeds (van den Berg et al., 2020).
The genomic information of domestic animals is used to improve animal breeding. In particular, genomic selection or genomic prediction (GP) (Meuwissen et al., 2001) using genome-wide marker information has greatly benefited animal breeding. GP was primarily designed to use all available markers to estimate genomic breeding values (gEBVs) reflecting the genetic merit of animals. However, the accuracy of GP, approximated as the correlation between gEBVs and phenotype in the validation population is far from being perfect. There are many ways to improve the accuracy of GP and emerging evidence shows that the use of functional information can enhance GP (MacLeod et al., 2016; Xiang et al., 2019; Teng et al., 2020). With the growing size of functional genomics data, it is anticipated that functional genomics priors will be routinely integrated into GP to improve its accuracy. This will then need the development of suitable methodologies that effectively fuse multi-omics data together with the genotype-phenotype association analysis in the GP model (Cheng et al., 2021).
The ‘Multi-layered Genome-wide Association and Prediction in Animals’ research topic intends to collect high-quality articles on the emerging area of integrating multi-omics datasets into GWAS and GP. In its conclusion, 9 articles from 58 authors have been collected, ranging from data generation, integrative analysis of multi-omics with GWAS and GP, and new method development across multiple domestic species.
Developing new methods for the integrative analysis of multi-omics and GWAS/GP is one of the key research areas in genetics. Due to flexibility in incorporating priors in the model, several new Bayesian methods have been proposed, including BayesHP and BayesHE (Shi et al.) that incorporate “global-local” shrinkage priors, and multi-class Bayesian Alphabet methods (Wang et al.) that incorporate biological information into multi-trait Bayesian analysis. The application of these methods into simulated and real data supports that incorporating biological priors into GP training improves its accuracy.
GWAS or GP using WGS is another emerging area. Due to high costs of in-depth WGS, there is a new shift toward using low-pass WGS which provides cost-effective options for GWAS or GP to use millions of sequence variants. By analyzing simulated data, Deng et al. show that imputation using low-pass WGS is more accurate than using SNP arrays. This was also found by Zhao et al. where real low-pass WGS from donkeys were generated, analyzed, and applied to GP.
GWAS in animals has been largely used to dissect causative loci associated with complex traits. Jiang et al. present such an effort in detecting loci associated with body size in Hu sheep. Also, Yang et al. identified loci associated with meat production in chicken. Apart from the standard linear mixed model, GWAS can also be carried out using single-step Bayesian regression, and Naserkheil et al. present such an effort in identifying loci associated with meat production traits of beef cattle.
In fact, loci prioritized by GWAS may be used as biological priors to enhance GP. However, Gebreyesus et al. found that adding GWAS-prioritized variants had no improvement in GP for survival traits of dairy cattle which have very low heritability estimates. This emphasizes that more studies are needed in this area. Other lowly heritable but important traits in cattle included female fertility. Chen et al. found that accounting for sire genetic effects improves the genetic evaluation of fertility of Holstein cows.
In conclusion, integrating multi-omics data with GWAS and GP in animals is an important and emerging research area in livestock genomics. We anticipate that the development and application of efficient methods, increased use of WGS, and integration of more types of multi-omics data will be future directions of this area. Understanding how DNA mutations shape complex traits not only furthers our understanding of biology, but also provides practical benefits in animal breeding.
Author Contributions
RX drafted the manuscript, and revised with all authors. All authors have proof-read the final version.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Cheng, H., Zhao, T., and Zeng, J. (2021). Extend Mixed Models to Multi-Layer Neural Networks for Genomic Prediction Including Intermediate Omics Data. bioRxiv 2021, 472186. doi:10.1101/2021.12.10.472186
Clark, E. L., Archibald, A. L., Daetwyler, H. D., Groenen, M. A. M., Harrison, P. W., Houston, R. D., et al. (2020). From FAANG to fork: Application of Highly Annotated Genomes to Improve Farmed Animal Production. Genome Biol. 21, 285–289. doi:10.1186/s13059-020-02197-8
Consortium, G. (2020). The GTEx Consortium Atlas of Genetic Regulatory Effects across Human Tissues. Science 369, 1318–1330. doi:10.1126/science.aaz1776
Hasin, Y., Seldin, M., and Lusis, A. (2017). Multi-omics Approaches to Disease. Genome Biol. 18, 83–15. doi:10.1186/s13059-017-1215-1
Hayes, B. J., and Daetwyler, H. D. (2019). 1000 Bull Genomes Project to Map Simple and Complex Genetic Traits in Cattle: Applications and Outcomes. Annu. Rev. Anim. Biosci. 7, 89–102. doi:10.1146/annurev-animal-020518-115024
Liu, S., Gao, Y., Canela-Xandri, O., Wang, S., Yu, Y., Cai, W., et al. (2021). A Comprehensive Catalogue of Regulatory Variants in the Cattle Transcriptome. bioRxiv 2020, 406280. doi:10.1101/2020.12.01.406280
MacLeod, I. M., Bowman, P. J., Vander Jagt, C. J., Haile-Mariam, M., Kemper, K. E., Chamberlain, A. J., et al. (2016). Exploiting Biological Priors and Sequence Variants Enhances QTL Discovery and Genomic Prediction of Complex Traits. BMC genomics 17, 144. doi:10.1186/s12864-016-2443-6
Meuwissen, T. H. E., Hayes, B. J., and Goddard, M. E. (2001). Prediction of Total Genetic Value Using Genome-wide Dense Marker Maps. Genetics 157, 1819–1829. doi:10.1093/genetics/157.4.1819
Sanchez, M.-P., Rocha, D., Charles, M., Boussaha, M., Hozé, C., Brochard, M., et al. (2021). Sequence-based GWAS and post-GWAS Analyses Reveal a Key Role of SLC37A1, ANKH, and Regulatory Regions on Bovine Milk mineral Content. Scientific Rep. 11, 1–15. doi:10.1038/s41598-021-87078-1
Teng, J., Huang, S., Chen, Z., Gao, N., Ye, S., Diao, S., et al. (2020). Optimizing Genomic Prediction Model Given Causal Genes in a Dairy Cattle Population. J. Dairy Sci. 103, 10299–10310. doi:10.3168/jds.2020-18233
van den Berg, I., Xiang, R., Jenko, J., Pausch, H., Boussaha, M., Schrooten, C., et al. (2020). Meta-analysis for Milk Fat and Protein Percentage Using Imputed Sequence Variant Genotypes in 94,321 Cattle from Eight Cattle Breeds. Genet. Sel Evol. 52, 37. doi:10.1186/s12711-020-00556-4
Visscher, P. M., Wray, N. R., Zhang, Q., Sklar, P., McCarthy, M. I., Brown, M. A., et al. (2017). 10 Years of GWAS Discovery: Biology, Function, and Translation. Am. J. Hum. Genet. 101, 5–22. doi:10.1016/j.ajhg.2017.06.005
Xiang, R., Berg, I. v. d., MacLeod, I. M., Hayes, B. J., Prowse-Wilkins, C. P., Wang, M., et al. (2019). Quantifying the Contribution of Sequence Variants with Regulatory and Evolutionary Significance to 34 Bovine Complex Traits. Proc. Natl. Acad. Sci. U.S.A. 116, 19398–19408. doi:10.1073/pnas.1904159116
Xiang, R., MacLeod, I. M., Daetwyler, H. D., de Jong, G., O’Connor, E., Schrooten, C., et al. (2021). Genome-wide fine-mapping Identifies Pleiotropic and Functional Variants that Predict many Traits across Global Cattle Populations. Nat. Commun. 12, 860. doi:10.1038/s41467-021-21001-0
Keywords: multi-omics, biological priors, genome-wide association studies, genomic prediction, genomic selection
Citation: Xiang R, Fang L, Sanchez M-, Cheng H and Zhang Z (2022) Editorial: Multi-Layered Genome-Wide Association/Prediction in Animals. Front. Genet. 13:877748. doi: 10.3389/fgene.2022.877748
Received: 17 February 2022; Accepted: 07 March 2022;
Published: 08 April 2022.
Edited and reviewed by:
Martino Cassandro, University of Padua, ItalyCopyright © 2022 Xiang, Fang, Sanchez, Cheng and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ruidong Xiang, ruidong.xiang@unimelb.edu.au