Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 28 March 2023
Sec. Livestock Genomics
This article is part of the Research Topic From Agriculture Genome to Phenome: Genome-Wide Association, Prediction and Selection View all 11 articles

A look under the hood of genomic-estimated breed compositions for brangus cattle: What have we learned?

Zhi LiZhi Li1Jun He
Jun He1*Fang YangFang Yang1Shishu YinShishu Yin1Zhendong GaoZhendong Gao1Wenwu ChenWenwu Chen1Chuanyu SunChuanyu Sun2Richard G. TaitRichard G. Tait2Stewart BauckStewart Bauck2Wei GuoWei Guo3Xiao-Lin Wu,
Xiao-Lin Wu3,4*
  • 1College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
  • 2Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, United States
  • 3Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI, United States
  • 4Council on Dairy Cattle Breeding, Bowie, MD, United States

The Brangus cattle were developed to utilize the superior traits of Angus and Brahman cattle. Their genetic compositions are expected to be stabilized at 3/8 Brahman and 5/8 Angus. Previous studies have shown more than expected Angus lineage with Brangus cattle, and the reasons are yet to be investigated. In this study, we revisited the breed compositions for 3,605 Brangus cattle from three perspectives: genome-wise (GBC), per chromosomes (CBC), and per chromosome segments (SBC). The former (GBC) depicted an overall picture of the “mosaic” genome of the Brangus attributable to their ancestors, whereas the latter two criteria (CBC and SBC) corresponded to local ancestral contributions. The average GBC for the 3,605 Brangus cattle were 70.2% Angus and 29.8% Brahman. The K-means clustering supported the postulation of the mixture of 1/2 Ultrablack (UB) animals in Brangus. For the non-UB Brangus animals, the average GBC were estimated to be 67.4% Angus and 32.6% Brahman. The 95% confidence intervals of their overall GBC were 60.4%–73.5% Angus and 26.5%–39.6% Brahman. Possibly, genetic selection and drifting have resulted in an approximately 5% average deviation toward Angus lineage. The estimated ancestral contributions by chromosomes were heavily distributed toward Angus, with 27 chromosomes having an average Angus CBC greater than 62.5% but only two chromosomes (5 and 20) having Brahman CBC greater than 37.5%. The chromosomal regions with high Angus breed proportions were prevalent, tending to form larger blocks on most chromosomes. In contrast, chromosome segments with high Brahman breed proportion were relatively few and isolated, presenting only on seven chromosomes. Hence, genomic hitchhiking effects were strong where Angus favorable alleles resided but weak where Brahman favorable alleles were present. The functions of genes identified in the chromosomal regions with high (75%) Angus compositions were diverse yet may were related to growth and body development. In contrast, the genes identified in the regions with high (37.5%) Brahman compositions were primarily responsible for disease resistance. In conclusion, we have addressed the questions concerning the Brangus genetic make-ups. The results can help form a dynamic picture of the Brangus breed formation and the genomic reshaping.

Introduction

Brangus beef cattle were developed to combine the desirable traits of Angus and Brahman cattle. Angus cattle are well known for their superior carcass qualities, and Angus cows have excellent fertility and milking capability. The Brahman cattle have developed disease resistance, overall hardiness, and outstanding maternal instincts thanks to rigorous natural selection. The crossbreeding to create the Brangus breed dated to 1932, according to the USDA 1935 Yearbook in Agriculture. Yet, Brangus registration by the International Brangus Breeders Association (IBBA) started in 1949. For official registration, a Brangus animal needs to be genetically stabilized at 3/8 Brahman and 5/8 Angus by pedigree, be solid black or red, and be polled (Briggs and Briggs, 1980). Both sire and dam must be recorded with IBBA (San Antonio, TX). Hence, knowing the breed compositions of individual animals is a requisite to official animal registrations. Such information also allows for utilizing the “stable” heterosis to explore methods for predicting heterosis (Akanno et al., 2017), and it permits the implementation of precise animal farming management decisions (Berry, 2019).

After the breed formation, subsequent inter-se mating and selection have been conducted with Brangus over time. For example, the United States IBBA has developed expected progeny differences (EPD) for quantitative traits, such as birth weight, weaning weight, yearling weight, milk production, total maternal calving ease, and intramuscular fat. Likely, artificial selection pressure employed at varying levels on these traits of interest in the past decades could have resulted, to some extent, in the deviation of the ancestral genomic proportions in Brangus from the previously targeted ratios. Previous studies showed significantly elevated Angus genomic breed composition (GBC) in Brangus cattle (He et al., 2018; Paim T. D. P. et al., 2020; Paim T. D. P. et al., 2020; Li et al., 2020; Wang et al., 2020; Wu et al., 2020), and the reasons are yet to be revealed. There were several plausible assumptions. For example, it was postulated that selecting Brangus for Angus favorable traits (e.g., carcass, growth, feed efficiency) could increase the genomic breed compositions of Brangus cattle toward Angus (Paim T. D. P. et al., 2020; Wu et al., 2020). In theory, the proportion of actual genotypes passed from one generation to the next can vary between individuals owing to Mendelian sampling, genetic recombination rate and linkage disequilibrium (LD) (Falconer and Mackay, 1996). Hence, selecting Brangus for traits more prevalent in Angus (e.g., carcass, growth, feed efficiency) can favor Angus alleles, sweeping more ‘Angus’ haplotypes to further generations. Another possible reason could be the mixture of 1/2 Ultrablack (UB) animals (i.e., the first-generation progenies derived from Brangus × Angus), which are expected to have 81.25% Angus lineage. In October 2005, the International Brangus Breeders Association (IBBA) board of directors approved the creation of the Ultrablack and Ultrared (UR) program to take advantage of the strengths of the Brangus and Angus or Red Angus breeds, which combine environmental adaptability and maternal excellence of Brangus with the exceptional marbling, calving ease and name recognition of Angus or Red Angus. The UB and UR animals are registered composite animals with a validated and documented lineage between 12.5% and 87.5% Brangus breeding. The remaining 87.5%–12.5% must be a registered Angus to be a UB or a Red Angus to be a UR. The second assumption is likely, yet scientifically supporting evidence is needed. Apart from the possible mixture, the actual genomic breed compositions of Brangus cattle are not known precisely after decades of crossbreeding and selection.

Selection may have left signatures on the genome after the breed formation (Goszczynski et al., 2017; Paim T. D. P. et al., 2020). These genomic regions with selection sweeps can have different breed compositions than expected due to the selective advantages of genes from one of the founders. Paim T. D. P. et al. (2020) evaluated the overall ancestral breed compositions and local ancestral contributions by chromosomes in Brangus cattle. They also related haplotypes to ancestral traits under selection. Such information could lead to a better understanding of how hybridization and crossbreeding systems have shaped the genetic architecture of these composite animals. However, their conclusions were built on small samples of the two founder breeds with genotypes (i.e., 68 Brahman and 95 Angus). Statistically, the inference of allelic frequencies and haplotypes, and, therefore, ancestry origins, are subject to large errors in small samples.

In this study, we revisited the estimation of breed compositions for Brangus cattle from three perspectives, genomic-wise, per chromosome, and per chromosome segment. The genotyped animals in the two ancestral breeds included 20,359 Angus and 509 Brahman cattle. Hence, our sample sizes for the two ancestral breeds were significantly larger than those used by Paim T. D. P. et al. (2020). We took a consistent approach to estimate ancestral breed compositions from the three perspectives; all were assessed with an admixture model. Three measures of breed compositions were defined: 1) genomic-estimated breed compositions (GBC), 2) chromosomal-estimated breed compositions (CBC), and 3) segmental-estimated breed compositions (SBC). Note that the latter two quantities corresponded to local ancestral genomic contributions, measured per chromosome and chromosomal regions, respectively. Possible population stratification was inferred based on global ancestral genomic proportions, whereas genomics dynamics due to crossbreeding and selection were visualized through local ancestral contributions.

Materials and methods

Animals and genotype data

The experimental data consisted of 3,605 Brangus cattle, 20,359 Angus cattle, and 483 Brahman cattle. The latter two ancestral breeds were used as the reference populations for estimating GBC for Brangus cattle. All the animals were genotyped with a GeneSeek Genomic Profiling (GGP) bovine 50 K V1 (version 1) chip, except 349 Brahman cattle were genotyped with an Illumina 777 K bovine SNP chip (Table 1). The genotypes were extracted from the Neogen GeneSeek genotyping databases representing samples shared between the Neogen global laboratories. The SNP map positions were based on the UMD 3.1 reference bovine genome assembly (Merchant et al., 2014).

TABLE 1
www.frontiersin.org

TABLE 1. Descriptive statistics of genotype data for Brangus and their ancestral breeds (Angus and Brahman)a.

Reference SNPs were selected from the common set between the GGP bovine 50 K V1 chip and the Illumina 777 K bovine SNP Beadchip. The data cleaning removed SNPs with a call rate of less than 95%, SNPs on the two sex chromosomes, and SNPs without map position. SNPs violating the Hardy-Weinberg equilibrium (p < 1.0E-8) were also excluded. For SNPs with greater than 0.99 correlations on each chromosome, only the one with higher or the highest minor allelic frequency was kept. When there were ties in allelic frequencies, a random one was taken. The final SNP set retained 41,672 common SNPs for the subsequent analyses. In each ancestral population, outlier individuals were excluded as those with (−2)log (likelihood) exceeding a given cutoff value (i.e., 2.0 by default) (He et al., 2018). This test excluded 37 Angus and 26 Brahman animals from the reference populations. The means and standard deviations of allele A frequencies of the SNPs after data cleaning in the three populations are shown in Table 1. On average, Angus cattle had a higher allele A frequency (0.492) than Brahman cattle (0.431–0.439). The average allele A frequency for Brangus cattle was 0.477, which fell between the two ancestral populations yet closer to Angus. The Angus population had a smaller standard deviation of allele A frequency than the Brahman population. The Brangus cattle had a smaller standard deviation of allele A frequencies than the two ancestral populations (Table 1).

Estimation of breed compositions

Breed compositions were estimated for individual Brangus animals using the admixture model by Bansal and Libiger (2015) (BL-Admixture). This method utilizes the same form of likelihood model as in the STRUCTURE (Pritchard et al., 2000) and ADMIXTURE (Alexander et al., 2009) software packages. However, the BL-Admixture model runs faster. In particular, STRUCTURE takes a Bayesian approach and relies on a Markov Chain Monte Carlo (MCMC) algorithm to sample the posterior distribution, which can extremely computationally intensive with large data. STRUCTURE and Admixture represent unsupervised analysis of the ancestry of multiple individuals and jointly estimate allele frequencies for the ancestral populations and the relative contribution of each ancestral population to each individual’s genome. The BL-Admixture estimates the ancestry for a single individual using information about allele frequencies at a large number of loci for multiple reference populations. The latter allele frequencies are obtained from previous unsupervised admixture analysis, or simply from the reference population assuming no genetic drift and selection after these breeds were formed. We took the latter approach.

Consider one biallelic locus (say j) genotyped on an animal (say i). Assume that allele frequencies on this locus are known for each ancestry breed. The admixture model postulates that a progeny’s genotype at this locus is determined according to an allelic frequency as a weighted average of T reference (ancestry) breeds: fij=t=1Twitqtj, where qtj is the frequency of allele B for the jth SNP in the tth reference population,; wit is the corresponding weight. Then, under the assumption of Hardy-Weinberg equilibrium, the probabilities of observing each genotype (gij) on this animal are the following:

Prgij|fij=1fij2gij=02fij1fijgij=1fij2gij=2(1)

Now, consider j=1,,M SNPs genotyped on this animal, assuming their mutual independence. The log-likelihood computed for all the M SNPs measured on the ith animal, denoted by li, is as follows.

li=j=1MlnPrgij|fij=j=1Mgijlnfij+2gijln1fij+C(2)

where C=j=1Mln2gij. Note that the assumption of mutual independence between SNPs does not hold precisely due to linkage or/and random associations between them. Nevertheless, the admixture model is often robust to this assumption violation because the percentage of SNPs in high LD are rare when using moderate to high-density SNP panels (He et al., 2018; Li et al., 2020).

The admixture coefficients, Wi=wi1wiT, taken to the GBC of animal i attributable to the reference (ancestral) breeds, are obtained using the Broyden-Fletcher-Goldfarb-Shanno (BFGS) method under the restrictions that wik0; k=1TwiT=1 (Nocedal and Wright, 2006). BFGS is a powerful, Quasi-Newton second derivative line search family method to solve non-linear optimization problems. Optimizing the likelihood function of BFGS iteratively removed the non-zero mixing coefficient, which did not significantly improve the model fitting, thus obtaining a concise set of individual mixture coefficients. In the admixture model, the value of each admixture coefficient is bounded between 0 and 1, and the sum of admixture coefficients (GBC) computed for each animal is one under the assumption of 100% genetic contributions by the T ancestral breeds to each animal.

Breed compositions for the Brangus animals were estimated genome-wide, per chromosome, and per chromosomal segment, respectively. The number of reference SNPs per chromosome varied from 712 (chromosome 25) to 2,568 (chromosome 1), and the average distance between SNPs ranged from 0.05 to 0.07 Mb. SBC were obtained on three window sizes: 1 Mb, 5 Mb, and 10 Mb, respectively. The widow sizes were taken arbitrarily yet still based on two factors, the average length of gene in the bovine genome and the minimum number of SNPs to give stable estimates. To minimize the errors in the estimated SBC due to insufficient SNP coverage, chromosomal segments with less than five SNPs were excluded from computing SBC. There were 2,522 1-Mb segments, 518 5-Mb segments, and 269 10-Mb segments, respectively. The average number of SNPs per segment ranged from 15.2 to 18.8 (1 Mb), 72.0 to 91.1 (5 Mb), and 132.0 to 169.1 (10 Mb), respectively (Table 2). Overall, the average R2 LD per segment decreased as the window size increased, ranging from 0.107 to 0.164 on 1 Mb chromosomal regions, from 0.062 to 0.104 on 5-Mb chromosomal regions, and from 0.050 to 0.081 on 10 Mb chromosomal regions (Table 2). The R2 LD on 1-Mb regions also had a larger average standard deviation (0.05) than those on 5-Mb and 10-Mb regions (0.02).

TABLE 2
www.frontiersin.org

TABLE 2. Mean and standard deviation (SD) of R2 linkage disequilibrium (LD) among SNPs evaluated by varying window sizes (1 Mb, 5 Mb, and 10 Mb) on each chromosomea.

Clustering of brangus cattle

K-means clustering (Jain, 2010) was conducted on the GBC for the 3,605 Brangus cattle to reveal possible population stratifications, where K = 2 and 3, respectively. Initially, the number of clusters K was specified a priori, and randomly assigned all the animals to each of K distinct, non-overlapping clusters as their initial clusters. Then, the K-means algorithm computed the cluster centroid for each cluster, which is a vector of genotype means for the M reference SNPs, and it re-assigned each animal to the cluster whose centroid was the closest. The last two steps proceeded iteratively till the total within-clustering variation, defined by the sum of all the pairwise squared Euclidean distance in each cluster and summed over all the K clusters, was minimized as much as possible (Hartigan and Wong, 1979):

minimizek=1K1nki,iCkj=1Mgijgij2

where Ck stands of cluster k, and nk is the number of animals in the kth cluster. The K-mean clustering analysis was implemented by the “stats” R package.

Gene set enrichment analysis

Gene ontology (GO) term enrichment analysis, or gene set enrichment analysis, was conducted using the “gprofiler” R package with genes identified in 1-Mb chromosomal regions featuring either ancestral breed. This R package performed functional gene enrichment analysis on the input gene lists, mapped genes to known functional resources, and detected statistically significantly enriched terms. Briefly, chromosomal segments satisfying Angus SBC ≥0.75 or Brahman SBC ≥0.50 were extracted. Both cutoff thresholds represented equal upward GBC deviations (i.e., 12.5%) from their expected values. Then, the gene information was extracted by querying in the Ensembl database according to their chromosomal locations on the selected chromosomal segments. Finally, GO clustering analysis was performed on the gene list generated with the different filters. The annotation databases included Gene Ontology–biological processes, cellular components, molecular function (http://geneontollogy.org/) (Ashburner et al., 2000), and Kyoto Encyclopedia of Genes and Genomes (KEGG; http://www.genome. jp/kegg/) (Kanehisa et al., 2016). Other relevant R packages used in this study included “org. Bt.e.g., db”, “biomaRt” (Smedley et al., 2015), and “clusterProfiler” (Yu et al., 2012). Gene lists were extracted using the “getBM” function in the “biomaRt” R package. Briefly, given a set of filters (e.g., the chromosome number, the start and end positions), it retrieved attributes of all genes in this interval from the BioMart database. The attributes information includes “ensembl gene id”, “chromosome number”, “gene start position”, “gene end position”, and “gene description”, etc. Finally, gene extraction was performed on all eligible SBC fragments, and all extracted genes were merged to form a gene list after removing duplication. Enrichment analysis was then performed. All the QTLs were queried and aggregated in the QTLdb database (Release 49, https://www.animalgenome.org/cgi-bin/QTLdb/BT/index) (Hu et al., 2022).

Results and discussion

Genomic-estimated breed compositions

The estimated GBC for the 3,605 Brangus cattle, on average, were 70.2% Angus and 29.8% Brahman (Table 3), which significantly deviated from the officially expected values (i.e., 62.5% Angus and 37.5% Brahman) (p < 2.2e-16). The 95% confidence intervals were 61.1%–85.8% Angus and 14.2%–38.9% Brahman. There were 459 (12.7%) Brangus cattle with Angus GBC ≥80.0% and 19 (0.5%) Brangus cattle with Angus GBC ≥90.0%. Similarly, elevated Angus GBC for the Brangus animals were documented in some previous studies. For example, Paim T. D. P. et al. (2020) estimated that Brangus were 70.4% Angus and 29.6% Brahman based on high-density SNP genotypes (777,962 SNP, BovineHD Beadchip, Illumina, San Diego, CA, United States). Li et al. (2020) showed that Brangus cattle was 69.8%–70.5% Angus and 29.5%–30.2% Brahman based on multiple models, including an admixture model, linear regression, and ridge-regression BLUP, each with a selectively uniform 20 K SNP panel. Wang et al. (2020) proposed using regularized admixture models to estimate GBC for purebred animals to deal with the so-called “Impure Purebred Paradox”, a phenomenon suggesting a higher-than-expected false-negative rate in the identification of purebred animals. They showed that Brangus were, on average, 71.1%–77.1% Angus and 22.9%–28.9% Braham based on various regularized admixture models. Using path analysis model, Wu et al. (2020) showed that Brangus cattle were 68.2%–71.8% Angus and 28.2%–31.8% Brahman. Hence, regardless of the population sizes and the statistical methods used, these studies have consistently pinpointed that significantly higher-than-expected Angus breed proportions in Brangus cattle.

TABLE 3
www.frontiersin.org

TABLE 3. Means and standard deviations (SD) of genomic-estimated breed composition (GBC) for the 3,605 Brangus cattle as a single population (K = 1) and as subpopulations according to K-means clustering (K = 2 and 3, respectively)1,2,3.

Multiple reasons are likely responsible for the elevated Angus lineage in Brangus. Firstly, population stratification could exist with Brangus, given the fact that the IBBA approved the creation of UB and UR animals in 2005. The density plots of the estimated Angus or Brahman breed proportions in the 3,605 animals were bimodal, which served as preliminary evidence for the Brangus population stratification (Figure 1). Then, K-means clustering was conducted to partition the 3,605 animals into two and three clusters, respectively (Table 3). With K = 2 (i.e., two clusters), the B-1 cluster with a higher average Angus GBC (81.6%) consisted of 713 (19.8%) Brangus cattle, whereas the B-2 cluster with a lower average Angus GBC (67.4%) included 2,892 (80.2%) animals. With K = 3 (i.e., three clusters), the cluster (C-1) with the highest average Angus GBC (82.1%) had 640 (17.8%) Brangus animals. The cluster with the lowest average Angus GBC (67.2%) included 2,584 (71.7%) Brangus animals. In both sets of clustering results (K = 2 versus K = 3), the average Angus GBC for animals in the top Angus composition clusters (B-1 versus C-1) agreed approximately with each other (81.6% versus 82.1%), and they corresponded roughly to the expected Angus breed proportion (81.25%) for the ½ UB animals. We thus suspected that these animals could be the ½ UB animals. Using a path analysis approach, Wu et al. (2020) confirmed that the ½ UB animals were, on average, 81.25% Angus and 18.75% Brahman. The path analysis decomposed the relationships between the ancestors and the composite animals into direct and indirect path effects. The above percentage only accounted for direct effects by the path-analysis interpretation, assuming a zero correlation between the two ancestral breeds (Wu et al., 2020). In reality, however, Angus and Brahman cattle are connected due to sharing common remote ancestors, though the correlation can be low. For example, the correlation of allele A frequencies ranged from 0.05 to 0.10 between the two ancestral breeds, subject to the SNP panel sizes. If considering the indirect path effects, the actual Angus breed proportions for the ½ UB animals could be slightly higher. On the other hand, the two majority clusters, B-2 and C-3, had roughly comparable averages of Angus GBC (67.4% versus 67.2%). These were likely Brangus cattle without the mixture of ½ UB animals. The average Angus breed proportion for non-UB Brangus was 67.4% based on the clustering analysis with K = 2. The 95% confidence interval of Angus breed proportions in Brangus was between 60.4% and 73.5%. Hence, there was, on average, an approximately 5% deviation of GBC toward Angus breed proportions in non-UB Brangus cattle, possibly resulting from selecting Brangus cattle for phenotypes where Angus has advantages. Without selection, genomic-estimated breed compositions would agree approximately with the expected ratios (e.g., Funkhouser et al., 2017; Gobena et al., 2018). Note that ½ UB animals are not precisely registered Brangus. They are officially given a “UB” prefix for registration purposes and are shown on their registered IDs. It is also worth mentioning that the B-2 (or C-3) cluster could include some advanced UB crosses because they had comparable Angus breed proportions as those in the non-UB Brangus cattle in clusters B-2 and C-3. In early 2013, the IBBA further approved the breeding-up of UB (or UR) cattle to registered Brangus. The rule states that an IBBA-registered Brangus sire or dam mated to an IBBA-registered UB or UR sire or dam that results in at least 7/8th (87.5%) Brangus also qualifies as a registered Brangus. For example, mating a ¾ UB animal (i.e., progenies derived from crossing ½ UB animals with Brangus) to a registered Brangus resulted in 7/8 UB animals, which meets the 87.5% Brangus makeup. On average, a 7/8 UB animal was 67.2% Angus. Finally, the average Angus GBC for the animals in cluster C2 was 70.9%, which roughly corresponded to the expected Angus lineage (71.9%) for ¾ UB animals.

FIGURE 1
www.frontiersin.org

FIGURE 1. Density plots of genomic-estimated Angus (red triangle) and Brahman (blue cross) breed compositions for 3,605 Brangus cattle.

Chromosomal-estimated breed compositions

The ancestral breed proportions were estimated by chromosomes in the 3,605 Brangus (Table 4). Overall, the average estimated CBC per chromosome varied substantially, from 56.9% (chromosome 5) to 79.8% (chromosome 15). Converse to the Angus CBC, the average Brahman CBC was the lowest on chromosome 15 (20.2%) and the highest on chromosome 5 (CBC 43.1%). We noted that the estimated ancestral breed proportions were heavily distributed toward Angus, which agreed with the breeding target for greater Angus “blood” than Brahman. There were 27 chromosomes with Angus CBC greater than 62.5%, but only two chromosomes (5 and 20) had Brahman CBC greater than 37.5%. Our results coincided with a previous study by Paim T. D. P. et al. (2020). They showed that chromosome 15 had the highest Angus proportion (84.7%), and chromosome five had the largest Brahman proportion (43.7%). Still, there were some differences. Paim T. D. P. et al. (2020) used principal component analysis to describe the population relationships. They showed that the first principal components (PC1), which accounted for ancestral breeds, were uniformly distributed between the two ancestral breeds when evaluated on chromosomes 16, 25, and 29. Because the average Brahman breed proportion is expected to be 5/8 (not 5/5), a uniform distribution of PC1 for Brangus between the two ancestral breeds would suggest equal ancestral genomic proportions. Hence, their results were an indication of significant deviates in breed compositions of Brangus toward Brahman on these three chromosomes. A principal analysis is a popular feature-reducing technique for analyzing large, high-dimension data, yet ignoring the detailed information of individual features, which were local ancestral contributions on specific chromosomal regions. In the present study, we directly evaluated breed compositions by chromosomes. Our results showed that only chromosomes 5 and 20 had less than 62.5% Angus breed proportions on average, which agreed with Paim T. D. P. et al. (2020). But all the other chromosomes (including 16, 25, and 29) had an average Angus CBC greater than 62.5%, meaning they deviated toward Angus instead. These differences could also arise from sampling biases, because the two ancestral breeds used by Paim T. D. P. et al. (2020) were small. There was a high chance that the sampled allelic frequencies may deviate substantially from the actual allelic frequencies. Furthermore, because the variance of an allelic frequency is inversely proportional to the population size (Falconer and Mackay, 1996), the estimated allelic frequencies for ancestral breeds were also subject to large deviations in small samples.

TABLE 4
www.frontiersin.org

TABLE 4. Means and standard deviations (SD) of chromosomal-estimated breed compositions (CBC) for the 3,605 Brangus cattle.

Despite the large chromosome-by-chromosome variability for ancestral breed proportions, GBC estimated by the mean of the average CBC across the 29 chromosomes per animal (denoted by C_GBC) agreed roughly with the average GBC in the 3,605 Brangus cattle. The mean (standard deviation) of Angus C_GBC for the 3,605 Brangus cattle was 70.9% (6.7%). The mean (standard deviation) of Brahman C-GBC was 29.1% (6.7%). The correlation between C_GBC and GBC for the 3,605 Brangus animals was 0.99 (See Supplementary Figure S1A). Within each subpopulation, the average CBC approximately agreed to (or slightly larger than) the corresponding GBC on average. For example, in the two clusters obtained by the K-means clustering with K = 2, the cluster for the mixed UB animals had, on average, 82.5% Angus and 17.5% Brahman breed compositions, and the cluster for non-UB animals had, on average, 68.2% Angus and 31.8% Brahman breed compositions.

Segmental-estimated breed compositions

Ancestral breed compositions were evaluated on 1-Mb, 5-Mb, and 10-Mb windows on each chromosome (Table 5). Like CBC, the mean and range of the estimated SBC per segment varied substantially with chromosomes (Figure 2). Regardless of the window sizes, all the chromosomes, except 5 and 20, had the mean Angus SBC per chromosome exceeding 62.5%, and only chromosomes 5 and 20 had greater than 37.5% Brahman SBC. The average Angus SBC was the largest on chromosome 15 (76.9%–81.4%) and the smallest on chromosome 5 (57.6%–59.7%) (Table 5). Complementary to the Angus SBC, the Brahman SBC was the largest on chromosome 5 (40.3%–42.4%) and the smallest on chromosome 15 (18.6%–23.1%) (Table 5). Nevertheless, the segment with the maximum Angus proportion appeared on chromosome 18 (94.0%–95.9%), although chromosome 15 had the largest average Angus proportion. The maximum Brahman proportion segment was found on chromosome 2 (74.4%) when evaluated on 1 M windows and chromosome 5 (62.2%–64.4%) on 5-Mb and 10-Mb windows. The overall average of Angus SBC across the 29 chromosomes in the 3,605 Brangus cattle were 69.2% (1-Mb windows), 72.3% (5-Mb windows), and 72.0% (10-Mb windows), respectively. The overall average of Brahman SBC across the 29 chromosomes was 30.8% (1-Mb windows), 27.7% (5-Mb windows), and 28.0% (10-Mb windows), respectively. These overall averages of genomic breed compositions by averaging Angus and Brahman SBC across all 29 chromosomes per animal (denoted by S_GBC) roughly agreed with the average Angus and Brahman CBC (70.9% Angus versus 29.1% Brahman; Table 4) and the average GBC (70.2% Angus versus 29.8% Brahman; Table 3). The correlation between Angus S_GBC and GBC for the 3,605 Brangus animals was greater than 0.99, regardless of the window sizes (See Supplementary Figures S1B–D). Overall, the variation was larger when evaluated on a smaller interval than a larger interval (Figure 3). The Angus SBC on chromosome 15 varied between 68.6% and 89.7% when assessed with a 5-Mb window size (Table 5). The range became smaller (70.5%–86.6%) when evaluated with a larger window size (i.e., 10-Mb), and the range became larger (48.6%–91.1%) when assessed with a smaller window size (e.g., 1-Mb). Similar trends generally held on all the chromosomes (Table 5). The Brahman SBC showed precisely opposite trends.

TABLE 5
www.frontiersin.org

TABLE 5. Minimum (min), maximum (max), and mean of segmental-estimated breed compositions (SBC) per chromosome for Brangus.

FIGURE 2
www.frontiersin.org

FIGURE 2. Boxplots of segmental-estimated breed compositions (SBC) for 3,605 Brangus cattle with the window size being 1 Mb on each chromosome: (A) Angus breed proportion; (B) Brahman breed proportion.

FIGURE 3
www.frontiersin.org

FIGURE 3. Distributions of Angus (red) or Brahman (yellow) breed proportions by varying window sizes on chromosome 15 in 3,605 Brangus cattle: (A) 1 Mb, (B) 5 Mb, and (C) 10 Mb. The black dashed line represents the population mean of the estimated genomic-estimated breed compositions (GBC), and the blue dashed line represents the population mean of the chromosomal-estimated breed compositions (CBC).

Ancestral breed proportions per segment varied considerably on each chromosome (Supplementary Figure S2). For example, the Brahman SBC on chromosome one were 4.4%–60.2% (1-Mb windows), 4.6%–47.8% (5-Mb windows), and 7.4%–46.6% (10-Mb windows), respectively (Table 5). There were segments with high Angus proportions and high Brahman proportions, respectively, on the chromosomes. For example, the average Angus SBC in 1–5 Mb and 6–10 Mb segments on chromosome 15 were 85.5% and 89.7%, respectively, which were substantially higher than the CBC for that chromosome. The average Angus SBC computed in the 40–75 Mb segment was between 68.6% and 79.1%, still higher than the officially expected Angus proportion (62.5%). Approximately 95% of the 2,522 1-Mb chromosomal segments had, on average, between 45.0% and 90.0% Angus breed proportions and between 10.0% and 65.0% Brahman breed proportions (Table 6). Likewise, approximately 90% of the total 5-Mb (or 10-Mb) chromosomal segments had between 50.0% and 90.0% Angus breed proportions and between 10.0% and 65.0% Brahman breed proportions (Table 6).

TABLE 6
www.frontiersin.org

TABLE 6. Distributions of segmental-estimated Angus and Brahman breed compositions (Angus-SBC and Brahman-SBC) with the segments defined by 1 MB, 5 Mb, and 10 Mb intervals, respectively.

The variability of ancestral breed compositions per chromosome or segment could be a direct effect of selection. For example, Goszczynski et al. (2017) showed increased indicine haplotypes in the bovine leucocyte antigen region of Brangus cattle raised in Argentina, potentially due to selection for adaptation to the environments. In a brief search of bovine QTL in the QTLdb database, we found two postnatal growth traits and one body mass QTL reported on these two chromosomes, which were located on chromosome 15 at 61.6 Mb (body weight gain) and 17.0 Mb (birth body weight) (Snelling et al., 2010), and chromosome 17 at 12.0 Mb (average daily gain) (Rolf et al., 2012). These two chromosomes had the highest Angus CBC (chromosome 15: 79.84%; chromosome 17: 79.21%). These QTL were also located in regions with high Angus proportions. There are QTLs associated with disease resistance on chromosome 5 (Machado et al., 2010), which had the highest Braham CBC (43.14%). There were also health-related QTLs on chromosome 5, which included infectious bovine keratoconjunctivitis susceptibility (Kizilkaya et al., 2013), cold tolerance (Howard et al., 2014), and immune capacity (Leach et al., 2010), to list a few of them. All these QTLs were located in regions with high Brahman proportions in chromosome 5. There was a pleiotropic QTL affecting birth, yearling, and mature weights on chromosome 20 at 7–8 Mb (Weng et al., 2016), which was a region with a high Angus proportion. However, this chromosome had a high Brahman proportion (40.2%) in our study and Paim T. D. P. et al. (2020). The presence of Brahman favorable alleles in chromosomal regions with high Angus breed proportions, or vice versa, exemplifies the successful complementary of favorable alleles for the traits of interest from both ancestral breeds.

Gene set enrichment analysis

Gene set enrichment analysis was conducted with functional genes and QTL extracted on chromosome segments with high (75%) Angus and high (50%) Brahman breed proportions, respectively, evaluated on 1-Mb chromosomal intervals (Figure 4). We adopted these two cutoff threshold values because each represented an equal (12.5%) upward deviation from their expected values. We computed SBC on 1-Mb intervals for the subsequent gene set enrichment analysis because we rendered it an appropriate window size to capture gene regions of interest. Based on the Bos taurus UMD 3.1.1 assembly, the genome size of domestic cattle is approximately 2,670 Mb, which contains 26,815 genes (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Bos_taurus/105/#FeatureCountsStats). Hence, the average spacing between neighboring genes is around 0.1 Mb. In contrast, the chromosome regions defined on 5-Mb or 10-Mb intervals could be too broad. Nevertheless, SNPs on 1 Mb intervals had higher LD than those on 5-Mb or 10-Mb intervals. The average R2 LD per segment on each chromosome ranged from 0.11 to 0.16 in 1-Mb intervals, from 0.06 to 0.10 in 5-Mb intervals, and from 0.05 to 0.08 in 10-Mb intervals (Table 2). When computing the likelihood, we noted that SNPs with higher LD tended to give more weight to these highly linked SNPs or their regions.

FIGURE 4
www.frontiersin.org

FIGURE 4. Chromosomal fragments with: (A) high (75%) Angus breed proportion (upper), and (B) high (50%) Brahman breed proportions (bottom), evaluated with a 1 Mb window size on 29 chromosomes in 3,605 Brangus cattle.

There were 852 segments with Angus SBC ≥75% and 169 segments with Brahman SBC ≥50%, which accounted for 33.8% and 6.7%, respectively, of the 2,522 1-Mb chromosomal segments on the genome (Table 7.). The total length of chromosomal segments with 75% Angus proportions was the longest on chromosome one and the shortest on chromosomes 5 and 13. Relatively speaking, chromosome 15 had the largest percentage (64.5%) of high Angus breed proportion segments, and Chromosome five had the least percentage (5.8%) of total high Angus breed proportion segments. Twenty-five chromosomes had more than 20% of the chromosome length as high (75%) Angus proportion regions. In contrast, only two chromosomes (5 and 20) had more than 20% of their total length as high (50%) Brahman breed proportions. Adjacent high Angus proportion segments often joined together to form larger blocks, presenting almost in all 29 chromosomes (Figure 4A). In contrast, high Brahman proportion segments were relatively sparse and isolated, though some formed small blocks, and they were present only on seven (2, 5, 6, 9, 10, 20, and 21) chromosomes (Figure 4B). These results agreed with a previous report yet with a different approach. Using a chromosome painting approach based on a copying model (Lawson et al., 2012), Paim T. D. P. et al. (2020) showed that chromosomal regions with high Angus breed proportions were prevalent on chromosomes, often large blocks, yet chromosome segments with high Brahman breed proportion were fewer and isolated. A follow-up study on selection signatures in Brangus revealed that the majority of selection signatures in Brangus cattle came from Angus (Paim T. D. P. et al., 2020). The copying model related the patterns of LD across chromosomes to the underlying recombination process and used a hidden Markov method to reconstruct a sampled haplotype. In the present study, we directly estimated breed compositions on each chromosome flanked by varying window sizes based on a mixture model. The admixture coefficients inferred from the admixture model can be probabilistically interpreted as reflecting that as identity-in-state (Jannink and Wu, 2003). Yet, when confined to long chromosomal chunks, it approximated the probability of identity by descent (Browning, 2008). Hence, though using a different approach, we came to similar findings.

TABLE 7
www.frontiersin.org

TABLE 7. Summary of chromosomal segments with high Angus breed proportions (Angus SBC 75%) and high Brahman breed proportions (Brahman SBC 50%), respectively.

In theory, crossing breaks and shuffles chromosomes randomly over time when directional selection is absent, which reflects genetic drift. However, with selection, it increases and even fixes favorable alleles. Meanwhile, it also changes the allelic frequencies of genes in LD with the genes under selection due to genomic hitchhiking, also known as genetic draft (Smith and Haigh, 1974; Ma et al., 2019). In other words, genomic hitchhiking occurs when a polymorphic locus is in LD with a second locus undergoing a selective sweep. As a result, the linked allele will also increase in frequency, in some cases, until it becomes fixed in the population. Overall, genomic hitchhiking reduces genetic variation and leaves footprints across the genome known as the signatures of selection (Sabeti et al., 2007; Singh et al., 2020). The many high Angus breed proportion regions reflected the presence of multiple favorable alleles scattered across the chromosomes. We observed the prevalence of large blocks with high Angus breed proportions (Figure 4A), possibly resulting from the genomic hitchhiking effects when selecting Brangus for Angus favorable traits. This is equivalent to saying that genomic hitchhiking effects were strong around the genomic regions with Angus favorable alleles, leading to the presence of large blocks with high Angus breed proportions. In contrast, genomic hitchhiking effects were weak around Brahman favorable alleles because chromosomal blocks with high Brahman breed proportions were relatively few, isolated, and small in size.

There were 9,025 genes on the chromosomal segments with Angus SBC ≥75% (Supplementary Table S1) and 1,877 genes on the chromosomal segments with Brahman SBC ≥50% (Supplementary Table S2). Many genes in high (≥75%) Angus regions are responsible for biological processes related to animal development, such as regulation of biological processes, anatomical structure development, anatomical structural morphogenesis, animal organ development, and skeletal system development, and, in KEGG, related to hormone regulation (such as the Estrogen signaling pathway). For example, the system development (GO:0048731) category’s related child terms include system development, such as central nervous, respiratory, and endocrine. We also found many genes associated with carcass and meat quality traits, such as PPP1R3B (Edwards et al., 2003; Cinar et al., 2012), ASXL1 (Grigoletto et al., 2020), DNMT3B (Liu et al., 2012), and TMEM68 (Lindholm-Perry et al., 2012; Terakado et al., 2018; Edea et al., 2020), just to list a few. Our gene list also included LEP on chromosome four and PLAG1 on chromosome 14. Paim T. D. P. et al. (2020) previously found these two genes in high Angus regions. The LEP gene is expressed in adipose tissue and codes for leptin, a hormone known to regulate feed intake and energy balance in mammals (Woronuk et al., 2012). This gene is associated with marbling, fat thickness, rib eye area, and feed intake in several beef cattle breeds (Souza et al., 2010; Woronuk et al., 2012; Kononoff et al., 2017). Leptin is an essential gene for puberty onset (Williams et al., 2002). This gene could be inherited from Angus ancestors, or its frequency was increased by the selection of Brangus for early puberty since breed formation because Bos indicus heifers have challenges achieving puberty early in life (Sartori et al., 2010; Fortes et al., 2012). PLAG1 is involved in regulating stature and weight (Littlejohn et al., 2012; Pryce et al., 2012; Song et al., 2016). This gene is associated with yearling weight in Australian Tropical Composite breeds (Porto-Neto et al., 2014). There is still another gene, XKR4, which is close to PLAG1. The XKR4 gene is associated with subcutaneous rump fat thickness, scrotal circumference, serum concentration of prolactin, and sexual precocity (Fortes et al., 2012; Porto Neto et al., 2012; Bastin et al., 2014; Takada et al., 2018). The genes in the high (50%) Brahman regions are primarily responsible for biosynthetic-related biological processes (e.g., cytolysis), molecular biological functions related to enzyme activity (e.g., lysozyme activity, peptidoglycan muralytic activity, hydrolase activity, and serine-type endopeptidase activity), and diseases and immunity (e.g., MHC class II protein complex, MHC protein complex, type I diabetes mellitus, allograft rejection, graft-versus-host disease, autoimmune thyroid disease, and pathogenic Escherichia coli infection). For example, peptidoglycan muralytic activity (GO:0061783), which contributes to the degradation of peptidoglycan, is a major structural component of bacterial cell walls (Nelson et al., 2012); Another example is the MHC Class II protein complex (GO:0042613). MHC is involved in the immune process and plays the role of transmitting antigens (Rodgers and Cook, 2005). Still, the gene functionalities in chromosomal regions with high Angus breed proportions were diverse, possibly due to genomic hitchhiking of genes linked to the favorable alleles under selection. For example, there was a category of genes called “cellular response to stress”. Cells respond to stress in various ways, from activating pathways that promote survival to eliciting programmed cell death that eliminates damaged cells. Cell death research has attracted much attention in the last 2 decades also because of its relevance to development, degenerative diseases, and cancer in human (Fulda et al., 2010). Overall, the Brangus cattle have successfully combined the favorable traits of the two highly successful parent breeds.

QTLs were extracted from the chromosomal regions with the top 1% highest Angus (Angus SBC 91.1%) and Brahman (Braham SBC 59.4%) breed proportions, respectively (Supplementary Table S3). The QTLs in the top 1% Angus regions are related to meat, carcass, production, and reproduction. This list included, for example, birth weight QTL (chromosomes three and 8), weaning weight QTL (chromosomes 1, 3, 15, 18, and 23), yearling weight QTL (chromosomes 3 and 10), mature body weight QTL (chromosomes 3, 4, 10, 17, and 18), carcass weight QTL (chromosomes three and 8), marbling score QTL (chromosomes 15, 18, and 22), fat thickness QTL (chromosomes 3, 4, 17, 18, and 23), calving ease QTL (chromosomes 3, 8, 10, and 17), and QTL for Longissimus muscle area (chromosomes 3, 8, 17, and 18). Some QTL have pleiotropic effects. The QTL in the regions with 1% highest Brahman proportions are related to health, such as bovine respiratory disease susceptibility (chromosome 2: 117.3–134.4 Mb), insulin-like growth factor 1 level on (chromosome 5: 40.3–59.7 Mb; chromosome 14: 26.5–27.0 Mb), and blood cortisol level (chromosome 16: 29.7 Mb).

Conclusions

We have revisited the genomic breed compositions in 3,605 Brangus cattle from three perspectives, genome-wise, per chromosome, and per chromosome segment. The K-means clustering analysis revealed population stratification. The B-1 cluster consisted of ½ UB (i.e., first generation from crossing Brangus with Angus), with 81.6% Angus genomic lineage. The non-UB Brangus animals (B-2), on average, were 67.4% Angus and 32.6% Brahman. It was likely that selecting Brangus for traits where Angus had advantages led to an approximately 5% average deviation in the estimated GBC toward Angus lineage. Further deviation of Brangus breed compositions from the officially expected ancestral breed ratios is worth paying attention to because it could diminish the complementarity of Brahman and Angus over time.

The ancestral breed proportions varied substantially by chromosomes and by chromosomal segments, likely shaped by cross-breeding and subsequent inter-se mating and selection over time. There were strong hitchhiking effects on genomic regions where favorable Angus alleles resided. The functions of genes identified in the chromosomal regions with high Angus proportions were diverse, yet many were responsible for biological processes related to growth and body development. The genes identified in the regions with high Brahman compositions were primarily responsible for biosynthetic-related processes, molecular biological functions related to enzyme activity, diseases, and immunity. The co-presence of segments of high-Angus and high-Brahman lineage, respectively, on the same chromosomes, exemplified the successful complementary of favorable alleles from both ancestral breeds.

While genomic-estimated breed compositions depicted an overall picture of the “mosaic” genome of animals attributable to their ancestors, local ancestry genomic compositions were visualized through chromosomal- and segmental-estimated breed compositions. Precisely, the admixture coefficients based on the admixture model inferred the probability of alleles identical in state when estimating the ancestral breed compositions for individual animals. Nevertheless, they corresponded approximately to the probability of alleles identical by descent when confined to long chromosome chunks. We noted that alternative approaches existed, such as path analysis (Wu et al., 2020) and the breed-of-origin (BOA) approach (Calus et al., 2022), that could better handle the situation when ancestral breeds were correlated. In the present study, however, the correlation between Angus and Brahman was low (0.05–0.10). Hence, we used the admixture model because is theoretically instructive and easily implemented in practice.

Data availability statement

All data were taken from the databases of the Neogen Global Laboratories. Example data and programs are available at: https://doi.org/10.6084/m9.figshare.22303009.v1. Inquiries can be directed to the corresponding author.

Ethics statement

Ethical review and approval was not required for the animal study because All data were taken from the databases of the Neogen Global Laboratories. No animal experiments were involved.

Author contributions

X-LW and JH conceived and planned this study in discussion with WG, CS, RT, and SB. ZL extracted the data and carried out the data analyses, with assistance provided by FY, SY, ZG, and WC. ZL drafted this manuscript, revised by X-LW, JH, and WG. All authors proofread and approved this manuscript.

Funding

This research was funded by the Special Fund for the Construction of Innovative Provinces in Hunan (Grant number 2021NK1009), and Hunan Provincial Natural Science Foundations of China (Grant numbers 2022JJ30286, 2021JJ40254).

Conflict of interest

CS, RT, and SB are employees of Neogen GeneSeek, an agri-genomics service provider. X-LW is an employee of the Council of Dairy Cattle Breeding (CDCB), a non-profit organization responsible for calculating and distributing genetic evaluations and genomic predictions and for managing genomic data storage.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2023.1080279/full#supplementary-material

References

Akanno, E. C., Chen, L., Abo-Ismail, M. K., Crowley, J. J., Wang, Z., Li, C., et al. (2017). Genomic prediction of breed composition and heterosis effects in Angus, Charolais, and Hereford crosses using 50K genotypes. Can. J. Anim. Sci. 97 (3), 431–438. doi:10.1139/cjas-2016-0124

CrossRef Full Text | Google Scholar

Alexander, D. H., Novembre, J., and Lange, K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19 (9), 1655–1664. doi:10.1101/gr.094052.109

PubMed Abstract | CrossRef Full Text | Google Scholar

Ashburner, M., Ball, C. A., Blake, J. A., Botstein, D., Butler, H., Cherry, J. M., et al. (2000). Gene ontology: Tool for the unification of biology. The gene ontology consortium. Nat. Genet. 25, 25–29. doi:10.1038/75556

PubMed Abstract | CrossRef Full Text | Google Scholar

Bansal, V., and Libiger, O. (2015). Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations. BMC Bioinf 16 (1), 4. doi:10.1186/s12859-014-0418-7

CrossRef Full Text | Google Scholar

Bastin, B. C., Houser, A., Bagley, C. P., Ely, K. M., Payton, R. R., Saxton, A. M., et al. (2014). A polymorphism in XKR4 is significantly associated with serum prolactin concentrations in beef cows grazing tall fescueficantly associated with serum prolactin concentrations in beef cows grazing tall fescue. Anim. Genet. 45, 439–441. doi:10.1111/age.12134

PubMed Abstract | CrossRef Full Text | Google Scholar

Berry, D. P. (2019). Genomic information in livestock has multiple uses in precision breeding and management. Livestock 24 (1), 30–33. doi:10.12968/live.2019.24.1.30

CrossRef Full Text | Google Scholar

Briggs, H. M., and Briggs, D. M. (1980). Modern breeds of livestock. 4th Ed. New York: Macmillan Publishing Co.

Google Scholar

Browning, S. R. (2008). Estimation of pairwise identity by descent from dense genetic marker data in a population sample of haplotypes. Genetics 178 (4), 2123–2132. doi:10.1534/genetics.107.084624

PubMed Abstract | CrossRef Full Text | Google Scholar

Calus, M. P. L., Henshall, J. M., Hawken, R., and Vandenplas, J. (2022). Estimation of dam line composition of 3-way crossbred animals using genomic information. Genet. Sel. Evol. 54, 44. doi:10.1186/s12711-022-00728-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Cinar, M. U., Kayan, A., Uddin, M. J., Jonas, E., Tesfaye, D., Phatsara, C., et al. (2012). Association and expression quantitative trait loci (eQTL) analysis of porcine AMBP, GC and PPP1R3B genes with meat quality traits. Mol. Biol. Rep. 39, 4809–4821. doi:10.1007/s11033-011-1274-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Edea, Z., Jung, K. S., Shin, S. S., Yoo, S. W., Choi, J. W., and Kim, K. S. (2020). Signatures of positive selection underlying beef production traits in Korean cattle breeds. J. Anim. Sci. Technol. 62, 293–305. doi:10.5187/jast.2020.62.3.293

PubMed Abstract | CrossRef Full Text | Google Scholar

Edwards, D. B., Bates, R. O., and Osburn, W. N. (2003). Evaluation of Duroc-vs. Pietrain-sired pigs for carcass and meat quality measures. J. Anim. Sci. 81, 1895–1899. doi:10.2527/2003.8181895x

PubMed Abstract | CrossRef Full Text | Google Scholar

Falconer, D. S., and Mackay, T. F. C. (1996). Introduction to quantitative genetics. 4th ed. Essex, England: Longman Group Limited.

Google Scholar

Fortes, M. R. S., Reverter, A., Hawken, R. J., Bolormaa, S., and Lehnert, S. A. (2012). Candidate genes associated with testicular development, sperm quality, and hormone levels of inhibin, luteinizing hormone, and insulin-like growth factor 1 in Brahman bulls. Biol. Reprod. 87 (3), 58–8. doi:10.1095/biolreprod.112.101089

PubMed Abstract | CrossRef Full Text | Google Scholar

Fulda, S., Gorman, A. M., Hori, O., and Samali, A. (2010). Cellular stress responses: Cell survival and cell death. Int. J. Mol. Biol. 2010, 214074. Article ID 214074. doi:10.1155/2010/214074

CrossRef Full Text | Google Scholar

Funkhouser, S. A., Bates, R. O., Ernst, C. W., Newcomb, D., and Steibel, J. P. (2017). Estimation of genome-wide and locus-specific breed composition in pigsfic breed composition in pigs. Transl. Anim. Sci. 1, 36–44. doi:10.2527/tas2016.0003

PubMed Abstract | CrossRef Full Text | Google Scholar

Gobena, M., Elzo, M. A., and Mateescu, R. G. (2018). Population structure and genomic breed composition in an angus–brahman crossbred cattle population. Front. Genet. 9, 90. doi:10.3389/fgene.2018.00090

PubMed Abstract | CrossRef Full Text | Google Scholar

Goszczynski, D. E., Corbi-Botto, C. M., Durand, H. M., Rogberg-Mu∼noz, A., Munilla, S., Peral-Garcia, P., et al. (2017). Evidence of positive selection towards Zebuine haplotypes in the BoLA region of Brangus cattle. Animal 12, 215–223. doi:10.1017/S1751731117001380

PubMed Abstract | CrossRef Full Text | Google Scholar

Grigoletto, L., Ferraz, J. B. S., Oliveira, H. R., Eler, J. P., Bussiman, F, O., Abreu Silva, B. C., et al. (2020). Genetic architecture of carcass and meat quality traits in Montana Tropical® composite beef cattle. Front. Genet. 11, 123. doi:10.3389/fgene.2020.00123

PubMed Abstract | CrossRef Full Text | Google Scholar

Hartigan, J. A., and Wong, M. A. (1979). Algorithm as 136: A k-means clustering algorithm. J. R. Stat. Soc. C 28 (1), 100–108. doi:10.2307/2346830

CrossRef Full Text | Google Scholar

He, J., Guo, Y., Xu, J., Li, H., Fuller, A., Tait, R. G., et al. (2018). Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds. BMC Genet. 19 (1), 56. doi:10.1186/s12863-018-0654-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Howard, J. T., Kachman, S. D., Snelling, W. M., Pollak, E. J., Ciobanu, D. C., Kuehn, L. A., et al. (2014). Beef cattle body temperature during climatic stress: A genome-wide association study. Int. J. Biometeorol. 58 (7), 1665–1672. doi:10.1007/s00484-013-0773-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, Z-L., Park, C. A., and Reecy, J. M. (2022). Bringing the animal QTLdb and CorrDB into the future: Meeting new challenges and providing updated services. Nucleic Acids Res. 50 (D1), D956–D961. doi:10.1093/nar/gkab1116

PubMed Abstract | CrossRef Full Text | Google Scholar

Jain, A. K. (2010). Data clustering: 50 years beyond K-means. Pattern Recognit. Lett. 31 (8), 651–666. doi:10.1016/j.patrec.2009.09.011

CrossRef Full Text | Google Scholar

Jannink, J.-L., and Wu, X.-L. (2003). Estimating allelic number and identity in state of QTLs in interconnected families. Genet. Res. 81 (2), 133–144. doi:10.1017/s0016672303006153

PubMed Abstract | CrossRef Full Text | Google Scholar

Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M., and Tanabe, M. (2016). KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 44 (D1), D457–D462. doi:10.1093/nar/gkv1070

PubMed Abstract | CrossRef Full Text | Google Scholar

Kizilkaya, K., Tait, R. G., Garrick, D. J., Fernando, R. L., and Reecy, J. M. (2013). Genome-wide association study of infectious bovine keratoconjunctivitis in Angus cattle. BMC Genet. 14 (1), 23. doi:10.1186/1471-2156-14-23

PubMed Abstract | CrossRef Full Text | Google Scholar

Kononoff, P. J., Defoor, P. J., Engler, M. J., Swingle, R. S., Gleghorn, J. F., James, S. T., et al. (2017). Impacts of a leptin SNP on growth performance and carcass characters in finishing steers studied over timefinishing steers studied over time. J. Anim. Sci. 95, 194–200. doi:10.2527/jas2016.0926

PubMed Abstract | CrossRef Full Text | Google Scholar

Lawson, D. J., Hellenthal, G., Myers, S., Falush, D., and Zhang, F. (2012). Inference of population structure using dense haplotype data. PLoS Genet. 8, e1002453. doi:10.1371/journal.pgen.1002453

PubMed Abstract | CrossRef Full Text | Google Scholar

Leach, R. J., Craigmile, S. C., Knott, S. A., Williams, J. L., and Glass, E. J. (2010). Quantitative trait loci for variation in immune response to a Foot-and-Mouth Disease virus peptide. BMC Genet. 11 (1), 107. doi:10.1186/1471-2156-11-107

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Z., Wu, X.-L., Guo, W., He, J., Li, H., Rosa, G. J. M., et al. (2020). Estimation of genomic breed composition of individual animals in composite beef cattle. Anim. Genet. 51 (3), 457–460. doi:10.1111/age.12928

PubMed Abstract | CrossRef Full Text | Google Scholar

Lindholm-Perry, A. K., Kuehn, L. A., Smith, T. P. L., Ferrell, C. L., Jenkins, T. G., Freetly, H. C., et al. (2012). A region on BTA14 that includes the positional candidate genes LYPLA1, XKR4 and TMEM68 is associated with feed intake and growth phenotypes in cattle. Anim. Genet. 43, 216–219. doi:10.1111/j.1365-2052.2011.02232.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Littlejohn, M., Grala, T., Sanders, K., Walker, C., Waghorn, G., MacDonald, K., et al. (2012). Genetic variation in PLAG1 associates with early life body weight and peripubertal weight and growth in Bos taurus. Anim. Genet. 43, 591–594. doi:10.1111/j.1365-2052.2011.02293.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, X., Guo, X. Y., Xu, X. Z., Wu, M., Zhang, X., Li, Q., et al. (2012). Novel single nucleotide polymorphisms of the bovine methyltransferase 3b gene and their association with meat quality traits in beef cattle. Genet. Mol. Res. 11, 2569–2577. doi:10.4238/2012.June.29.1

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, L., Sonstegard, T. S., Cole, J. B., VanTassell, C. P., Wiggans, G. R., Crooker, B. A., et al. (2019). Genome changes due to artificial selection in U.S. Holstein cattle. BMC Genomics 20 (1), 128. doi:10.1186/s12864-019-5459-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Machado, M. A., S Azevedo, A. L., Teodoro, R. L., Pires, M. A., Cd Peixoto, M. G., de Freitas, C., et al. (2010). Genome wide scan for quantitative trait loci affecting tick resistance in cattle (Bos taurus x Bos indicus). BMC Genomics 11 (1), 280. doi:10.1186/1471-2164-11-280

PubMed Abstract | CrossRef Full Text | Google Scholar

Merchant, S., Wood, D. E., and Salzberg, S. L. (2014). Unexpected cross-species contamination in genome sequencing projects. PeerJ 2, e675. doi:10.7717/peerj.675

PubMed Abstract | CrossRef Full Text | Google Scholar

Nelson, D. C., Schmelcher, M., Rodriguez-Rubio, L., Klumpp, J., Pritchard, D. G., Dong, S., et al. (2012). Endolysins as antimicrobials. Adv. Virus Res. 83, 299–365. doi:10.1016/B978-0-12-394438-2.00007-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Nocedal, J., and Wright, S. J. (2006). Numerical optimization. New York, NY: Spinger.

Google Scholar

Paim, T. D. P., Hay, E. H. A., Wilson, C., Thomas, M. G., Kuehn, L. A., Paiva, S. R., et al. (2020a). Dynamics of genomic architecture during composite breed development in cattle. Anim. Genet. 51, 224–234. doi:10.1111/age.12907

PubMed Abstract | CrossRef Full Text | Google Scholar

Paim, T. D. P., Hay, E. H. A., Wilson, C., Thomas, M. G., Kuehn, L. A., Paiva, S. R., et al. (2020b). Genomic breed composition of selection signatures in Brangus beef cattle. Front. Genet. 11 (710), 710. doi:10.3389/fgene.2020.00710

PubMed Abstract | CrossRef Full Text | Google Scholar

Porto Neto, L. R., Bunch, R. J., Harrison, B. E., and Barendse, W. (2012). Variation in the XKR4 gene was significantly associated with subcutaneous rump fat thickness in indicine and composite cattleficantly associated with subcutaneous rump fat thickness in indicine and composite cattle. Anim. Genet. 43, 785–789. doi:10.1111/j.1365-2052.2012.02330.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Porto-Neto, L. R., Reverter, A., Prayaga, K. C., Chan, E. K. F., Johnston, D. J., Hawken, R. J., et al. (2014). The genetic architecture of climatic adaptation of tropical cattle. PLoS One 9, e0113284. doi:10.1371/journal.pone.0113284

CrossRef Full Text | Google Scholar

Pritchard, J. K., Stephens, M., and Donnelly, P. (2000). Inference of population structure using multilocus genotype data. Genetics 155 (2), 945–959. doi:10.1093/genetics/155.2.945

PubMed Abstract | CrossRef Full Text | Google Scholar

Pryce, J. E., Arias, J., Bowman, P. J., Davis, S. R., Macdonald, K. A., Waghorn, G. C., et al. (2012). Accuracy of genomic predictions of residual feed intake and 250-day body weight in growing heifers using 625,000 single nucleotide polymorphism markers. J. Dairy Sci. 95, 2108–2119. doi:10.3168/jds.2011-4628

PubMed Abstract | CrossRef Full Text | Google Scholar

Rodgers, J. R., and Cook, R. G. (2005). MHC class Ib molecules bridge innate and acquired immunity. Nat. Rev. Immunol. 5 (6), 459–471. doi:10.1038/nri1635

PubMed Abstract | CrossRef Full Text | Google Scholar

Rolf, M. M., Taylor, J. F., Schnabel, R. D., McKay, S. D., McClure, M. C., Northcutt, S. L., et al. (2012). Genome-wide association analysis for feed efficiency in Angus cattle. Anim. Genet. 43 (4), 367–374. doi:10.1111/j.1365-2052.2011.02273.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Sabeti, P. C., Varilly, P., Varilly, P., Fry, B., Lohmueller, J., Hostetter, E., et al. (2007). Genome-wide detection and characterization of positive selection in human populations. Nature 449 (7164), 913–918. doi:10.1038/nature06250

PubMed Abstract | CrossRef Full Text | Google Scholar

Sartori, R., Bastos, M., Baruselli, P., Gimenes, L., Ereno, R., and Barros, C. (2010). Physiological differences and implications to reproductive management of Bos taurus and Bos indicus cattle in a tropical environment. Soc. Reprod. Fertil. Suppl. 67, 357–375. doi:10.7313/upo9781907284991.028

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, A., Mehrotra, A., Gondro, C., Romero, A. R., Pandey, A. K., Karthikeyan, A., et al. (2020). Signatures of selection in composite vrindavani cattle of India. Front. Genet. 11, 589496. doi:10.3389/fgene.2020.589496

PubMed Abstract | CrossRef Full Text | Google Scholar

Smedley, D., Haider, S., Durinck, S., Pandini, L., Provero, P., Allen, J., et al. (2015). The BioMart community portal: An innovative alternative to large, centralized data repositories. Nucleic Acids Res. 43 (W1), W589–W598. doi:10.1093/nar/gkv350

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith, J. M., and Haigh, J. (1974). The hitch-hiking effect of a favourable gene. Genet. Res. 23 (1), 391–403. doi:10.1017/S0016672308009579

CrossRef Full Text | Google Scholar

Snelling, W. M., Allan, M. F., Keele, J. W., Kuehn, L. A., McDaneld, T., Smith, T. P. L., et al. (2010). Genome-wide association study of growth in crossbred beef cattle. J. Anim. Sci. 88 (3), 837–848. doi:10.2527/jas.2009-2257

PubMed Abstract | CrossRef Full Text | Google Scholar

Song, Y., Xu, L., Chen, Y., Zhang, L., Gao, H., Zhu, B., et al. (2016). Genome-wide association study reveals the PLAG1 gene for knuckle, biceps and shank weight in Simmental beef cattle. PLoS One 11, e0168316. doi:10.1371/journal.pone.0168316

PubMed Abstract | CrossRef Full Text | Google Scholar

Souza, F. R. P. P., Mercadante, M. E. Z. Z., Fonseca, L. F. S. S., Ferreira, L. M. S. S., Regatieri, I. C., Ayres, D. R., et al. (2010). Assessment of DGAT1 and LEP gene polymorphisms in three Nelore (Bos indicus) lines selected for growth and their relationship with growth and carcass traits. J. Anim. Sci. 88, 435–441. doi:10.2527/jas.2009-2174

PubMed Abstract | CrossRef Full Text | Google Scholar

Takada, L., Barbero, M. M. D., Oliveira, H. N., de Camargo, G. M. F., Fernandes Júnior, G. A., Aspilcueta-Borquis, R. R., et al. (2018). Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction. PLoS One 13, e0190197. doi:10.1371/journal.pone.0190197

PubMed Abstract | CrossRef Full Text | Google Scholar

Terakado, P. N., Costa, R. B., de Camargo, G. M. F., Irano, N., Bresolin, T., Takada, L., et al. (2018). Genome-wide association study for growth traits in Nelore cattle. Animal 12, 1358–1362. doi:10.1017/S1751731117003068

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y., Wu, X.-L., Li, Z., Bao, Z., Tait, R. G., Bauck, S., et al. (2020). Estimation of genomic breed composition for purebred and crossbred animals using sparsely regularized admixture models. Front. Genet. 11, 576. doi:10.3389/fgene.2020.00576

PubMed Abstract | CrossRef Full Text | Google Scholar

Weng, Z., Su, H., Saatchi, M., Lee, J., Thomas, M. G., Dunkelberger, J. R., et al. (2016). Genome-wide association study of growth and body composition traits in Brangus beef cattle. Livest. Sci. 183, 4–11. doi:10.1016/j.livsci.2015.11.011

CrossRef Full Text | Google Scholar

Williams, G. L., Amstalden, M., Garcia, M. R., Stanko, R. L., Nizielski, S. E., Morrison, C. D., et al. (2002). Leptin and its role in the central regulation of reproduction in cattle. Domest. Anim. Endocrinol. 23, 339–349. doi:10.1016/S0739-7240(02)00169-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Woronuk, G. N., Marquess, F. L., James, S. T., Palmer, J., Berryere, T., Deobald, H., et al. (2012). Association of leptin genotypes with beef cattle characteristics. Anim. Genet. 43, 608–610. doi:10.1111/j.1365-2052.2012.02320.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, X.-L., Li, Z., Wang, Y., He, J., Rosa, G. J. M., Ferretti, R., et al. (2020). A causality perspective of genomic breed composition for composite animals. Front. Genet. 11 (1369), 546052. doi:10.3389/fgene.2020.546052

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, G., Wang, L.-G., Han, Y., and He, Q.-Y. (2012). clusterProfiler: an R Package for comparing biological themes among gene clusters. OMICS 16 (5), 284–287. doi:10.1089/omi.2011.0118

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: admixture model, ancestry, composite animals, genomic hitchhiking, linkage disequiblibrium, SNP

Citation: Li Z, He J, Yang F, Yin S, Gao Z, Chen W, Sun C, Tait RG, Bauck S, Guo W and Wu X-L (2023) A look under the hood of genomic-estimated breed compositions for brangus cattle: What have we learned?. Front. Genet. 14:1080279. doi: 10.3389/fgene.2023.1080279

Received: 26 October 2022; Accepted: 14 March 2023;
Published: 28 March 2023.

Edited by:

Kefei Chen, Curtin University, Australia

Reviewed by:

Peipei Ma, Shanghai Jiao Tong University, China
Lingyang Xu, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences (CAAS), China

Copyright © 2023 Li, He, Yang, Yin, Gao, Chen, Sun, Tait, Bauck, Guo and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jun He, hejun@hunau.edu.cn; Xiao-Lin Wu, xuw8@wisc.edu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.