ORIGINAL RESEARCH article

Front. Genet., 17 April 2024
Sec. Livestock Genomics

Detection and characterization of copy number variation in three differentially-selected Nellore cattle populations

Lorena F. Benfica1,2*, Luiz F. Brito1, Ricardo D. do Bem2, Leticia F. de Oliveira1, Henrique A. Mulim1, Larissa G. Braga2,3, Joslaine N. S. G. Cyrillo4, Sarah F. M. Bonilha4, Maria Eugenia Z. Mercadante2,4
  • 1Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
  • 2Department of Animal Science, Faculty of Agricultural and Veterinary Sciences, Sao Paulo State University, Jaboticabal, São Paulo, Brazil
  • 3Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada
  • 4Beef Cattle Research Center, Institute of Animal Science, Sertaozinho, São Paulo, Brazil

Introduction: Nellore cattle (Bos taurus indicus) is the main beef cattle breed raised in Brazil. This breed is well adapted to tropical conditions and, more recently, has experienced intensive genetic selection for multiple performance traits. Over the past 43 years, an experimental breeding program has been developed at the Institute of Animal Science (IZ, Sertaozinho, SP, Brazil), which resulted in three differentially-selected lines known as Nellore Control (NeC), Nellore Selection (NeS), and Nellore Traditional (NeT). The primary goal of this selection experiment was to determine the response to selection for yearling weight (YW) and residual feed intake (RFI) in Nellore cattle. The main objectives of this study were to: 1) identify copy number variations (CNVs) in Nellore cattle from three selection lines; 2) identify and characterize CNV regions (CNVRs) in these three lines; and 3) perform functional enrichment analyses of the CNVRs identified.

Results: A total of 14,914 unique CNVs and 1,884 CNVRs were identified when considering all lines as a single population. The CNVRs were non-uniformly distributed across the chromosomes of the three selection lines included in the study. The NeT line had the highest number of CNVRs (n = 1,493), followed by the NeS (n = 823) and NeC (n = 482) lines. The CNVRs covered 23,449,890 bp (0.94%), 40,175,556 bp (1.61%), and 63,212,273 bp (2.54%) of the genome of the NeC, NeS, and NeT lines, respectively. Two CNVRs were commonly identified between the three lines, and six, two, and four exclusive regions were identified for NeC, NeS, and NeT, respectively. All the exclusive regions overlap with important genes, such as SMARCD3, SLC15A1, and MAPK1. Key biological processes associated with the candidate genes were identified, including pathways related to growth and metabolism.

Conclusion: This study revealed large variability in CNVs and CNVRs across three Nellore lines differentially selected for YW and RFI. Gene annotation and gene ontology analyses of the exclusive CNVRs to each line revealed specific genes and biological processes involved in the expression of growth and feed efficiency traits. These findings contribute to the understanding of the genetic mechanisms underlying the phenotypic differences among the three Nellore selection lines.

1 Introduction

Nellore cattle (Bos taurus indicus) is the main beef cattle breed raised in Brazil, one of the largest beef producers and exporters in the world (United States Department of Agriculture, 2023). Nellore animals are well adapted to harsh climatic conditions and Brazilian herds have experienced major genetic progress for performance traits over the past decades (Fernandes Junior et al., 2022). In addition to the national Nellore breeding programs, an experimental breeding program was initiated in 1980 at the Institute of Animal Science (IZ; Sertãozinho, SP, Brazil), with the establishment of three selection lines. At the beginning of the breeding program, the primary goal of the experiment was to assess the response to selection for heavier weights in a tropical beef cattle population (Mercadante et al., 2003). Briefly, the three selection lines were established by randomly dividing the founder animals into three groups: Nellore Control (NeC), Nellore Selection (NeS), and Nellore Traditional (NeT). NeC was maintained under stabilizing selection, in which animals with a yearling weight (YW) close to the average of the contemporary group were selected for breeding each year. NeS and NeT were selected for higher selection differentials for YW, and in 2008, residual feed intake (RFI) was also introduced as a selection criterion in the NeT line (Mercadante et al., 2003; Cardoso et al., 2018; Benfica et al., 2020).

After more than 40 years of selection, there are clear phenotypic and genetic differences among the lines subjected to stabilizing and directional selection. Cardoso et al. (2018) reported average yearling weight (YW) for males of 275 kg for the NeC line, 350 kg for the NeS line, and 360 kg for the NeT line, and Benfica et al. (2020) reported average EBV for YW of 14.5 kg for NeC, 69.3 kg for NeS, and 72.2 kg for NeT, highlighting substantial differences for YW between the three selection lines. Besides YW, substantial differences have been observed in other traits such as average body weight at different ages, body measurements, RFI, scrotal circumference, and carcass quality (Mercadante et al., 2003; Monteiro et al., 2013; Ceacero et al., 2016). Therefore, these three lines are a valuable resource for identifying genomic regions related to selection signatures, offering insights into the genes governing the phenotypic expression of these traits. Several studies have delved into the genetic mechanisms underlying phenotypic variations among these Nellore lines. For instance, genome-wide association studies (GWAS) have pinpointed key genes associated with growth and feed efficiency traits, while population genetic stratification has highlighted autosomal genomic regions exhibiting selection footprints (Ayres et al., 2010; Souza et al., 2011; Cardoso et al., 2018). An additional source of variation that could be further explored is copy number variation (CNV), since artificial selection for desired traits has also been reported to impact the number of CNVs in animal genomes (Seol et al., 2019; Shi et al., 2023). For instance, a previous study reported 3,161 CNVs and 561 CNV regions (CNVRs) in Nellore cattle, in which various CNVRs were significantly associated with dry matter intake and frequency of visits to the feed bunk (Benfica et al., 2024).

Copy number variations are structural variations within an individual’s genome, involving the loss or gain of DNA fragments, which can range from 1 kilobase pair (kb) to several megabases (Mb) in size when compared to the reference genome of the species (Henrichsen et al., 2009). CNVs span extensive chromosomal regions and can change gene structure, regulatory modifications, gene dosage, and exposure of recessive alleles, leading to significant impact on gene expression (Zhang et al., 2009; Stafuzza et al., 2019) and phenotypic variability in complex traits (Zhang et al., 2009). The study of CNVs serves as a valuable source of information to elucidate some of the biological mechanisms contributing to the differences among the three experimental selection lines and to the phenotypic variations observed in economically important traits. Genetic selection for specific traits can lead to differential changes in allele frequencies across populations, and consequently, alterations in the genome of the animals (Bickhart et al., 2016; Buffalo and Coop, 2020; Das et al., 2021). CNV is a type of genome structural change that could drive phenotypic variation, evolution, and adaptation in populations under selection (Redon et al., 2006; Zhang et al., 2009; Lemos et al., 2018). Therefore, direct selection for weight gain may have shaped the landscape of CNVs in the genome of the cattle lines under directional selection. Hence, the primary objectives of this study were to: 1) identify and characterize CNVs and CNVRs in Nellore cattle from three differentially-selected lines; and 2) perform functional enrichment analyses of the identified CNVRs.

2 Materials and methods

2.1 Animals and experimental breeding program design

Data were collected from 928 animals, including 114 from the NeC line, 245 from the NeS line, and 569 from the NeT line. These animals were born between 2004 and 2019 and are part of the Nellore cattle herd from the Institute of Animal Science (IZ) in Sertãozinho, SP, Brazil. The animals are part of an experimental breeding program initiated in 1980 and separated into three selection lines: NeC, NeS, and NeT. These three lines are considered closed lines (Mercadante et al., 2003). Bulls were chosen from contemporary groups (defined by line and year) based on their YW adjusted to 378 days (W378) after a 168-day feedlot performance test. Replacement females, on the other hand, were selected based on their YW adjusted to 550 days (W550) while kept on pasture.

In the NeC line, males and females with a selection differential close to zero for YW were retained for breeding. Animals from the NeC line have maintained YW values that are close to the average observed at the outset of the breeding program in 1980. In contrast, for the selected NeS and NeT lines, both males and females with higher adjusted weights were selected over time. Starting in 2008, the bulls from the NeT line have been selected based on higher genomic estimated breeding values (GEBV) for YW and lower GEBV for RFI (more feed efficient animals) (Mercadante et al., 2003; Cardoso et al., 2018; Benfica et al., 2020). RFI was estimated as the residual of the linear regression equation of dry matter intake (DMI) on average daily gain (ADG) and mid-test metabolic weight (BW^0.75) (Koch et al., 1963) in each test group.
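
The RFI definition above can be illustrated with a minimal sketch, assuming a single test group and an ordinary least-squares fit with an intercept. The function names and the five-animal data set are hypothetical and serve only to show the calculation; they are not the authors' code or data.

```python
def solve_linear_system(a, b):
    """Solve a small dense system a @ x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def residual_feed_intake(dmi, adg, body_weight):
    """RFI = observed DMI minus DMI predicted from ADG and BW^0.75."""
    rows = [[1.0, g, w ** 0.75] for g, w in zip(adg, body_weight)]
    # Normal equations of the regression: (X'X) beta = X'y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, dmi)) for i in range(3)]
    beta = solve_linear_system(xtx, xty)
    return [y - sum(b * x for b, x in zip(beta, r)) for r, y in zip(rows, dmi)]


# Hypothetical test group of five animals (illustrative values only).
dmi = [7.9, 8.4, 9.1, 8.8, 9.6]                     # kg/day
adg = [1.05, 1.30, 1.10, 1.35, 1.20]                # kg/day
mid_test_bw = [330.0, 345.0, 372.0, 360.0, 381.0]   # kg
rfi = residual_feed_intake(dmi, adg, mid_test_bw)
```

Because the model includes an intercept, the RFI values of a test group sum to zero; negative values indicate animals that ate less than predicted for their gain and metabolic weight.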

The sire selection strategy has been consistently applied to this day, involving the annual replacement of 50% of the three-year-old sires within each line. Furthermore, the annual culling rate for cows is approximately 20%. Figure 1 illustrates the differentiation in the phenotypic performance of the lines achieved through selection.

Figure 1. (A,B) Four-year-old sires from two differentially selected Nellore lines. NeS (right) and NeC (left) (Institute of Animal Science, 2020).

2.2 Genomic datasets

A total of 928 Nellore animals, including 625 males and 303 females, were genotyped with the Illumina BovineHD BeadChip (HD, Illumina Inc., San Diego, CA, United States; n = 770) or GeneSeek Genomic Profiler 50K (50K, GeneSeek Inc., Lincoln, NE, United States; n = 158) SNP panels. Approximately 75% of animals from the NeC line, 79% from the NeS line, and 86% from the NeT line were genotyped using the HD SNP panel (Supplementary Material S1). The HD and 50K SNP panels contained 777,962 and 54,791 SNPs, respectively, distributed throughout the genome. The mean distance between markers in the HD SNP panel was approximately equal to 3.43 ± 4.4 kilobases (Kb), while in the 50K panel, it was 49.2 ± 99.1 Kb. To ensure genomic data quality, non-autosomal SNPs, SNPs with an unknown genomic position, and SNPs with a GenCall score below 0.15 were removed during the quality control step. After the quality control process, 734,593 and 51,613 SNPs remained for subsequent analyses in the HD and 50K SNP panels, respectively.

2.3 Identification of copy number variation

The CNV identification was carried out separately for each SNP panel dataset using the PennCNV.1.0.5 software (Wang et al., 2007). This software integrates Log R Ratio (LRR) and B Allele Frequency (BAF) data on a per-sample basis into a hidden Markov model to determine the number of copies and genotypes of each CNV. LRR measures the total signal intensity, while BAF measures the proportion of the B allele in each sample. The population frequency of the B allele was calculated using the BAF value of each SNP in all samples. Furthermore, the LRR values were adjusted for the guanine-cytosine content at 500 kb upstream and downstream of each SNP based on a regression model (Diskin et al., 2008). This correction aims to reduce waviness that may result from the correlation between LRR and guanine-cytosine content in genomic regions, which could interfere with CNV detection.
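
As a rough illustration of the two signals described above, the sketch below uses the definitions commonly given for Illumina-type arrays: LRR as the log2 ratio of observed to expected total intensity, and BAF as a piecewise-linear interpolation of the allelic angle between the genotype cluster centres. The cluster positions, the function names, and the use of a simple mean for the population frequency of the B allele are assumptions for illustration; in practice these values are exported by the genotyping software and read by PennCNV.

```python
import math


def log_r_ratio(r_observed, r_expected):
    """LRR: log2 of observed over expected total signal intensity."""
    return math.log2(r_observed / r_expected)


def b_allele_frequency(theta, theta_aa, theta_ab, theta_bb):
    """BAF: allelic angle interpolated between the AA, AB and BB cluster centres."""
    if theta < theta_aa:
        return 0.0
    if theta < theta_ab:
        return 0.5 * (theta - theta_aa) / (theta_ab - theta_aa)
    if theta < theta_bb:
        return 0.5 + 0.5 * (theta - theta_ab) / (theta_bb - theta_ab)
    return 1.0


def population_b_allele_frequency(baf_values):
    """Assumed here to be the mean BAF of one SNP across all samples."""
    return sum(baf_values) / len(baf_values)


# Hypothetical cluster centres for one SNP (illustrative values only).
clusters = dict(theta_aa=0.1, theta_ab=0.5, theta_bb=0.9)
example_baf = b_allele_frequency(0.3, **clusters)
example_lrr = log_r_ratio(0.5, 1.0)  # half the expected intensity
```

In this reading, a one-copy deletion tends to show a negative LRR together with a loss of heterozygous BAF values near 0.5, which is the kind of joint pattern the hidden Markov model exploits.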

Following CNV calling, a sample-based quality control process was implemented. CNVs were retained only when the standard deviation of LRR did not exceed 0.30, the BAF drift was below 0.01, and the GC wave factor was below 0.05 (after genomic wave correction based on guanine-cytosine content), and when their length was between 1,000 bp and 5,000,000 bp. CNVs with fewer than three consecutive SNPs were also discarded. After this quality control, 883 animals and 14,914 CNVs (14,391 from the HD panel and 523 from the 50K panel) remained for further analyses. The CNVs identified were then separated by selection line, creating a distinct CNV dataset for each line that was used for line-specific analyses. This approach enabled a thorough evaluation of CNVs within each selection line, providing valuable insights into the genetic diversity and potential functional significance of CNVs in these Nellore lines.
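
The quality control thresholds can be read as a two-level filter, sketched below under stated assumptions: sample-level metrics (LRR standard deviation, BAF drift, GC wave factor) are treated as upper limits, the wave factor is compared in absolute value, and call-level criteria cover length and SNP count. The class and function names are hypothetical, and the example records are invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class CnvCall:
    sample_id: str
    chrom: str
    start: int       # 1-based, inclusive
    end: int         # 1-based, inclusive
    num_snps: int
    copy_number: int


@dataclass
class SampleQuality:
    lrr_sd: float
    baf_drift: float
    gc_wave_factor: float


def sample_passes(q, max_lrr_sd=0.30, max_baf_drift=0.01, max_gcwf=0.05):
    # Assumption: all three metrics act as upper limits; the wave factor
    # can be negative, so its absolute value is compared.
    return (q.lrr_sd <= max_lrr_sd
            and q.baf_drift <= max_baf_drift
            and abs(q.gc_wave_factor) <= max_gcwf)


def cnv_passes(call, min_len=1_000, max_len=5_000_000, min_snps=3):
    length = call.end - call.start + 1
    return min_len <= length <= max_len and call.num_snps >= min_snps


def filter_calls(calls, quality_by_sample):
    """Keep calls from passing samples that also meet the call-level criteria."""
    return [c for c in calls
            if sample_passes(quality_by_sample[c.sample_id]) and cnv_passes(c)]


# Invented example: sample B fails on LRR SD; two calls of sample A fail
# on length and on SNP count, respectively.
quality = {"A": SampleQuality(0.20, 0.002, 0.01),
           "B": SampleQuality(0.45, 0.002, 0.01)}
calls = [CnvCall("A", "1", 10_000, 60_000, 12, 1),
         CnvCall("A", "1", 100, 600, 5, 3),
         CnvCall("A", "2", 1_000, 9_000, 2, 3),
         CnvCall("B", "1", 10_000, 60_000, 12, 1)]
kept = filter_calls(calls, quality)
```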

2.4 Identification of copy number variation regions

The CNVRs were defined by grouping CNVs that had at least 1 bp overlap (Yan et al., 2015; Ma et al., 2017; Yang et al., 2017; Zhou et al., 2020; Zhou et al., 2022) using the mergeBed option of the BEDtools suite (Quinlan and Hall, 2010). This approach was applied in two contexts: across the entire population and within each selection line. CNVRs were classified as “loss” when an animal displayed a region with a loss of a chromosomal segment in comparison to the reference genome (deletions), “gain” for repeated chromosomal regions (duplications), and “mixed” when both loss and gain were identified within the same genomic region. Furthermore, CNVRs that were present in at least 10% of the animals of each line were identified. The CNVRs identified separately for each selection line were then compared across lines, and common and exclusive regions were identified based on their overlap.
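
The merging rule and the loss/gain/mixed classification can be sketched in plain Python as follows. This is a stand-in for illustration rather than the BEDtools implementation: coordinates are assumed to be 1-based and inclusive, so two CNVs overlap by at least 1 bp when the start of one is not greater than the end of the other, and the function name and toy intervals are hypothetical.

```python
def merge_cnvs_into_cnvrs(cnvs):
    """Merge overlapping CNVs into CNVRs and label each region.

    cnvs: iterable of (chrom, start, end, kind) tuples, kind in {"loss", "gain"}.
    """
    regions = []
    for chrom, start, end, kind in sorted(cnvs):
        last = regions[-1] if regions else None
        if last and last["chrom"] == chrom and start <= last["end"]:
            # At least 1 bp shared with the current region: extend it.
            last["end"] = max(last["end"], end)
            last["kinds"].add(kind)
        else:
            regions.append({"chrom": chrom, "start": start,
                            "end": end, "kinds": {kind}})
    for region in regions:
        kinds = region["kinds"]
        region["type"] = "mixed" if len(kinds) > 1 else next(iter(kinds))
    return regions


# Toy CNV calls (illustrative values only).
toy_cnvs = [("1", 100, 200, "loss"),
            ("1", 200, 300, "gain"),   # shares position 200 with the first call
            ("1", 301, 400, "gain"),   # adjacent but not overlapping
            ("2", 50, 80, "loss")]
toy_cnvrs = merge_cnvs_into_cnvrs(toy_cnvs)
summary = [(r["chrom"], r["start"], r["end"], r["type"]) for r in toy_cnvrs]
```

Running the same function on the pooled calls and on each line's calls separately corresponds to the two contexts described above.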

2.5 Gene annotation and functional analyses

The CNVRs exclusive to each line were used for annotation purposes. The gene and QTL annotation in these regions were performed using the GALLO package (Fonseca et al., 2020), utilizing annotated data for Bos taurus retrieved from the Ensembl database (www.ensembl.org/Bos_taurus/Info/Index) and reference genome ARS-UCD1.2 (Rosen et al., 2020). Additionally, the Cattle QTL database (www.animalgenome.org/cgi-bin/QTLdb/BT/index) was used as a resource for obtaining previously-reported QTL information. The gprofiler2 package (Kolberg et al., 2020) was used for conducting Gene Ontology (GO) and KEGG pathway enrichment (p < 0.05) analyses to identify biological processes, molecular functions, cellular components, and biological pathways associated with the positional candidate genes identified.

3 Results

Table 1 presents descriptive statistics of all the animals from the three selection lines included in this study. The NeT line comprises the largest number of animals, followed by NeS and NeC. W378 ranged from 298 kg for NeC to 382 kg for NeS. In the case of W550, NeT had the highest average weight (363 ± 28 kg). Furthermore, the NeC line had the lowest average RFI (−0.112 ± 0.53 kg/day), followed by NeS (−0.032 ± 0.61 kg/day) and NeT (0.032 ± 0.60 kg/day).

Table 1. Descriptive statistics for Nellore Control (NeC), Nellore Selection (NeS), and Nellore Traditional (NeT).

3.1 Copy number variation and CNVR detection for the Nellore population

Initially, 20,259 CNVs were identified in 922 animals. After quality control, 14,914 CNVs located on autosomal chromosomes of 883 animals remained for further analyses, with an average of 16 CNVs per animal (range: 1–45). Among these identified CNVs, 3,680 were categorized as losses and 11,234 as gains. The length of the CNVs varied from 1,216 bp to 1,119,208 bp, with an average length of 75,632 ± 100,827 bp. Notably, CNVs were detected on all autosomal chromosomes and were non-uniformly distributed across the genome.

The 14,914 CNVs that remained after quality control were used to infer CNVRs by merging CNVs with at least a 1 bp overlap. This resulted in the identification of 1,884 CNVRs, with an average CNVR length of 40,887 ± 104,812 bp (range: 1,215 to 1,807,286 bp). Among these CNVRs, 400 of them were associated with genome losses, 1,412 with gains, and 72 with a mixed pattern, where the same chromosomal segment exhibited both deletion and duplication in the population. The number and proportion of chromosomes covered by CNVRs varied considerably (Table 2). BTA1 had the highest number of CNVRs (n = 181), covering 4.03% of the chromosome, while BTA12 had the highest coverage of a chromosome sequence (7.94%) with 107 CNVRs. In contrast, BTA25 had the lowest number of CNVR (n = 23) and BTA24 had the lowest coverage of a chromosome sequence at 0.87%. In total, the CNVRs identified in this study covered 77,031,673 bp of the autosomal genome sequence, which corresponds to approximately 3.09% of the cattle genome size.

Table 2. Chromosome distribution of all 1,884 copy number variation regions (CNVRs) detected in the Nellore cattle genome.

A noteworthy CNVR was identified in 847 animals, encompassing approximately 90% of the studied population (928 animals). This particular mixed type CNVR is located on BTA7, spanning a length of 1,133,904 bp. The gene content of this CNVR was thoroughly investigated, revealing an overlap with a total of 62 annotated genes (Supplementary Material S2).

The number and length of CNVs and CNVRs identified per SNP panel (50K and HD) were compared (Supplementary Material S1). The number of CNVs (50K: 523; HD: 14,391) and CNVRs (50K: 115; HD: 1,796) was lower for the 50K SNP panel than for the HD SNP panel. Conversely, the average length of CNVs (50K: 114.4 ± 103 kb; HD: 74.2 ± 100 kb) and CNVRs (50K: 121.3 ± 129 kb; HD: 36.2 ± 96 kb) was smaller for the HD panel.

3.2 Copy number variation and CNVR detection by selection line

The 14,914 identified CNVs were categorized based on their respective selection lines, resulting in 1,510 CNVs in NeC animals, 3,899 CNVs in NeS, and 9,448 CNVs in NeT. The average CNV length was similar across the three selection lines, ranging from 71,886 ± 97,489 bp in NeC to 78,724 ± 102,183 bp in NeS. In all three lines, the number of gain type CNVs exceeded that of loss type CNVs, and the average (SD) number of CNVs per animal was 13.9 ± 7, 16.3 ± 8, and 17.6 ± 7 for NeC, NeS, and NeT, respectively. Detailed information about the CNVs per selection line after the quality control can be found in Table 3.

Table 3. Descriptive statistics of copy number variation (CNV) per Nellore selection line.

The CNVRs were non-uniformly distributed across the chromosomes of the three Nellore lines (Figure 2). NeT had the highest number of CNVRs (n = 1,493), followed by NeS (n = 823) and NeC (n = 482). Among the three lines, BTA1 had the largest number of CNVRs, with 34 CNVRs identified in NeC, 81 in NeS, and 130 in NeT. On the other hand, BTA24 had the lowest CNVR count in both the NeC and NeS lines, with seven CNVRs in each line. NeT’s lowest CNVR count was observed on BTA25, with a total of 18 CNVRs. The CNVR coverage in the genomes of NeC, NeS, and NeT summed up to 23,449,890 bp, 40,175,556 bp, and 63,212,273 bp, respectively. This represents 0.94%, 1.61%, and 2.54% of the bovine autosomal genome for NeC, NeS, and NeT, respectively.
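
The coverage percentages follow from dividing the summed CNVR length by the autosomal genome length. The short check below assumes an autosomal length of about 2.49 Gb, a value back-calculated from the reported figures rather than stated in the text; the constant and function names are hypothetical.

```python
# Assumption: approximate autosomal genome length, not a value from the article.
ASSUMED_AUTOSOMAL_LENGTH_BP = 2_490_000_000


def percent_covered(cnvr_bp, genome_bp=ASSUMED_AUTOSOMAL_LENGTH_BP):
    """Percentage of the genome spanned by CNVRs."""
    return 100.0 * cnvr_bp / genome_bp


# Summed CNVR lengths reported for the pooled population and for each line.
reported_bp = {"All lines": 77_031_673,
               "NeC": 23_449_890,
               "NeS": 40_175_556,
               "NeT": 63_212_273}
coverage = {name: round(percent_covered(bp), 2) for name, bp in reported_bp.items()}
```

Under this assumed genome length, the computed values agree with the reported 3.09%, 0.94%, 1.61%, and 2.54%.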

Figure 2. Distribution of copy number variation regions (deletions or losses, duplications or gains, and mixed type) by chromosome and selection line. (A) Nellore Control (NeC); (B) Nellore Selection (NeS); (C) Nellore Traditional (NeT).

3.3 Common and exclusive CNVRs in the Nellore lines and gene annotation

Twenty-five CNVRs, consisting of 6 losses, 4 gains, and 15 mixed type CNVRs, were identified in at least 10% of the NeC animals. In the NeS line, 32 CNVRs were observed, including 3 losses, 17 gains, and 12 mixed CNVRs. In the NeT line, 33 CNVRs were identified, with 4 losses, 18 gains, and 11 mixed CNVRs. The average length of these CNVRs was 283,307 ± 283,739 bp for NeC, 355,917 ± 290,815 bp for NeS, and 381,594 ± 354,594 bp for NeT. Interestingly, two CNVRs were commonly identified across all three selection lines. Additionally, there were 18 regions shared between NeC and NeS, 18 regions shared between NeC and NeT, and 29 regions shared between NeS and NeT, as illustrated in Figure 3. The two regions that were identified as common to all three lines overlapped with 11 annotated genes, as shown in Table 4.
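
One way to picture the frequency filter and the common/exclusive comparison is sketched below. The rule used here is an assumption, since the exact matching criterion is not detailed in the text: a region counts as shared between two lines when the regions overlap by at least 1 bp, and as exclusive when it overlaps no frequent region of the other lines. All names, regions, and carrier counts are hypothetical.

```python
def overlaps(a, b):
    """True when two (chrom, start, end) regions share at least 1 bp."""
    return a[0] == b[0] and a[1] <= b[2] and b[1] <= a[2]


def frequent_regions(carriers_by_region, n_animals, min_fraction=0.10):
    """Regions carried by at least min_fraction of the animals in a line."""
    return [region for region, carriers in carriers_by_region.items()
            if carriers / n_animals >= min_fraction]


def common_regions(target, *other_lines):
    """Regions of target that overlap some region in every other line."""
    return [r for r in target
            if all(any(overlaps(r, o) for o in line) for line in other_lines)]


def exclusive_regions(target, *other_lines):
    """Regions of target that overlap no region in any other line."""
    return [r for r in target
            if not any(overlaps(r, o) for line in other_lines for o in line)]


# Invented regions with carrier counts; line sizes are illustrative.
nec = frequent_regions({("12", 1000, 5000): 20, ("3", 100, 900): 5}, 114)
nes = frequent_regions({("12", 4000, 9000): 40, ("7", 500, 800): 30}, 245)
net = frequent_regions({("12", 4500, 4800): 100, ("1", 10, 20): 60}, 569)

shared_by_all = common_regions(nec, nes, net)
only_nes = exclusive_regions(nes, nec, net)
only_net = exclusive_regions(net, nec, nes)
```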

Figure 3. Venn Diagram for the copy number variation regions (CNVR) present in at least 10% of animals from Nellore Control (NeC), Nellore Selection (NeS), and Nellore Traditional (NeT) selection lines.

Table 4. Description of the copy number variation regions (CNVR) commonly identified among the three selection lines.

Regarding the exclusive regions, 6 CNVRs were identified for NeC, 2 for NeS, and 4 for the NeT line. The 6 exclusive CNVRs in the NeC line comprised 3 loss type and 3 gain type CNVRs, distributed across 6 chromosomes, with an average length of 91,745 ± 119,203 bp. Three of these 6 CNVRs overlapped with 16 annotated genes (Table 5).

Table 5. Description of the copy number variation regions (CNVR) identified exclusively in the Nellore Control line.

In the case of the NeS line, there were two exclusive CNVRs, and both of these regions were classified as mixed type, indicating both deletions and duplications. These CNVRs were found on BTA12, with an average length of approximately 812,093 ± 147,962 bp. Notably, both CNVRs were identified within genomic regions in the reference genome assembly and overlapped with 8 genes, as shown in Table 6.

Table 6. Description of the copy number variation regions identified exclusively in the Nellore Selection line.

In the NeT line, there were two exclusive loss regions and two exclusive gain regions, distributed across four chromosomes (BTA1, BTA6, BTA17, BTA21). The average length of these exclusive CNVRs was approximately 233,107 ± 279,300 bp. Among these regions, three overlapped with 21 genes, as presented in Table 7.

Table 7. Description of the copy number variation regions (CNVR) identified exclusively in the Nellore Traditional line.

3.4 Gene ontology and QTL identification

The genes that overlapped with exclusive CNVRs from each selection line were included in the gene ontology (GO) analyses. While the functional analyses of genes conducted using the gprofiler2 package (Kolberg et al., 2020) did not yield significant results for the NeC and NeS cattle lines, a closer investigation of the biological processes associated with these genes revealed their involvement in specific biological pathways. These genes were involved in pathways such as thermogenesis (NeC), fatty acid metabolism (NeC), and protein digestion and absorption (NeS). For the NeT line, functional enrichment was observed in the cellular component category, specifically for the term GO:0016020 (integral component of membrane). Genes within the exclusive regions of NeT also contribute to various biological processes, including positive regulation of growth (GO:0045927), positive regulation of gene expression (GO:0010628), and insulin-like growth factor receptor signaling pathway (GO:0048009). Furthermore, these genes play important roles in metabolic pathways related to growth hormone synthesis and secretion. Within the exclusive CNVRs of each selection line, the numbers of previously reported QTL overlapping with the genomic regions identified for NeC, NeS, and NeT were 12, 27, and 146, respectively. Among these, 8 QTL previously associated with production traits (e.g., ADG) overlap with the NeC regions, 2 QTL associated with production traits (e.g., ADG and maturity rate) overlap with the NeS regions, and 2 QTL associated with production traits (body weight gain and metabolic body weight) overlap with the NeT regions (Supplementary Material S3).

4 Discussion

The Nellore experimental breeding program from IZ has gained national recognition and contributed substantially to the field of beef cattle breeding and genetics. The differential selection among the three selection lines has enabled in-depth studies of weight-related traits and feed efficiency, providing essential insights into the genetic information of livestock (e.g., Ayres et al., 2010; Cardoso et al., 2014; Cardoso et al., 2018).

The NeC line had the lowest average for W378 and W550, which was expected since this line is characterized by stabilizing selection with an average YW close to the weight at the start of the breeding program. The NeS line exhibited the highest mean for W378, which aligns with this line’s selection focus on increased post-weaning weight, highlighting the success of the breeding program in attaining its specific breeding objective. Considering the substantial difference in the average of W378 and W550 between lines, the three lines provide a great opportunity to identify genomic regions altered by selection. The NeC animals can be used as a reference point to compare the lines and understand the genetic progress achieved over time and the mechanisms involved in the phenotypic expression of the selected traits. NeC exhibited the lowest phenotypic average for RFI (more efficient), followed by NeS and NeT. However, it is important to highlight that the standard deviations (SD) were high for these averages, and these values are representative of only a small subset of Nellore animals, thus not accurately reflecting the population mean of each line.

4.1 Copy number variation and CNVR detection in Nellore cattle

Numerous studies have previously investigated the distribution and characterization of CNVs and CNVRs within the cattle genome (e.g., Fadista et al., 2010; Liu et al., 2010; Hou et al., 2012; Peripolli et al., 2023), each yielding diverse findings and insights about the presence and the function of these variants in the cattle genome. For instance, Silva et al. (2016) identified 68,007 CNVs and 7,319 CNVRs in a population of 1,509 Nellore animals. Additionally, Upadhyay et al. (2017) reported 9,944 CNVs and 923 CNVRs in 149 European cattle, while Lemos et al. (2018) identified 195,873 CNVs and 9,805 CNVRs in 3,794 Nellore animals. In a study of Holstein cattle, Butty et al. (2021) found 23,256 CNVs and 1,645 CNVRs. There is a notable discrepancy in the number of CNVs and CNVRs between these previously reported studies and our current findings. However, each study utilized different SNP panel densities, quality control thresholds, and sample sizes, which may have contributed to these differences (Fadista et al., 2010; Hou et al., 2012). Furthermore, the implementation of quality control measures, accounting for batch effects, addressing population stratification, managing experimental variations, and the robustness of statistical models can all impact the detection and accuracy of CNVs (Dellinger et al., 2010). Therefore, any comparisons between studies should be made cautiously, considering all the factors described above. The proportion of the genome covered by CNVRs (3.09%) falls within the range reported in the literature. Previous studies have reported values ranging from 0.68% to 13.0% in cattle populations (Fadista et al., 2010; Zhou et al., 2016; Lemos et al., 2018).

The distribution of CNVRs across chromosomes did not follow any clear pattern and BTA1 exhibited the highest number of CNVRs (n = 181), a trend also noted by Silva et al. (2016). Although no particular pattern or correlation was observed, this result may be associated with the fact that BTA1 is the largest chromosome in the cattle genome. Another interesting finding in the present study was the identification of a CNVR present in 90% of the individuals included in the study. This observation suggests the existence of a region that has remained conserved within this Nellore population over time, highlighting potential genetic stability or selection pressure within this genomic region. This might also reflect the fact that the reference genome used was based on a taurine (Bos taurus taurus) animal while Nellore is a different subspecies (Bos taurus indicus). This highlights the need to develop cattle pangenomes (e.g., Zhou et al., 2022).

This common region observed in 90% of the studied population is a gene-rich region containing 62 annotated genes. Several genes associated with male and female reproductive traits were identified, including THEG (Nayernia et al., 1999; Mannan et al., 2003), FGF22 (Castilho et al., 2017; 2019), KISS1R (D’Occhio et al., 2020; Singh et al., 2020), and ARID3A (Yang et al., 2018). Furthermore, genes linked to the immune system such as AZU1 (Xu et al., 2018; Verardo et al., 2021) and ELANE (Cassatella et al., 2019; Verardo et al., 2021) were also identified. The CFD gene was also previously associated with fat accumulation (Wang et al., 2023) and overlapped with the region cited above.

It is important to note that the present study utilized two genotyping panels of different densities for the CNV analyses, including one with 777,962 SNPs and one with 54,791 SNPs. Although 83% of the animals used in this study were genotyped with the HD SNP panel, the use of the 50K SNP panel may be considered a limitation of the study. Genotyping panels with higher density contain a greater number of genomic markers distributed throughout the genome, and generally enable more accurate detection of CNVs with higher genomic location resolution (Wang et al., 2007). This may explain why the number of CNVs and CNVRs found was higher for the HD SNP panel while their length was shorter as compared to the CNVs and CNVRs identified based on the 50K data. The use of a 50K SNP panel may impact CNV detection (e.g., longer CNVs may be incorrectly identified) and limit the ability to identify CNVs in genomic regions containing fewer SNPs after the quality control. Additionally, the number of animals genotyped with the HD SNP panel in this study is ∼5 times larger than the number of animals genotyped with the 50K SNP panel, which may also have contributed to the higher number of CNVs and CNVRs detected based on the HD SNP panel. In this study, no animals were genotyped with both SNP panels, which prevented a comparison of the results on an animal basis. Although out of the scope of the current study, future studies using genotyping platforms of different densities as well as molecular approaches for validating the identified CNVs are warranted. This will enable the evaluation of the impact of SNP density on CNV detection.

4.2 Copy number variation and CNVR detection by line

While previous studies have identified CNVs within and between cattle populations, our study is one of the first endeavors to investigate the population-genetic properties in three closed Nellore lines that were differentially selected for high post-weaning weight and RFI. Substantial differences in CNV counts were identified among the three lines studied. NeS and NeT exhibited a relatively high number of CNVs and CNVs per individual compared to NeC, along with a high chromosome coverage by CNVRs. The results in this study are based on a population of 928 animals with an uneven distribution among the lines. However, for the purpose of comparison and confirmation of the results, CNVs and CNVRs were also identified considering a reduced number of animals with an equal number of samples per line (n = 114). Remarkably, the results remained consistent with the same pattern (results not shown), where animals from the NeS and NeT lines exhibited a higher number of CNVs and CNVRs.

The results obtained align with previous expectations and are supported by the findings from Upadhyay et al. (2017), who reported that the population size, gene flow, and the selection process in a population can contribute to differential CNV abundance among populations. Selection for a specific trait can indeed lead to changes in allele frequencies within the population, resulting in alterations within the cattle genome and giving rise to significant phenotypic and genetic variability (Bickhart et al., 2016). Furthermore, the present findings are consistent with the results of Strillacci et al. (2018), who reported CNVs and CNVRs within the genome of Valdostana Red Pied cattle, an Italian dual-purpose cattle population that did not undergo strong artificial selection for production traits. Following the CNV identification, the authors conducted a comparative analysis of the CNVs detected in their study with those available from published research in the Italian Brown Swiss and Mexican Holstein populations (Strillacci et al., 2018). Their findings revealed the presence of unique and highly differentiated CNVs, leading to the conclusion that directional selection occurring within a population exerts a significant impact on the genome in terms of CNVs.

Despite differences in the numbers of CNVs identified, all three selection lines exhibited a higher frequency of duplications than deletions. This observation aligns with findings from previous studies, such as Laseca et al. (2022) in horses, Ladeira et al. (2022) in sheep, and Liu et al. (2010) in cattle. While there is no clear pattern of duplication and deletion distribution across the genome, duplications are more likely to occur in CNVs with greater lengths (Locke et al., 2006). Furthermore, according to Amos et al. (2003) and Conrad et al. (2006), deletion events may go unnoticed using SNP genotyping methods.

4.3 Gene annotation, gene ontology, and QTL identification

The deletion or duplication of genomic regions can have various consequences. The deletion of a genomic region that contains important genes can lead to loss of gene function, potentially being associated with diseases, genetic disorders, and reduced fitness (Stenson et al., 2017). However, the loss of genes may also be associated with adaptation (Sharma et al., 2018; Meredith et al., 2024). On the other hand, the duplication of gene-rich regions is typically linked to genetic diversity. Gene duplication is believed to play an important role in evolution and adaptation and may be involved in the development of new gene functions (Zhang, 2003; Magadum et al., 2013; Lallemand et al., 2020). We therefore identified the genes located in the regions exclusive to each selection line, which may help elucidate differences between the lines and the expression of traits under selection.

Gene ontology analysis is also an essential tool for elucidating the functional landscape of genetic elements, as it helps to comprehend and interpret the functions of genes. In the current study, no enrichment of biological processes was observed for the genes identified. This suggests that collectively, they do not participate in any similar biological process, potentially indicating a diverse array of gene functions. However, even though enriched processes were not identified, the genes individually participate in crucial biological processes and pathways. These findings suggest that while there may not be overall enriched processes, the individual genes within these regions may collectively contribute to the regulation of vital biological processes associated with growth and gene expression.

In the NeC line, CNVR4 is a gain region that harbors 11 genes and 12 QTL. Within this genomic region, the gene SMARCD3 stands out as it overlaps with 8 previously reported QTL that are related to ADG. The SMARCD3 gene encodes a subunit of the SWI/SNF family of proteins, which are known for their helicase and ATPase activities and their capacity to modulate the transcription of specific genes by modifying the chromatin structure surrounding those genes. ATPase is an enzyme that catalyzes the hydrolysis of ATP (adenosine triphosphate), releasing energy that is utilized in a variety of cellular processes, including ion transport, macromolecule synthesis, and muscular contraction (Rappas et al., 2004; Hargreaves and Spriet, 2020). Therefore, ATPase activity can influence energy metabolism and, consequently, the ADG and body weight gain of animals. The fact that SMARCD3 overlaps with 8 QTL related to ADG is a notable finding, suggesting a potential functional relationship between this gene and ADG and YW. This indicates that CNVR4 might be directly involved in the expression of the trait, potentially explaining some of the phenotypic differences observed between the NeC line and the NeS and NeT lines. Additionally, the SMARCD3 gene has been linked to biological processes related to muscle cell differentiation and thermogenesis pathways. Muscle cell differentiation is essential for the development of animal muscle tissue (Purslow, 2022), and the efficiency of the muscle cell differentiation process can affect the rate and magnitude of weight gain. Thermogenesis is also an important process that can impact animal weight as it is essential for maintaining body temperature and basal metabolism (Himms-Hagen, 1989; Cannon and Nedergaard, 2011). Considering that thermogenesis is linked to energy expenditure, it is plausible that it may also influence the ADG of animals, and consequently body weight at specific time points (e.g., YW).

Another important NeC region is CNVR5 on BTA8, which contains the UHRF2 gene. This gene encodes a nuclear protein involved in cell-cycle regulation (Lu and Hallstrom, 2013). The UHRF2 gene has been reported to be involved in the regulation of many biological processes, including metabolic pathways, growth, and reproduction (Magoro et al., 2022).

In the NeS line, CNVR10, located on BTA12, overlaps with the SLC15A1 gene. This gene encodes an intestinal hydrogen/peptide (H+/peptide) cotransporter and belongs to the solute carrier family 15. SLC15A1 plays a crucial role in the uptake and digestion of dietary proteins (Liang et al., 1995). Additionally, SLC15A1 has been associated with small intestine weight and embryo development in chickens (Zeng et al., 2011; Li et al., 2013) as well as with protein digestion and absorption pathways. Efficient digestion and absorption of proteins are essential to ensure that cattle receive the necessary nutrients and can affect the growth and weight gain of animals (Pierzynowski et al., 2006). Furthermore, a QTL related to ADG also overlapped with CNVR10. This evidence suggests that SLC15A1 may be important in regulating critical processes related to nutrient absorption, intestinal development, and overall growth in cattle. Another noteworthy point is that, although only one QTL related to ADG was identified in the NeS line, a total of 20 QTL related to milk production traits were identified in CNVR9 and CNVR10, and associations between milk production and YW have been previously reported (e.g., Lee and Pollak, 2002; Gershoni et al., 2021).

In the NeT line, several exclusive regions overlapping with important genes were identified. One of these regions, CNVR13, stood out as a gain-type CNVR located on BTA17. This region encompasses 10 genes, with particular emphasis on MAPK1, which encodes a member of the MAP kinase family. MAP kinases, also known as extracellular signal-regulated kinases, serve as a central hub for integrating multiple biochemical signals and play integral roles in a wide array of cellular processes, including proliferation, differentiation, transcription regulation, and development (Jiang et al., 2011; Liu et al., 2016). Moreover, previous studies have reported that the MAPK1 gene is linked to cell growth through phosphorylation and protein modification processes, which are required for muscle growth (Shin et al., 2014). Furthermore, the MAPK1 gene is associated with the insulin-like growth factor receptor signaling pathway and with growth hormone synthesis pathways. These processes play a pivotal role in the growth and development of cattle. Growth hormone and insulin-like growth factor are crucial for regulating energy metabolism, adipose tissue deposition, and muscle growth, ensuring adequate animal weight gains (Dichtel et al., 2022; Zhang et al., 2022). The MAPK1 gene is also associated with the biological process term GO:0010628, defined as positive regulation of gene expression. Another gene identified in this region is PPM1F. Although no significant results were found in the GO analyses, the PPM1F gene is annotated with a growth-related biological process term (GO:0045927, defined as positive regulation of growth).

Another important region identified for the NeT line is the CNVR12, located on BTA6, which overlaps with eight genes and QTL related to body weight gain, metabolic body weight, and carcass weight. The ACOX3 gene within this region has been associated with metabolic pathways related to fatty acid degradation and fatty acid metabolism. Fatty acid metabolism is directly linked to energy regulation, fat storage, and overall lipid metabolism. Efficient fatty acid degradation can contribute to energy release and the maintenance of adequate energy balance (Miyamoto et al., 2016), which is essential for controlling body weight and vital biological functions.
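
Statements such as "CNVR12 overlaps with eight genes and QTL" rest on a coordinate-overlap test between the CNVR and each annotated feature on the same chromosome. A minimal sketch of that test follows. The region names and coordinates are hypothetical, and the code is not the annotation tool used in the study; it only illustrates the interval logic.

```python
from typing import Dict, List, NamedTuple


class Region(NamedTuple):
    name: str
    chrom: str
    start: int  # inclusive
    end: int    # inclusive


def overlaps(a: Region, b: Region) -> bool:
    """True if the two regions share at least one base on the same chromosome."""
    return a.chrom == b.chrom and a.start <= b.end and b.start <= a.end


def annotate(cnvrs: List[Region], features: List[Region]) -> Dict[str, List[str]]:
    """Map each CNVR name to the names of the features it overlaps."""
    return {c.name: [f.name for f in features if overlaps(c, f)] for c in cnvrs}


# Hypothetical coordinates, for illustration only.
cnvrs = [
    Region("CNVR_A", "6", 1_000, 5_000),
    Region("CNVR_B", "17", 10_000, 20_000),
]
features = [
    Region("gene_1", "6", 4_500, 6_000),     # partially inside CNVR_A
    Region("qtl_1", "6", 5_001, 7_000),      # starts one base after CNVR_A ends
    Region("gene_2", "17", 12_000, 13_000),  # fully inside CNVR_B
    Region("gene_3", "12", 1_000, 5_000),    # different chromosome
]

hits = annotate(cnvrs, features)
print(hits)
```

For genome-scale annotation files, an indexed approach (sorted intervals or an interval tree) would be preferable to this all-against-all comparison.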

Considering the significant phenotypic differences observed in YW among the three selection lines, differences in the CNVs and CNVRs identified between the lines were expected. The discovery of unique regions containing distinct genes, biological processes, pathways, and QTL related to the traits is an important finding. This suggests that these exclusive CNVRs may influence the expression of phenotypes related to YW and feed efficiency and contribute to the phenotypic response to selection. However, the studied populations were selected for quantitative traits, which are influenced by many genes (and genomic regions). Therefore, there are likely many other genes and genomic structural variations not identified in this study that affect the phenotypic variability of the traits under selection.
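
One way to read "exclusive" here is that a CNVR from one line does not overlap any CNVR detected in the other two lines. The sketch below implements that reading on toy intervals. The definition is an assumption made for illustration (the study may have applied additional criteria), and all coordinates are hypothetical.

```python
from typing import Dict, List, Tuple

Interval = Tuple[str, int, int]  # (chromosome, start, end), inclusive coordinates


def overlaps(a: Interval, b: Interval) -> bool:
    """True if both intervals lie on the same chromosome and share a base."""
    return a[0] == b[0] and a[1] <= b[2] and b[1] <= a[2]


def exclusive_regions(by_line: Dict[str, List[Interval]]) -> Dict[str, List[Interval]]:
    """Keep, for each line, the regions that overlap no region of any other line."""
    exclusive = {}
    for line, regions in by_line.items():
        others = [r for other, rs in by_line.items() if other != line for r in rs]
        exclusive[line] = [
            r for r in regions if not any(overlaps(r, o) for o in others)
        ]
    return exclusive


# Hypothetical CNVRs: the chromosome 4 regions overlap across all three lines.
by_line = {
    "NeC": [("4", 100, 200), ("8", 500, 900)],
    "NeS": [("4", 150, 250), ("12", 10, 90)],
    "NeT": [("4", 180, 300), ("17", 40, 60)],
}

exclusive = exclusive_regions(by_line)
print(exclusive)
```

In this toy example the chromosome 4 region is shared by all lines, so each line retains only its second region as exclusive.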

5 Conclusion

We characterized the variability of CNVs and CNVRs within three Nellore lines differentially selected for YW and RFI. Through gene annotation and gene ontology analyses of the exclusive CNVRs identified in each line, specific genes and biological processes involved in the expression of growth and feed efficiency traits were found. These results not only show the structural differences present in the genomes of animals from the three selection lines but also indicate that these variations may account for a portion of the observed differences among them. These findings provide valuable insights for future research and breeding strategies to enhance these important traits in Nellore cattle populations.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: The data supporting this study’s findings belongs to an experimental animal breeding program and can be made available by contacting the corresponding author upon reasonable request and with permission of the breeding program. Requests to access these datasets should be directed to MM, mezmercadante@gmail.com.

Ethics statement

Ethical approval was not required for the study involving animals in accordance with the local legislation and institutional requirements because analyses were performed on pre-existing datasets.

Author contributions

LoB: Writing–original draft, Writing–review and editing, Conceptualization, Formal Analysis, Investigation, Methodology. LuB: Writing–review and editing, Supervision, Methodology, Conceptualization. RdB: Writing–review and editing. LdO: Writing–review and editing, Formal Analysis. HM: Writing–review and editing. LaB: Writing–review and editing. JC: Writing–review and editing. SB: Writing–review and editing. MM: Writing–review and editing, Supervision, Methodology, Conceptualization.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This study was financially supported by the São Paulo Research Foundation (FAPESP, 2017/10630-2 and 2017/50339-5) and Coordination for the Improvement of Higher Education Personnel (CAPES, Brasilia, DF, Brazil; Finance Code 001).

Acknowledgments

The authors thank Dr. Gabriel Soares Campos (Purdue University, West Lafayette, IN, United States) for technical assistance and the Institute of Animal Science (Sertaozinho, SP, Brazil) for providing the datasets used for the research.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2024.1377130/full#supplementary-material

References

Amos, C. I., Shete, S., Chen, J., and Yu, R. K. (2003). Positional identification of microdeletions with genetic markers. Hum. Hered. 56, 107–118. doi:10.1159/000073738

Ayres, D. R., Souza, F. R. P., Mercadante, M. E. Z., Fonseca, L. F. S., Tonhati, H., Cyrillo, J. N. S. G., et al. (2010). Evaluation of tfam and fabp4 gene polymorphisms in three lines of nellore cattle selected for growth. Genet. Mol. Res. 9, 2050–2059. doi:10.4238/vol9-4gmr850

Benfica, L. F., Brito, L. F., do Bem, R. D., Mulim, H. A., Glessner, J., Braga, L. G., et al. (2024). Genome-wide association study between copy number variation and feeding behavior, feed efficiency, and growth traits in Nellore cattle. BMC Genomics 25, 54. doi:10.1186/s12864-024-09976-8

Benfica, L. F., Sakamoto, L. S., Magalhães, A. F. B., De Oliveira, M. H. V., De Albuquerque, L. G., Cavalheiro, R., et al. (2020). Genetic association among feeding behavior, feed efficiency, and growth traits in growing indicine cattle. J. Anim. Sci. 98, skaa350. doi:10.1093/JAS/SKAA350

Bickhart, D. M., Xu, L., Hutchison, J. L., Cole, J. B., Null, D. J., Schroeder, S. G., et al. (2016). Diversity and population-genetic properties of copy number variations and multicopy genes in cattle. DNA Res. 23, 253–262. doi:10.1093/dnares/dsw013

Buffalo, V., and Coop, G. (2020). Estimating the genome-wide contribution of selection to temporal allele frequency change. Proc. Natl. Acad. Sci. U. S. A. 117, 20672–20680. doi:10.1073/pnas.1919039117

Butty, A. M., Chud, T. C. S., Cardoso, D. F., Lopes, L. S. F., Miglior, F., Schenkel, F. S., et al. (2021). Genome-wide association study between copy number variants and hoof health traits in Holstein dairy cattle. J. Dairy Sci. 104, 8050–8061. doi:10.3168/jds.2020-19879

Cannon, B., and Nedergaard, J. (2011). Nonshivering thermogenesis and its adequate measurement in metabolic studies. J. Exp. Biol. 214, 242–253. doi:10.1242/jeb.050989

Cardoso, D. F., De Albuquerque, L. G., Reimer, C., Qanbari, S., Erbe, M., Do Nascimento, A. V., et al. (2018). Genome-wide scan reveals population stratification and footprints of recent selection in Nelore cattle. Genet. Sel. Evol. 50, 22. doi:10.1186/s12711-018-0381-2

Cardoso, D. F., de Souza, F. R. P., de Camargo, G. M. F., Fonseca, P. D. da S., Fonseca, L. F. S., Braz, C. U., et al. (2014). Polymorphism analysis in genes of the somatotropic axis in Nellore cattle selected for growth. Gene 545, 215–219. doi:10.1016/j.gene.2014.05.033

Cassatella, M. A., Östberg, N. K., Tamassia, N., and Soehnlein, O. (2019). Biological roles of neutrophil-derived granule proteins and cytokines. Trends Immunol. 40, 648–664. doi:10.1016/j.it.2019.05.003

Castilho, A. C. S., Dalanezi, F. M., Franchi, F. F., Price, C. A., Ferreira, J. C. P., Trevisol, E., et al. (2019). Expression of fibroblast growth factor 22 (FGF22) and its receptor, FGFR1B, during development and regression of bovine corpus luteum. Theriogenology 125, 1–5. doi:10.1016/j.theriogenology.2018.09.024

Castilho, A. C. S., Price, C. A., Dalanezi, F., Ereno, R. L., Machado, M. F., Barros, C. M., et al. (2017). Evidence that fibroblast growth factor 10 plays a role in follicle selection in cattle. Reprod. Fertil. Dev. 29, 234–243. doi:10.1071/RD15017

Ceacero, T. M., Mercadante, M. E. Z., Cyrillo, J. N. D. S. G., Canesin, R. C., Bonilha, S. F. M., and De Albuquerque, L. G. (2016). Phenotypic and genetic correlations of feed efficiency traits with growth and carcass traits in nellore cattle selected for postweaning weight. PLoS One 11, e0161366. doi:10.1371/journal.pone.0161366

Conrad, D. F., Andrews, T. D., Carter, N. P., Hurles, M. E., and Pritchard, J. K. (2006). A high-resolution survey of deletion polymorphism in the human genome. Nat. Genet. 38, 75–81. doi:10.1038/ng1697

Das, D. N., Karuthadurai, T., and Gnanasekaran, S. (2021). “Genomic selection: a molecular tool for genetic improvement in livestock,” in Advances in animal genomics, 141–163. doi:10.1016/B978-0-12-820595-2.00010-2

Dellinger, A. E., Saw, S. M., Goh, L. K., Seielstad, M., Young, T. L., and Li, Y. J. (2010). Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays. Nucleic Acids Res. 38, e105. doi:10.1093/nar/gkq040

Dichtel, L. E., Cordoba-Chacon, J., and Kineman, R. D. (2022). Growth hormone and insulin-like growth factor 1 regulation of nonalcoholic fatty liver disease. J. Clin. Endocrinol. Metabolism 107, 1812–1824. doi:10.1210/clinem/dgac088

Diskin, S. J., Li, M., Hou, C., Yang, S., Glessner, J., Hakonarson, H., et al. (2008). Adjustment of genomic waves in signal intensities from whole-genome SNP genotyping platforms. Nucleic Acids Res. 36, e126. doi:10.1093/nar/gkn556

D’Occhio, M. J., Campanile, G., and Baruselli, P. S. (2020). Peripheral action of kisspeptin at reproductive tissues-role in ovarian function and embryo implantation and relevance to assisted reproductive technology in livestock: a review. Biol. Reprod. 103, 1157–1170. doi:10.1093/biolre/ioaa135

Fadista, J., Thomsen, B., Holm, L.-E., and Bendixen, C. (2010). Copy number variation in the bovine genome. BMC Genomics 11, 284. doi:10.1186/1471-2164-11-284

Fernandes Júnior, G. A., Peripolli, E., Schmidt, P. I., Campos, G. S., Mota, L. F. M., Mercadante, M. E. Z., et al. (2022). Current applications and perspectives of genomic selection in Bos indicus (Nellore) cattle. Livest. Sci. 263, 105001. doi:10.1016/j.livsci.2022.105001

Fonseca, P. A. S., Suárez-Vega, A., Marras, G., and Cánovas, Á. (2020). GALLO: an R package for genomic annotation and integration of multiple data sources in livestock for positional candidate loci. Gigascience 9, giaa149. doi:10.1093/gigascience/giaa149

Gershoni, M., Weller, J. I., and Ezra, E. (2021). Genetic and genome-wide association analysis of yearling weight gain in Israel holstein dairy calves. Genes (Basel) 12, 708. doi:10.3390/genes12050708

Hargreaves, M., and Spriet, L. L. (2020). Skeletal muscle energy metabolism during exercise. Nat. Metab. 2, 817–828. doi:10.1038/s42255-020-0251-4

Henrichsen, C. N., Vinckenbosch, N., Zöllner, S., Chaignat, E., Pradervand, S., Schütz, F., et al. (2009). Segmental copy number variation shapes tissue transcriptomes. Nat. Genet. 41, 424–429. doi:10.1038/ng.345

Himms-Hagen, J. (1989). Role of thermogenesis in the regulation of energy balance in relation to obesity.

Hou, Y., Bickhart, D. M., Chung, H., Hutchison, J. L., Norman, H. D., Connor, E. E., et al. (2012). Analysis of copy number variations in Holstein cows identify potential mechanisms contributing to differences in residual feed intake. Funct. Integr. Genomics 12, 717–723. doi:10.1007/s10142-012-0295-y

Jiang, Q., Ho, Y. Y., Hao, L., Berrios, C. N., and Chakravarti, A. (2011). Copy number variants in candidate genes are genetic modifiers of Hirschsprung disease. PLoS One 6, e21219. doi:10.1371/journal.pone.0021219

Koch, R. M., Swiger, L. A., Chambers, D., and Gregory, K. E. (1963). Efficiency of feed use in beef cattle. J. Anim. Sci. 22, 486–494. doi:10.2527/jas1963.222486x

Kolberg, L., Raudvere, U., Kuzmin, I., Vilo, J., and Peterson, H. (2020). gprofiler2 -- an R package for gene list functional enrichment analysis and namespace conversion toolset g:Profiler. F1000Res 9, 709. doi:10.12688/f1000research.24956.1

Ladeira, G. C., Pilonetto, F., Fernandes, A. C., Bóscollo, P. P., Dauria, B. D., Titto, C. G., et al. (2022). CNV detection and their association with growth, efficiency and carcass traits in Santa Inês sheep. J. Animal Breed. Genet. 139, 476–487. doi:10.1111/jbg.12671

Lallemand, T., Leduc, M., Landès, C., Rizzon, C., and Lerat, E. (2020). An overview of duplicated gene detection methods: why the duplication mechanism has to be accounted for in their choice. Genes (Basel) 11, 1046–1140. doi:10.3390/genes11091046

Laseca, N., Molina, A., Valera, M., Antonini, A., and Demyda-Peyrás, S. (2022). Copy number variation (CNV): a new genomic insight in horses. Animals 12, 1435. doi:10.3390/ani12111435

Lee, C., and Pollak, E. J. (2002). Genetic antagonism between body weight and milk production in beef cattle. J. Anim. Sci. 80, 316–321. doi:10.2527/2002.802316x

Lemos, M. V., Berton, M. P., Ferreira de Camargo, G. M., Peripolli, E., de Oliveira Silva, R. M., Ferreira Olivieri, B., et al. (2018). Copy number variation regions in Nellore cattle: evidences of environment adaptation. Livest. Sci. 207, 51–58. doi:10.1016/j.livsci.2017.11.008

Li, X. G., Chen, X.-L., and Wang, X.-Q. (2013). Changes in relative organ weights and intestinal transporter gene expression in embryos from white Plymouth Rock and WENS Yellow Feather Chickens. Comp. Biochem. Physiology - A Mol. Integr. Physiology 164, 368–375. doi:10.1016/j.cbpa.2012.11.016

Liang, R., Fei, Y. J., Prasad, P. D., Ramamoorthy, S., Han, H., Yang-Feng, T. L., et al. (1995). Human intestinal H+/peptide cotransporter. Cloning, functional expression, and chromosomal localization. J. Biol. Chem. 270, 6456–6463. doi:10.1074/jbc.270.12.6456

Liu, G. E., Hou, Y., Zhu, B., Cardone, M. F., Jiang, L., Cellamare, A., et al. (2010). Analysis of copy number variations among diverse cattle breeds. Genome Res. 20, 693–703. doi:10.1101/gr.105403.110

Liu, M., Li, B., Huang, Y., Yang, M., Lan, X., Lei, C., et al. (2016). Copy number variation of bovine MAPK10 modulates the transcriptional activity and affects growth traits. Livest. Sci. 194, 44–50. doi:10.1016/j.livsci.2016.09.014

Locke, D. P., Sharp, A. J., McCarroll, S. A., McGrath, S. D., Newman, T. L., Cheng, Z., et al. (2006). Linkage disequilibrium and heritability of copy-number polymorphisms within duplicated regions of the human genome. Am. J. Hum. Genet. 79, 275–290. doi:10.1086/505653

Lu, H., and Hallstrom, T. C. (2013). The nuclear protein UHRF2 is a direct target of the transcription factor E2F1 in the induction of apoptosis. J. Biol. Chem. 288, 23833–23843. doi:10.1074/jbc.M112.447276

Ma, Q., Liu, X., Pan, J., Ma, L., Ma, Y., and He, X. (2017). Genome-wide detection of copy number variation in Chinese indigenous sheep using an ovine high-density 600 K SNP array. Sci. Rep. 7, 912. doi:10.1038/s41598-017-00847-9

Magadum, S., Banerjee, U., Murugan, P., Gangapur, D., and Ravikesavan, R. (2013). Gene duplication as a major force in evolution. J. Genet. 92, 155–161. doi:10.1007/s12041-013-0212-8

Magoro, A. M., Mtileni, B., Hadebe, K., and Zwane, A. (2022). Assessment of genetic diversity and conservation in South African indigenous goat ecotypes: a review. Animals 12, 3353. doi:10.3390/ani12233353

Mannan, A. U., Nayernia, K., Mueller, C., Burfeind, P., Adham, I. M., and Enge, W. (2003). Male mice lacking the Theg (testicular haploid expressed gene) protein undergo normal spermatogenesis and are fertile. Biol. Reprod. 69, 788–796. doi:10.1095/biolreprod.103.017400

Mercadante, M. E. Z., Packer, I. U., Razook, A. G., Cyrillo, J. N. S. G., and Figueiredo, L. A. (2003). Direct and correlated responses to selection for yearling weight on reproductive performance of Nelore cows. J. Anim. Sci. 81, 376–384. doi:10.2527/2003.812376x

Meredith, R. W., Zhang, G., Thomas, M., Gilbert, P., Jarvis, E. D., and Springer, M. S. (2024). Evidence for a single loss of mineralized teeth in the common avian ancestor.

Miyamoto, J., Hasegawa, S., Kasubuchi, M., Ichimura, A., Nakajima, A., and Kimura, I. (2016). Nutritional signaling via free fatty acid receptors. Int. J. Mol. Sci. 17, 450. doi:10.3390/ijms17040450

Monteiro, F. M., Mercadante, M. E. Z., Barros, C. M., Satrapa, R. A., Silva, J. A. V., Oliveira, L. Z., et al. (2013). Reproductive tract development and puberty in two lines of Nellore heifers selected for postweaning weight. Theriogenology 80, 10–17. doi:10.1016/j.theriogenology.2013.02.013

Nayernia, K., Von Mering, M. H. P., Kraszucka, K., Burfeind, P., Wehrend, A., Kö, M., et al. (1999). A novel testicular haploid expressed gene (THEG) involved in mouse spermatid-sertoli cell interaction. Biol. Reprod. 60, 1488–1495. doi:10.1095/biolreprod60.6.1488

Peripolli, E., Stafuzza, N. B., Machado, M. A., do Carmo Panetto, J. C., do Egito, A. A., Baldi, F., et al. (2023). Assessment of copy number variants in three Brazilian locally adapted cattle breeds using whole-genome re-sequencing data. Anim. Genet. 54, 254–270. doi:10.1111/age.13298

Pierzynowski, S. G., Kruszewska, D., and Weström, B. W. (2006). Chapter 3 the quality of dietary protein digestion affects animal performance and regulates gut bacteria growth: hypotheses and facts. Biol. Grow. Animals 4, 65–79. doi:10.1016/S1877-1823(09)70090-6

Purslow, P. P. (2022). “The structure and growth of muscle,” in Lawrie’s meat science (Elsevier), 51–103. doi:10.1016/B978-0-323-85408-5.00004-2

Quinlan, A. R., and Hall, I. M. (2010). BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842. doi:10.1093/bioinformatics/btq033

Rappas, M., Niwa, H., and Zhang, X. (2004). Mechanisms of ATPases--a multi-disciplinary approach. Curr. Protein Pept. Sci. 5, 89–105. doi:10.2174/1389203043486874

Redon, R., Ishikawa, S., Fitch, K. R., Feuk, L., Perry, G. H., Andrews, T. D., et al. (2006). Global variation in copy number in the human genome. Nature 444, 444–454. doi:10.1038/nature05329

Rosen, B. D., Bickhart, D. M., Schnabel, R. D., Koren, S., Elsik, C. G., Tseng, E., et al. (2020). De novo assembly of the cattle reference genome with single-molecule sequencing. Gigascience 9, giaa021. doi:10.1093/gigascience/giaa021

Seol, D., Ko, B. J., Kim, B., Chai, H., Lim, D., and Kim, H. (2019). Identification of Copy Number Variation in domestic chicken using whole-genome sequencing reveals evidence of selection in the genome. Animals. 9, 809. doi:10.3390/ani9100809

Sharma, V., Hecker, N., Roscito, J. G., Foerster, L., Langer, B. E., and Hiller, M. (2018). A genomics approach reveals insights into the importance of gene losses for mammalian adaptations. Nat. Commun. 9, 1215. doi:10.1038/s41467-018-03667-1

Shi, H., Li, T., Su, M., Wang, H., Li, Q., Lang, X., et al. (2023). Identification of copy number variation in Tibetan sheep using whole genome resequencing reveals evidence of genomic selection. BMC Genomics 24, 555. doi:10.1186/s12864-023-09672-z

Shin, D. H., Lee, H. J., Cho, S., Kim, H. J., Hwang, J. Y., Lee, C. K., et al. (2014). Deleted copy number variation of Hanwoo and Holstein using next generation sequencing at the population level. BMC Genomics 15, 240. doi:10.1186/1471-2164-15-240

Silva, J. M., Giachetto, P. F., da Silva, L. O., Cintra, L. C., Paiva, S. R., Yamagishi, M. E. B., et al. (2016). Genome-wide copy number variation (CNV) detection in Nelore cattle reveals highly frequent variants in genome regions harboring QTLs affecting production traits. BMC Genomics 17, 454. doi:10.1186/s12864-016-2752-9

Singh, U., Alex, R., Kumar, S., Deb, R., Venkatesan Raja, T., Singhal, S., et al. (2020). Association of bovine KISS1 single nucleotide polymorphisms with reproductive traits in Indian Cattle. Reproduction Domest. Animals 55, 922–930. doi:10.1111/rda.13704

Souza, F. R. P., Mercadante, M. E. Z., Fonseca, L. F. S., Ferreira, L. M. S., Regatieri, I. C., Ayres, H., et al. (2011). Assessment of DGAT1 and LEP gene polymorphisms in three Nelore (Bos indicus) lines selected for growth and their relationship with growth and carcass traits. J. Anim. Sci. 88, 435–441. doi:10.2527/jas.2009-2174

Stafuzza, N. B., Silva, R. M. D. O., Fragomeni, B. D. O., Masuda, Y., Huang, Y., Gray, K., et al. (2019). A genome-wide single nucleotide polymorphism and copy number variation analysis for number of piglets born alive. BMC Genomics 20, 321. doi:10.1186/s12864-019-5687-0

Stenson, P. D., Mort, M., Ball, E. V., Evans, K., Hayden, M., Heywood, S., et al. (2017). The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies. Hum. Genet. 136, 665–677. doi:10.1007/s00439-017-1779-6

Strillacci, M. G., Gorla, E., Cozzi, M. C., Vevey, M., Genova, F., Scienski, K., et al. (2018). A copy number variant scan in the autochthonous Valdostana Red Pied cattle breed and comparison with specialized dairy populations. PLoS One 13, e0204669. doi:10.1371/journal.pone.0204669

Upadhyay, M., da Silva, V. H., Megens, H. J., Visker, M. H. P. W., Ajmone-Marsan, P., Bâlteanu, V. A., et al. (2017). Distribution and functionality of copy number variation across European cattle populations. Front. Genet. 8, 108. doi:10.3389/fgene.2017.00108

United States Department of Agriculture (2023). Livestock and Products Annual. Available at: https://apps.fas.usda.gov/newgainapi/api/Report/DownloadReportByFileName?fileName=Livestock%20and%20Products%20Annual_Brasilia_Brazil_BR2023-0017.pdf (Accessed November 30, 2023).

Verardo, L. L., e Silva, F. F., Machado, M. A., do Carmo Panetto, J. C., de Lima Reis Faza, D. R., Otto, P. I., et al. (2021). Genome-wide analyses reveal the genetic architecture and candidate genes of indicine, taurine, synthetic crossbreds, and locally adapted cattle in Brazil. Front. Genet. 12, 702822. doi:10.3389/fgene.2021.702822

Wang, K., Li, M., Hadley, D., Liu, R., Glessner, J., Grant, S. F. A., et al. (2007). PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res. 17, 1665–1674. doi:10.1101/gr.6861907

Wang, L., Gao, P., Li, C., Liu, Q., Yao, Z., Li, Y., et al. (2023). A single-cell atlas of bovine skeletal muscle reveals mechanisms regulating intramuscular adipogenesis and fibrogenesis. J. Cachexia Sarcopenia Muscle 14, 2152–2167. doi:10.1002/jcsm.13292

Xu, L., Zhang, W. G., Shen, H. X., Zhang, Y., Zhao, Y. M., Jia, Y. T., et al. (2018). Genome-wide scanning reveals genetic diversity and signatures of selection in Chinese indigenous cattle breeds. Livest. Sci. 216, 100–108. doi:10.1016/j.livsci.2018.08.005

Yan, Y., Yang, N., Cheng, H. H., Song, J., and Qu, L. (2015). Genome-wide identification of copy number variations between two chicken lines that differ in genetic resistance to Marek’s disease. BMC Genomics 16, 843. doi:10.1186/s12864-015-2080-5

Yang, C., Wang, J., Liu, J., Sun, Y., Guo, Y., Jiang, Q., et al. (2018). Functional haplotypes of ARID4A affect promoter activity and semen quality of bulls. Anim. Reprod. Sci. 197, 257–267. doi:10.1016/j.anireprosci.2018.08.038

Yang, L., Xu, L., Zhu, B., Niu, H., Zhang, W., Miao, J., et al. (2017). Genome-wide analysis reveals differential selection involved with copy number variation in diverse Chinese Cattle. Sci. Rep. 7, 14299. doi:10.1038/s41598-017-14768-0

Zeng, P. L., Li, X. G., Wang, X. Q., Zhang, D. X., Shu, G., and Luo, Q. B. (2011). The relationship between gene expression of cationic and neutral amino acid transporters in the small intestine of chick embryos and chick breed, development, sex, and egg amino acid concentration. Poult. Sci. 90, 2548–2556. doi:10.3382/ps.2011-01458

Zhang, D., Wei, Y., Huang, Q., Chen, Y., Zeng, K., Yang, W., et al. (2022). Important hormones regulating lipid metabolism. Molecules 27, 7052. doi:10.3390/molecules27207052

Zhang, F., Gu, W., Hurles, M. E., and Lupski, J. R. (2009). Copy number variation in human health, disease, and evolution. Annu. Rev. Genomics Hum. Genet. 10, 451–481. doi:10.1146/annurev.genom.9.081307.164217

Zhang, J. (2003). Evolution by gene duplication: an update. Trends Ecol. Evol. 18, 292–298. doi:10.1016/S0169-5347(03)00033-8

Zhang, Y., Hu, Y., Wang, X., Jiang, Q., Zhao, H., Wang, J., et al. (2020). Population structure, and selection signatures underlying high-altitude adaptation inferred from genome-wide copy number variations in Chinese indigenous cattle. Front. Genet. 10, 1404. doi:10.3389/fgene.2019.01404

Zhou, Y., Utsunomiya, Y. T., Xu, L., Hay, E. H. abdel, Bickhart, D. M., Alexandre, P. A., et al. (2016). Genome-wide CNV analysis reveals variants associated with growth traits in Bos indicus. BMC Genomics 17, 419. doi:10.1186/s12864-016-2461-4

Zhou, Y., Yang, L., Han, X., Han, J., Hu, Y., Li, F., et al. (2022). Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history. Genome Res. 32, 1585–1601. doi:10.1101/gr.276550.122

Keywords: beef cattle, copy number variation, gene annotation, Nellore, residual feed intake, SNP panel

Citation: Benfica LF, Brito LF, do Bem RD, de Oliveira LF, Mulim HA, Braga LG, Cyrillo JNSG, Bonilha SFM and Mercadante MEZ (2024) Detection and characterization of copy number variation in three differentially-selected Nellore cattle populations. Front. Genet. 15:1377130. doi: 10.3389/fgene.2024.1377130

Received: 26 January 2024; Accepted: 05 April 2024;
Published: 17 April 2024.

Edited by:

Juan José Arranz, University of León, Spain

Reviewed by:

Pablo A. S. Fonseca, University of León, Spain
Tara G. McDaneld, Agricultural Research Service (USDA), United States

Copyright © 2024 Benfica, Brito, do Bem, de Oliveira, Mulim, Braga, Cyrillo, Bonilha and Mercadante. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Lorena F. Benfica, lorenafbenfica@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.