Skip to main content

ORIGINAL RESEARCH article

Front. Cell Dev. Biol. , 20 February 2025

Sec. Epigenomics and Epigenetics

Volume 13 - 2025 | https://doi.org/10.3389/fcell.2025.1532711

This article is part of the Research Topic Sperm Epigenetic Code: Implications in Reproductive Health and Paternal Contribution to Embryo Development View all 6 articles

Defining bovine CpG epigenetic diversity by analyzing RRBS data from sperm of Montbéliarde and Holstein bulls

  • 1Institute of Agricultural Biology and Biotechnology, National Research Council IBBA CNR, Lodi, Italy
  • 2Department of Animal Science, Food and Nutrition – DIANA, and Romea and Enrica Invernizzi Research Center on Sustainable Dairy Production - CREI, Università Cattolica del Sacro Cuore, Piacenza, Italy

Introduction: Breed epigenetic diversity was recently detected in pig muscle and cattle blood, probably as a result of long-term selection for morphological adaptive and quantitative traits, persisting after embryo epigenetic reprogramming.

Methods: In our study, breed epigenetic diversity in the male germline from Holstein (H) and Montbéliarde (M) bulls was investigated using Reduced Representation Bisulfite Sequencing (RRBS) data publicly available at the NCBI database. Open-source Whole Genome Sequencing (WGS) data from H and M animals were used to estimate genetic diversity between the two breeds and, thus, correctly assess CpG positions with low frequencies or absence of SNPs.

Results: Sperm epigenetic diversity was studied in 356,635 SNP-free CpG positions, and a total of 6,074 differentially methylated cytosines (DMCs) were identified. The analysis of the DMCs pattern of distribution revealed that DMCs: i) are partially associated with genetic variation, ii) are consistent with epigenetic diversity previously observed in bovine blood, iii) present long-CpG stretches in specific genomic regions, and iv) are enriched in specific repeat elements, such as ERV-LTR transposable elements, ribosomal 5S rRNA, BTSAT4 Satellites and long interspersed nuclear elements (LINE).

Discussion: This study, based on publicly available data from two cattle breeds, contributes to the identification and definition of distinct epigenetic signatures in sperm, that may have potential implications for mammalian embryo development.

Introduction

Epigenome variation in livestock has been explored for its influence on production (e.g., milk production), disease traits, reproduction, and environmental adaptation (Schenkel, 2021). Studies in insects, fishes and birds have already reported how different epigenetic mechanisms can contribute to phenotypic variability in response to environmental adaptation (Alvarado et al., 2015; Fanti et al., 2017; Gore et al., 2018; Heckwolf et al., 2020; Lindner et al., 2021). Epigenetic inheritance is only minimally documented in mammals, with few well-characterized examples. One classic case is the agouti locus, where the CpG methylation level in repetitive elements of the Class II endogenous retrovirus (ERV) family in the intracisternal A-particle (IAP) insertions, upstream of the agouti gene, is epigenetically inherited and influences gene expression and mouse coat color (Morgan et al., 1999). However, the driving mechanisms governing epigenetic inheritance in mammals remain poorly understood due to the extensive genome-wide epigenetic reprogramming that takes place during mammalian development (Zeng and Chen, 2019). In the preimplantation embryos and in the germline a global epigenetic reprogramming occurs, giving rise to changes in DNA methylation. During fertilization, parental epigenomes undergo a remarkable demethylation in the zygote, followed by remethylation during the subsequent cell division, necessary to differentiate and direct cells toward their future lineages and ensure that cell type specification is a one-way street. However, the same epigenetic somatic signatures need to be erased in the primordial germ cells (PGCs) by a comprehensive reprogramming process, able to redirect DNA methylation for the establishment of sex-specific and germ cell-specific epigenetic signatures through a specialized processes of meiotic maturation and fertilization (Messerschmidt et al., 2014).

The genome-wide methylation reprogramming driven by demethylation that is observed in mammalian embryos in early preimplantation development, is conserved in many species (Dean et al., 2001). Remethylation typically occurs from the eight-cell stage, but with different rates in different mammalian species. In cattle, a considerable de novo methylation is observed in embryos between the eight-cell and the 16-cell steps (Dean et al., 2001; Triantaphyllopoulos et al., 2016). Nonetheless, specific genomic regions resist the global demethylation and are able to escape from global epigenetic reprogramming (Jiang et al., 2018).

It was reported that muscle CpG methylation shows important differences across breeds in many genes, probably as a result of long-term selection for quantitative traits (Ponsuksili et al., 2019). Breed epigenetic diversity could be explained by genetic variation as a consequence of the selection process. However, we recently evaluated inter-species epigenetic diversity in Bos taurus and Bos indicus providing results that support the assumption that epigenetic diversity is partially independent from genotype and may potentially impact on anatomical morphogenesis and breed traits (Capra et al., 2023). This assumption requires that the epigenetic signature is conserved in various tissues including the male and female germlines, to be transferred through generations. The abovementioned studies reported breed epigenetic differences in two somatic tissues: muscle (Ponsuksili et al., 2019) and blood (Capra et al., 2023). We decided to investigate breed epigenetic diversity in the male germline by evaluating differentially methylated CpGs in two different cattle breeds by using publicly available Reduced Representation Bisulfite Sequencing (RRBS) data collected from sperm of Montbéliarde and Holstein bulls (Costes et al., 2022; Johnson et al., 2022). Following this approach, we were able to identify epigenomic elements characterizing bovine inter-breed epigenetic diversity.

Materials and methods

NCBI bioproject and biosamples selection

This study utilized publicly available data for assessing sperm epigenetic and genetic variation in different cattle breeds. Fastq sequences were selected from the NCBI Bioproject database and retrieved from NCBI SRA. To ensure that comparisons between breeds were based on a substantial number of animals having an adequate sequencing coverage (granting to have a high number of cytosines with at least 10X coverage), results were filtered taking into account the following criteria: i) library preparation method (RRBS) ii) high number of available samples (n > 8), iii) sequencing coverage (RRBS>20 million reads). Two projects met the above-mentioned criteria and were deemed comparable: PRJEB46371 for Montbéliarde breeds (Costes et al., 2022), and bioprojects PRJEB35854 for Holstein (Johnson et al., 2022). Bioproject PRJEB35854 contains a total of 18 Biosamples from 9 Holstein bulls collected at two different timepoints (55-59 and 69–71 weeks). Bioproject PRJEB46371 contains a total of 120 Biosamples from 120 Montbéliarde bulls collected at different ages. Nine samples from the Holstein breed (69–71 weeks) and other nine randomly chosen samples with comparable age (74–82 weeks) for the Montbéliarde breed were selected for this study (Supplementary Table S1). The chosen age-window of 69–82 weeks (approximately 16–19 months) aligns well with the period of full sexual maturity for both breeds (Barth and Waldner, 2002; Rawlings et al., 2008). Finally, to assess genetic variation in the two breeds, NCBI publicly available WGS data form Holstein and Montbéliarde (Bioproject PRJEB9343) were used (Boussaha et al., 2015). Finally, another RRBS dataset (Bioproject PRJNA433629) with data from cells isolated in different stages of cattle embryogenesis: 2-Cell (2C), Compact Morula (CM), Blastocyst (B) and Spermatozoa (S), from Jiang et al., 2018, was included in this study to evaluate if breed specific CpG methylation variation can potentially escape from epigenetic reprogramming.

Bioinformatic analysis

Public RRBS raw fastq sequences were downloaded from the NCBI SRA database and analysed via nf-core (Garrison and Marth, 2012), using the nf-core/methylseq v1.5 pipeline (doi: 10.5281/zenodo.2555454). Bismark v0.22.3 (https://www.bioinformatics.babraham.ac.uk/projects/bismark/) was used as aligner. FastQC v0.11.9 (http://www.bioinfor.matics.babraham.ac.uk/projects/fastqc/). is included in the pipeline for raw data quality control, Trim Galore v0.6.4 (http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) performs adapter sequence trimming, and Qualimap v2.2.2 (Ewels et al., 2020), is used for alignment quality control. As reference genome, B. taurus ARS-UCD1.2 (GCF_002263795.1) was chosen, with the addition of chromosome Y from Btau_5.0.1 (NC_016145.1). Methylation calls were extracted with Bismark methylation extractor v0.22.3.

WGS raw fastq sequences were also downloaded from the NCBI SRA database and analyzed with a custom pipeline built on the nf-core infrastructure. As for RRBS, the pipeline includes FastQC and TrimGalore, sequences were then aligned to the same reference genome with BWA-MEM v0.7.17, and SNP detection was performed with FreeBayes v1.3.6. Accessory steps were performed with bcftools v1.16, Picard v2. 27.4, samtools v1.16.1, and Tabix v1.12. For the variance partition analysis, the cytosines of interest were extracted from the whole dataset in pileup format using samtools pileup, and analyzed with the ‘variancePartition’ R package (http://bioconductor.org/packages/release/bioc/html/variancePartition.html).

In order to estimate breed genetic diversity, any SNP type with an allele frequency of more than 10% was considered. The Seqmonk software v1.47.1 (https://github.com/s-andrews/SeqMonk), was chosen for visualization and analysis of the Bismark output. To identify cytosines suitable for differential methylation analysis discarding polymorphic positions between the two breeds, cytosines matching SNPs in either species were removed from the analysis. Differentially methylated cytosines (DMCs) between the two breeds were detected among cytosines with at least 10X coverage in all nine animals, using logistic regression (False Discovery Rate (FDR) ≤ 0.05). Random subsets from the identified R-Cs depurated from DMCs were created using the “sample_n” function in R, t-test analysis was used to evaluate the hypothesis that elements characterizing bovine interbreed epigenetic diversity are not exclusively due to chance (p-value <0.05 was considered significant). Gene ontology (GO) classification was performed with the Cytoscape plug-in ClueGO (Bindea et al., 2009), which integrates GO and enhances biological interpretation of large lists of genes.

Results

Study workflow

This study utilized public RRBS raw fastq sequences from Bioprojects PRJEB35854 and PRJEB46371 to assess sperm epigenetic diversity between H and M breeds. To distinguish with confidence between genomic thymines and unmethylated cytosines (which are read as thymines in bisulfite sequencing), we also included the Bioproject PRJEB9343 dataset containing whole genome sequencing (WGS) raw fastq sequences from animals from same breeds (Boussaha et al., 2015). This dataset allowed also the identification of genetic variation between the two breeds, which was superposed to the methylated cytosines dataset to remove positions overlapping SNPs. The remaining CpGs were used to find breed specific differentially methylated cytosines (DMCs). In order to thoroughly investigate breed epigenetic diversity, we generated 10 sets of randomly selected (excluding DMCs) control cytosines (r-Cs), which were used to validate the estimation of the occurrence of epigenetic variation in proximity of genomic positions associated to animal genetic variation, cattle subspecies blood epigenetic diversity, and repetitive elements. Figure 1 displays the study workflow.

Figure 1
www.frontiersin.org

Figure 1. Study Workflow. Datasets available at the NCBI database are PRJEB35854 and PRJEB46371 to assess CpG methylation and PRJEB9343 for SNP identification. The differentially Methylated Cytosines dataset (DMCs) and 10 random cytosine r-Cs subsets, each containing the same number of cytosines as the DMCs dataset, were compared to other previously defined datasets to identify specific features characterizing H and M breed epigenetic diversity.

Global mapping of DNA methylation and principal component analysis (PCA) (is breed epigenetic diversity observable in sperm?)

Epigenetic variation in bull sperm from two different breeds, Holstein (H) and Montbéliarde (M), was observed by analyzing NCBI datasets from 9 animals of each breed (Bioprojects; PRJEB35854 and PRJEB46371) (Costes et al., 2022; Johnson et al., 2022).

RRBS reads were mapped to the B. taurus genome (ARS-UCD1.2, plus chromosome Y from Btau_5.0.1). In RRBS analysis, a cutoff of 10X coverage was imposed. The average number of reads per sample was 32.2 M (range: 25.5M– 40.7 M) (Supplementary Table S1). Within the CpG enriched regions represented in RRBS, 30.2% were methylated. The dataset counts 5,182,326 cytosines within the CpG context with 10X coverage per position in at least one sample. To be sure to focus our attention on cytosines well represented in all the animals under investigation, we extracted from this dataset 661,206 CpGs with at least 10X coverage in each of the aligned samples.

Due to bisulfite treatment, differential CpG methylation calling may be affected by sequence polymorphism with respect to the genotype of the reference sequence used in the alignment procedures of both the RRBS and WGS pipelines. Unfortunately, no public WGS sequence of the same animals selected for RRBS analysis was available, thus, we analyzed a subset of NCBI publicly available WGS data from different animals of the same breeds (M and H, Bioproject PRJEB9343) (Boussaha et al., 2015) (Supplementary Table S2), to detect breed polymorphisms and clean CpG positions identified by RRBS analysis, retaining exclusively those positions not overlapping polymorphisms observed in either the M and H breeds. At the end of this procedure, a total of 356,635 CpG positions were selected, representing a clean and reliable dataset to be used as input for all the subsequent analyses on methylation distribution. A principal component analysis (PCA) of the samples according to the CpG methylation level of the selected 356,635 cytosines well separates sperm from H and M animals along PC2 (Figure 2).

Figure 2
www.frontiersin.org

Figure 2. PCA of bull sperm CpG methylation considering the selected 356,635 cytosines. In blue Holstein (n = 9), in red Montbéliarde (n = 9).

Differentially methylated cytosines (DMCs) and genetic variation (are genetics and epigenetics overlapping but distinguishable?)

A total of 6,074 DMCs were identified between the H and M breeds (Supplementary Table S3). These DMCs are differently distributed along chromosomes (Supplementary Figure S1). Interestingly, the Y chromosome shows the highest number of DMCs, whereas on the X chromosome only a few regions exhibit changes in DNA methylation between two breeds.

DMCs distribution was compared with that of 10 control subsets of random Cytosines (r-Cs, n = 6,074 for each of the 10 datasets, which do not include DMCs) to highlight possible patterns of distribution of differential methylation among breeds not occurring by chance. Because epigenetic and genetic diversity are partially correlated, we tested if DMCs have a preferential distribution close to SNP positions when compared to r-Cs. H and M genetic diversity was assessed from WGS analysis (Bioproject PRJEB9343) (Boussaha et al., 2015), which identified a total of 925,801 SNPs along the reference genome. To validate the use of the 10 r-Cs subsets in our analyses, we initially tested whether they were representative of the genome-wide distribution of r-Cs. The incidence of SNPs was calculated for the whole 350,561 r-Cs covered by the RRBS analysis (i.e., the 356,635 CpG positions selected after removal of the positions matching breed polymorphisms minus the 6,074 DMCs), showing consistency with the percentage found in the 10 random subsets (Supplementary Table S4). By comparing distances of DMCs and of the 10 r-Cs subsets from SNP positions and determining their statistical significance by running a t-test analysis, we observed that the former fall near SNPs with a higher frequency with respect to the latter (which represent the distribution expected by chance). To represent this situation, we collected DMCs and r-Cs in four groups according to their genomic distance from breed SNPs (up to 2 bps, up to 10 bp, up to 100 bps, and up to 2 kbs from a SNPs position; Figure 3; Supplementary Table S4). In each of the considered intervals, we observe a significantly higher proportion of DMCs rather than r-Cs that fall close to SNPs. If we consider the 100 bps interval, the proportion of cytosines close to SNPs is 8.78% DMCs vs. 1.50% r-Cs, while this difference lowers in the 2 kbs interval, when drawing away from SNPs. This distribution suggests that many epigenetic differences could be driven by genetic variation and vice versa.

Figure 3
www.frontiersin.org

Figure 3. Percentage of differentially methylated cytosines (DMCs, blue) and random cytosines (r-Cs, orange) near SNPs between H and M breeds in the four considered intervals (2 bps, 10 bps, 100 bps, and 2 kbs from a SNPs position).

A functional analysis of breed epigenetic diversity was run on a dataset comprising 533 DMCs representing positions where breeds exhibit their most evident differences. The dataset includes only DMCs showing the highest methylation variation (more than 20%) between H and M, and excludes DMCs falling less or equal than 100 bps from SNPs, that could be related to genetic variation. DMCs selected according to these criteria are enriched in genes involved in regulation of Wnt signaling pathway, GTPase activity signal transduction, cell communication, multicellular organism development, system development and cell morphogenesis (Supplementary Table S5). We finally evaluated if breed epigenetic signatures can be maintained during epigenetic reprogramming. We explored data from cells isolated in different stages of cattle embryogenesis: 2C, CM, B and S from the Bioproject PRJNA433629. Due to the low CpG coverage of the PRJNA433629 dataset, the methylation percentage for different embryonic developmental stages was calculated for each cellular type by combining replicates, and the resulting datasets were compared to DMCs obtained by H and M sperm methylation comparison. Only a little proportion of the DMCs individuated by our analysis (308/6,074) showed at least a CpG 5X coverage in all the four developmental stages of the PRJNA433629 dataset. Hierarchical clustering showed that methylation profile of the PRJNA433629 sperm sample was similar to those from H and M sperm, and clustered more similar to blastocyst rather than 2-cell and compact morula stages. Some cytosines were consistent with the occurrence of a demethylation and re-methylaion reprogramming wave (Supplementary Figure S2). These results, however, should be interpreted with caution because of the low level of CpG coverage in the PRJNA433629 dataset, and because only a portion of the entire epigenome is represented in RRBS experiments. In fact, only a few genes (LOC112442278, LOC112446467, PPM1B, LOC112443250 and RN18S1) are present in the genomic regions interested by these DMCs, which for the overmentioned technical reasons represent only a very little subset of the DMCs potentially involved in the methylation reprogramming wave.

Differentially methylated cytosines (DMCs) and comparison with subspecies blood epigenetic diversity. (is epigenetic breed diversity conserved in different tissues?)

We recently assessed Bos subspecies epigenetic diversity in blood from Angus and Nellore identifying 25,765 DMCs between B. taurus and B. indicus (tau_ind_DMCs) (Capra et al., 2023). In order to assess if blood bovine subspecies epigenetic diversity can be associated to breed epigenetic diversity in sperm, we evaluated whether H vs. M DMCs were enriched for tau_ind_DMCs with respect to r-Cs control subsets. Comparison of the H vs. M DMCs (n = 6,074) with those found in this previous study showed that 24.0%, 5.5%, 1,7%, and 1.2% cytosines are close or adjacent (within 2,000 bps, 100 bps, 10 bps, 2 bps) to tau_ind_DMCs. On the contrary, the average number of cytosines belonging to the 10 control subsets of that are close or adjacent to tau_ind_DMCs, was statistically lower (13.87%, 1.59%, 0.31%, and 0.22%) for less than 2000 bps, 100 bps, 10 bps and 2 bps intervals, respectively, as confirmed by t-test analysis (Figure 4; Supplementary Table S6). This significant difference highlights a partially overlapping pattern of genomic localization of breed epigenetic variation in blood and sperm.

Figure 4
www.frontiersin.org

Figure 4. Percentage of differentially methylated cytosines (DMCs, blue) and SNP free cytosines (r-Cs, orange) near subspecies-specific differentially methylated cytosines (DMCs between Bos taurus and Bos indicus: 2 bps, 10 bps, 100 bps, 2 kbs from a SNPs position).

Differentially methylated cytosines (DMCs) organization. (do epigenetic breed diversity variations show specific features?)

A previous study showed that taurus and indicus subspecies present a high number of adjacent differentially methylated CpGs (DMC stretches) (Capra et al., 2023). Here, H and M show a higher percentage of CpG stretches with at least 2 consequent cytosines at a distance of 2,000 bp or less in DMCs, when compared to r-Cs subsets (Figure 5A). Moreover, the number of adjacent cytosines (i.e., cytosines included in stretches) occurring by chance in r-Cs is much lower with respect to the one observed in DMCs (p-value <2.2e-22) (Figure 5A; Supplementary Table S7). Considering that we are analyzinng RRBS experiments, we should assume that the sequenced cytosines are mostly contained in methylated regions. Nonetheless, we observe these differences in the distribution within CpG stretches of DMCs versus other cytosines that are methylated but not differentially methylated between H and M. Interestingly, among genes having longest CpG stretches, LOC112443340 (36 adjacent DMCs in 381bps) and LOC112443250 (16 adjacent DMCs in 304 bps), both coding for 5S ribosomal RNA, present a specific feature by sorrounding a 25 Kb genomic tract encompassing other two genes (LOC112443149 and LOC112443150, Figure 5B).

Figure 5
www.frontiersin.org

Figure 5. (A) Percentage of CpGs according to the proximity to the next CpG (1 bp, between 2 bps and 2 kbs and more than 2 kbs) for DMCs and the ten random r-Cs subsets. (B) The Bos taurus ARS-UCD1.2 NC_037348.1:32996630-3027258 region and the level of methylation in 9 H and 9 M animals for DMC stretches (DMCs distance from the adjacent cytosine below 2 kbs) encompassing LOC112443250 and LOC112443340 5S rRNA genes, which surround LOC112443149 and LOC112443150, codifying liprin-alpha-1-like and S-antigen like proteins.

Differentially methylated cytosines (DMCs) and repeat regions. (do breed epigenetic variations preferentially overlap with regions containing repeats?)

We finally tested if DMCs are preferentially distributed within repeated regions. In general, in our datasets DMCs are more abundantly located in repetitive regions than due to chance (i.e., than in r-Cs), mostly occurring in ERV-LTR, 5S rRNA, BTSAT4 Satellites repeat elements and long interspersed nuclear elements (LINE) (Figure 6; Supplementary Table S8).

Figure 6
www.frontiersin.org

Figure 6. Plot of the number of CpGs overlapping repeats for differentially methylated cytosines (DMCs) and control subsets of random SNP-free cytosines (r-Cs).

DMCs overlapping positions show enrichments and depletions for specific classes of repeat elements. In particular, when compared to control subsets, DMCs show enrichment in specific classes of ERV-LTR, 5S rRNA, BTSAT4 Satellites repeat elements and long interspersed nuclear elements (LINE).

Discussion

Using publicly available RRBS data, our work characterized interbreed epigenetic diversity in the cattle male germline. Since 1988, The National Center for Biotechnology Information (NCBI) collects open-access nucleotide and genomic sequences from many organisms, including next-generation sequencing data from genomic, transcriptomic and epigenetic studies (Kodama et al., 2012; NCBI Resource Coordinators, 2013), contained in the sequence read archive (SRA). SRA data are organized through BioProject and BioSample records that link to corresponding data stored in archival repositories, thus facilitating and improving the organization of the metadata describing the experiment (Barrett et al., 2012). In order to study cytosines free of breeds polymorphisms, thus avoiding biases due to incorrect CpG methylation calls, we identified two suitable RRBS experiments (Costes et al., 2022; Johnson et al., 2022) and a WGS dataset for the same two breeds (Boussaha et al., 2015), that were processed following previously optimized strategies (Capra et al., 2023).

We observed epigenetic specificity characterizing cattle breed diversity. First of all, epigenetic variation in H and M sperm in part adheres to the pattern of genomic regions showing CpG methylation variation in blood from B. taurus and B. indicus subspecies (Capra et al., 2023). Secondly, sperm epigenetic diversity partially correlates with genetic differences in the two breeds (Heyn et al., 2013; Zhi et al., 2013). Another aspect is that breed epigenetic diversity exhibits a high level of coordination in methylation, represented by the presence of adjacent CpG sites that are differentially co-methylated (methylated CpG stretches). A similar CpG methylation configuration was also previously observed in different cell types (Shoemaker et al., 2010) and tissues (Guo et al., 2017).Finally, we found that breed epigenetic signature co-localizes with different classes of repetitive elements, indicating that repeat elements play an important and conserved role in the establishment of inter-breed epigenetic variability. DMCs were enriched in Endogenous Retrovirus 2 with long terminal repeats ERV2 LTRs, 5S rRNA, BTSAT4 Bovine satellite and LINEs. Interestingly, both ERV LTRs repeat and ribosomal DNA (rDNA) act as important transcriptional regulatory elements with silencing or activation of the expression of different sets of genes in mammals, which is mediated by DNA methylation (Thompson et al., 2016; Tchurikov and Kravatsky, 2021; Zeng et al., 2022). We can consider the well-known example of mammalian epigenetic transmission mediated by repetitive elements of the Class II endogenous retrovirus (ERV) intracisternal A-particle (IAP), which methylation influences the expression of the Agouti coat colour gene locus in mouse (Blewitt et al., 2006; Bertozzi and Ferguson-Smith, 2020). A number of studies suggest transgenerational epigenetic inheritance (TEI) of DNA methylation changes, which involves genomic regions that are thought to almost partially escape epigenetic reprogramming through generations (Xavier et al., 2019). As recently observed in sheep (Braz et al., 2023), a consistent part of these methylation signatures falls within LINEs. H and M epigenetic comparison also evidenced a variation of CpG methylation in two adjacent 5SrRNA regions surrounding a 25 Kb genomic region containing the genes codifying a liprin-alpha-1-like (known to be important for axon guidance and mammary gland development) and S-antigen like proteins. Interestingly, the same 5SrRNA region has been observed to be differentially methylated between mammary gland tissues from Holstein cows producing milk with high and low protein contents (Wang et al., 2021). We cannot exclude that part of the sperm CpG methylation variability in the two breeds could be in part associated to semen quality in H and M animals. In fact, methylation within BTSAT4 was also previously observed to change in high and low motile B. taurus sperm populations (Capra et al., 2019). However, most of the epigenetic variation between the two breeds was observed to target genes related to cell signaling, multicellular organism development, system development and cell morphogenesis. Interestingly, H and M breed epigenetic diversity deeply affects Wnt/β-catenin signaling, an essential regulator in embryonic development and adult tissue homeostasis (Liu et al., 2022). Several experiments provide emerging evidence that sperm epigenome serves as a model for embryonic development (Lismer and Kimmins, 2023). Human and animal exposure to different environmental factors was linked to sperm methylation alteration, targeting important reproductive and developmental loci associated with offspring phenotypes (Rakyan et al., 2003; Lismer et al., 2024). Our data show that HvsM DMCs could maintain the methylation state after epigenetic reprogramming, particularly in long-CpG stretches close to genes that have a potential impact on anatomical morphogenesis and breed traits (LOC112442278, LOC112446467, PPM1B, LOC112443250, RN18S1; Wang et al., 2021; Capra et al., 2023). On the other hand, more in depth studies are necessary to fully understand the impact of breed epigenetic diversity on embryo development and to fully explore the rules governing breed epigenetic inheritance.

Conclusion

In conclusion, interbreed epigenetic diversity assessed by RRBS sperm profiling in H and M animals showed specific CpG methylation signatures, with peculiar characteristics. H and M CpG methylation variation partially correlates with blood bovine interspecies methyl CpG diversity, with H and M genetic diversity, it presents many adjacent co-methylated CpG sites and is enriched in specific repeat elements. Many important questions remain unanswered by our study. Further studies will be necessary to fully trace the existence of a breed epigenetic imprint shared among different tissues and to understand how breed epigenetic signatures can persist after embryo epigenetic reprogramming and possibly contribute to breed differentiation.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: (https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJEB46371, https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJEB35854, https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJEB9343).

Ethics statement

Ethical approval was not required for the study involving animals in accordance with the local legislation and institutional requirements because this study utilize pubblic available data.

Author contributions

EC: Conceptualization, Funding acquisition, Methodology, Writing–original draft, Writing–review and editing. BL: Data curation, Formal Analysis, Writing–original draft, Writing–review and editing. PC: Formal Analysis, Writing–review and editing. FT: Writing–review and editing. RN: Writing–review and editing. PA-M: Supervision, Writing–review and editing. AS: Funding acquisition, Supervision, Writing–review and editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This study was carried out within the Agritech National Research Center and received funding from the European Union Next-GenerationEU (National Recovery and Resilience Plan (NRRP), Mission 4, Component 2, Investment 1.4—D.D. 1032 17/06/2022, CN00000022, and under the National Recovery and Resilience Plan (NRRP), Mission 4, Component 2, Investment 1.1, Call for tender No. 1409 published on 14.9.2022 by the Italian Ministry of University and Research (MUR), funded by the European Union–NextGenerationEU–Project Title Beef and Dairy Cattle Epigenetic Diversity Associated to Functional Adaptation (EpiC Fun) – CUP B53D23025020001- Grant Assignment Decree No. 1364 adopted on 01/09/23 by the Italian Ministry of Ministry of University and Research (MUR).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcell.2025.1532711/full#supplementary-material

References

Alvarado, S., Rajakumar, R., Abouheif, E., and Szyf, M. (2015). Epigenetic variation in the Egfr gene generates quantitative variation in a complex trait in ants. Nat. Commun. 6, 6513. doi:10.1038/ncomms7513

PubMed Abstract | CrossRef Full Text | Google Scholar

Barrett, T., Clark, K., Gevorgyan, R., Gorelenkov, V., Gribov, E., Karsch-Mizrachi, I., et al. (2012). BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res. 40, D57–D63. doi:10.1093/nar/gkr1163

PubMed Abstract | CrossRef Full Text | Google Scholar

Barth, A. D., and Waldner, C. L. (2002). Factors affecting breeding soundness classification of beef bulls examined at the Western College of Veterinary Medicine. Can. Vet. J. 43, 274–284.

PubMed Abstract | Google Scholar

Bertozzi, T. M., and Ferguson-Smith, A. C. (2020). Metastable epialleles and their contribution to epigenetic inheritance in mammals. Semin. Cell Dev. Biol. 97, 93–105. doi:10.1016/j.semcdb.2019.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Bindea, G., Mlecnik, B., Hackl, H., Charoentong, P., Tosolini, M., Kirilovsky, A., et al. (2009). ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25, 1091–1093. doi:10.1093/bioinformatics/btp101

PubMed Abstract | CrossRef Full Text | Google Scholar

Blewitt, M. E., Vickaryous, N. K., Paldi, A., Koseki, H., and Whitelaw, E. (2006). Dynamic reprogramming of DNA methylation at an epigenetically sensitive allele in mice. PLoS Genet. 2, e49. doi:10.1371/journal.pgen.0020049

PubMed Abstract | CrossRef Full Text | Google Scholar

Boussaha, M., Esquerré, D., Barbieri, J., Djari, A., Pinton, A., Letaief, R., et al. (2015). Genome-wide study of structural variants in bovine Holstein, Montbéliarde and normande Dairy breeds. PLoS One 10, e0135931. doi:10.1371/journal.pone.0135931

PubMed Abstract | CrossRef Full Text | Google Scholar

Braz, C. U., Passamonti, M. M., and Khatib, H. (2023). Characterization of genomic regions escaping epigenetic reprogramming in sheep. Environ. Epigenetics 10, dvad010. doi:10.1093/eep/dvad010

PubMed Abstract | CrossRef Full Text | Google Scholar

Capra, E., Lazzari, B., Milanesi, M., Nogueira, G. P., Garcia, J. F., Utsunomiya, Y. T., et al. (2023). Comparison between indicine and taurine cattle DNA methylation reveals epigenetic variation associated to differences in morphological adaptive traits. Epigenetics 18, 2163363. doi:10.1080/15592294.2022.2163363

PubMed Abstract | CrossRef Full Text | Google Scholar

Capra, E., Lazzari, B., Turri, F., Cremonesi, P., Portela, A. M. R., Ajmone-Marsan, P., et al. (2019). Epigenetic analysis of high and low motile sperm populations reveals methylation variation in satellite regions within the pericentromeric position and in genes functionally related to sperm DNA organization and maintenance in Bos taurus. BMC Genomics 20, 940. doi:10.1186/s12864-019-6317-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Costes, V., Chaulot-Talmon, A., Sellem, E., Perrier, J. P., Aubert-Frambourg, A., Jouneau, L., et al. (2022). Predicting male fertility from the sperm methylome: application to 120 bulls with hundreds of artificial insemination records. Clin. Epigenetics 14, 54. doi:10.1186/s13148-022-01275-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Dean, W., Santos, F., Stojkovic, M., Zakhartchenko, V., Walter, J., Wolf, E., et al. (2001). Conservation of methylation reprogramming in mammalian development: aberrant reprogramming in cloned embryos. Proc. Natl. Acad. Sci. U. S. A. 98, 13734–13738. doi:10.1073/pnas.241522698

PubMed Abstract | CrossRef Full Text | Google Scholar

Ewels, P. A., Peltzer, A., Fillinger, S., Patel, H., Alneberg, J., Wilm, A., et al. (2020). The nf-core framework for community-curated bioinformatics pipelines. Nat. Biotechnol. 38, 276–278. doi:10.1038/s41587-020-0439-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Fanti, L., Piacentini, L., Cappucci, U., Casale, A. M., and Pimpinelli, S. (2017). Canalization by selection of de novo induced mutations. Genetics 206, 1995–2006. doi:10.1534/genetics.117.201079

PubMed Abstract | CrossRef Full Text | Google Scholar

Garrison, E., and Marth, G. (2012). Haplotype-based variant detection from short-read sequencing. arXiv Prepr. arXiv:1207.3907 [q-bio.GN]. doi:10.48550/arXiv.1207.3907

CrossRef Full Text | Google Scholar

Gore, A. V., Tomins, K. A., Iben, J., Ma, L., Castranova, D., Davis, A. E., et al. (2018). An epigenetic mechanism for cavefish eye degeneration. Nat. Ecol. Evol. 2, 1155–1160. doi:10.1038/s41559-018-0569-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, S., Diep, D., Plongthongkum, N., Fung, H. L., Zhang, K., and Zhang, K. (2017). Identification of methylation haplotype blocks aids in deconvolution of heterogeneous tissue samples and tumor tissue-of-origin mapping from plasma DNA. Nat. Genet. 49, 635–642. doi:10.1038/ng.3805

PubMed Abstract | CrossRef Full Text | Google Scholar

Heckwolf, M. J., Meyer, B. S., Häsler, R., Höppner, M. P., Eizaguirre, C., and Reusch, T. B. H. (2020). Two different epigenetic information channels in wild three spined sticklebacks are involved in salinity adaptation. Sci. Adv. 6, eaaz1138. doi:10.1126/sciadv.aaz1138

PubMed Abstract | CrossRef Full Text | Google Scholar

Heyn, H., Moran, S., Hernando-Herraez, I., Sayols, S., Gomez, A., Sandoval, J., et al. (2013). DNA methylation contributes to natural human variation. Genome Res. 23, 1363–1372. doi:10.1101/gr.154187.112

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, Z., Lin, J., Dong, H., Zheng, X., Marjani, S. L., Duan, J., et al. (2018). DNA methylomes of bovine gametes and in vivo produced preimplantation embryos. Biol. Reprod. 99, 949–959. doi:10.1093/biolre/ioy138

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson, C., Kiefer, H., Chaulot-Talmon, A., Dance, A., Sellem, E., Jouneau, L., et al. (2022). Prepubertal nutritional modulation in the bull and its impact on sperm DNA methylation. Cell Tissue Res. 389, 587–601. doi:10.1007/s00441-022-03659-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Kodama, Y., Shumway, M., and Leinonen, R.International Nucleotide Sequence Database Collaboration (2012). The Sequence Read Archive: explosive growth of sequencing data. Nucleic Acids Res. 40, D54–D56. doi:10.1093/nar/gkr854

PubMed Abstract | CrossRef Full Text | Google Scholar

Lindner, M., Laine, V. N., Verhagen, I., Viitaniemi, H. M., Visser, M. E., van Oers, K., et al. (2021). Rapid changes in DNA methylation associated with the initiation of reproduction in a small songbird. Mol. Ecol. 30, 3645–3659. doi:10.1111/mec.15803

PubMed Abstract | CrossRef Full Text | Google Scholar

Lismer, A., and Kimmins, S. (2023). Emerging evidence that the mammalian sperm epigenome serves as a template for embryo development. Nat. Commun. 14, 2142. doi:10.1038/s41467-023-37820-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Lismer, A., Shao, X., Dumargne, M. C., Lafleur, C., Lambrot, R., Chan, D., et al. (2024). The association between long-term DDT or DDE exposures and an altered sperm epigenome-a cross-sectional study of Greenlandic inuit and South African VhaVenda men. Environ. Health Perspect. 132, 17008. doi:10.1289/EHP12013

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., Xiao, Q., Xiao, J., Niu, C., Li, Y., Zhang, X., et al. (2022). Wnt/β-catenin signalling: function, biological mechanisms, and therapeutic opportunities. Signal Transduct. Target Ther. 7, 3. doi:10.1038/s41392-021-00762-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Messerschmidt, D. M., Knowles, B. B., and Solter, D. (2014). DNA methylation dynamics during epigenetic reprogramming in the germline and preimplantation embryos. Genes Dev. 28, 812–828. doi:10.1101/gad.234294.113

PubMed Abstract | CrossRef Full Text | Google Scholar

Morgan, H. D., Sutherland, H. G., Martin, D. I., and Whitelaw, E. (1999). Epigenetic inheritance at the agouti locus in the mouse. Nat. Genet. 23, 314–318. doi:10.1038/15490

PubMed Abstract | CrossRef Full Text | Google Scholar

NCBI Resource Coordinators (2013). Database resources of the national center for Biotechnology information. Nucleic Acids Res. 41, D8–D20. doi:10.1093/nar/gks1189

PubMed Abstract | CrossRef Full Text | Google Scholar

Ponsuksili, S., Trakooljul, N., Basavaraj, S., Hadlich, F., Murani, E., and Wimmers, K. (2019). Epigenome-wide skeletal muscle DNA methylation profiles at the background of distinct metabolic types and ryanodine receptor variation in pigs. BMC Genomics 20, 492. doi:10.1186/s12864-019-5880-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Rakyan, V. K., Chong, S., Champ, M. E., Cuthbert, P. C., Morgan, H. D., Luu, K. V., et al. (2003). Transgenerational inheritance of epigenetic states at the murine Axin(Fu) allele occurs after maternal and paternal transmission. Proc. Natl. Acad. Sci. U. S. A. 100, 2538–2543. doi:10.1073/pnas.0436776100

PubMed Abstract | CrossRef Full Text | Google Scholar

Rawlings, N., Evans, A. C., Chandolia, R. K., and Bagu, E. T. (2008). Sexual maturation in the bull. Reprod. Domest. Anim. 43, 295–301. doi:10.1111/j.1439-0531.2008.01177.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Schenkel, F. S. (2021). Prospects for exploiting epigenetic effects in livestock production. Anim. Front. 11, 3–4. doi:10.1093/af/vfab071

PubMed Abstract | CrossRef Full Text | Google Scholar

Shoemaker, R., Deng, J., Wang, W., and Zhang, K. (2010). Allele-specific methylation is prevalent and is contributed by CpG-SNPs in the human genome. Genome Res. 20, 883–889. doi:10.1101/gr.104695.109

PubMed Abstract | CrossRef Full Text | Google Scholar

Tchurikov, N. A., and Kravatsky, Y. V. (2021). The role of rDNA clusters in global epigenetic gene regulation. Front. Genet. 12, 730633. doi:10.3389/fgene.2021.730633

PubMed Abstract | CrossRef Full Text | Google Scholar

Thompson, P. J., Macfarlan, T. S., and Lorincz, M. C. (2016). Long terminal repeats: from parasitic elements to building blocks of the transcriptional regulatory repertoire. Mol. Cell 62, 766–776. doi:10.1016/j.molcel.2016.03.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Triantaphyllopoulos, K. A., Ikonomopoulos, I., and Bannister, A. J. (2016). Epigenetics and inheritance of phenotype variation in livestock. Epigenetics Chromatin 9, 31. doi:10.1186/s13072-016-0081-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, M., Bissonnette, N., Dudemaine, P. L., Zhao, X., and Ibeagha-Awemu, E. M. (2021). Whole genome DNA methylation variations in mammary gland tissues from Holstein cattle producing milk with various fat and protein contents. Genes (Basel) 12, 1727. doi:10.3390/genes12111727

PubMed Abstract | CrossRef Full Text | Google Scholar

Xavier, M. J., Roman, S. D., Aitken, R. J., and Nixon, B. (2019). Transgenerational inheritance: how impacts to the epigenetic and genetic information of parents affect offspring health. Hum. Reprod. Update 25, 518–540. doi:10.1093/humupd/dmz017

PubMed Abstract | CrossRef Full Text | Google Scholar

Zeng, L., Wang, M., Zhou, J., Wang, X., Zhang, Y., and Su, P. (2022). A hypothesis: retrotransposons as a relay of epigenetic marks in intergenerational epigenetic inheritance. Gene 817, 146229. doi:10.1016/j.gene.2022.146229

PubMed Abstract | CrossRef Full Text | Google Scholar

Zeng, Y., and Chen, T. (2019). DNA methylation reprogramming during mammalian development. Genes 10, 257. doi:10.3390/genes10040257

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhi, D., Aslibekyan, S., Irvin, M. R., Claas, S. A., Borecki, I. B., Ordovas, J. M., et al. (2013). SNPs located at CpG sites modulate genome-epigenome interaction. Epigenetics 8, 802–806. doi:10.4161/epi.25501

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: epigenetic, cattle, breed, sperm, DNA methylation

Citation: Capra E, Lazzari B, Cozzi P, Turri F, Negrini R, Ajmone-Marsan P and Stella A (2025) Defining bovine CpG epigenetic diversity by analyzing RRBS data from sperm of Montbéliarde and Holstein bulls. Front. Cell Dev. Biol. 13:1532711. doi: 10.3389/fcell.2025.1532711

Received: 22 November 2024; Accepted: 06 February 2025;
Published: 20 February 2025.

Edited by:

Marta Lombó, University of León, Spain

Reviewed by:

Francesco Manfrevola, University of Campania Luigi Vanvitelli, Italy
Fei-Man Hsu, University of California, Los Angeles, United States

Copyright © 2025 Capra, Lazzari, Cozzi, Turri, Negrini, Ajmone-Marsan and Stella. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Emanuele Capra, ZW1hbnVlbGUuY2FwcmFAaWJiYS5jbnIuaXQ=

These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more