
94% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
ORIGINAL RESEARCH article
Front. Genet. , 19 March 2025
Sec. Livestock Genomics
Volume 16 - 2025 | https://doi.org/10.3389/fgene.2025.1507315
The origin of sheep and their spread following domestication have been widely investigated using archaeology, genetics, and genomics. A thorough investigation of the genetic diversity of the breeds is key to providing useful information for conservation and breeding programmes. In Hungary, sheep farming contributes to the agricultural sector and national economy. It plays a crucial role in rural livelihoods, exports, and environmental management while also contributing to the national economy and preserving Hungary’s cultural and agricultural heritage through traditional breeds. This study aims to analyse the population structure and patterns of admixture in four local Hungarian sheep breeds, namely, Indigenous Tsigai, Hortobagyi Racka, Cikta, and Bábolna Tetra. Our results revealed that the indigenous Hungarian Hortobagyi Racka sheep are distinct from the other Hungarian breeds studied. The effective population sizes were found to be low, with varying levels of genomic inbreeding both within and across breeds. These results align with documented bottlenecks and instances of crossbreeding with other local or improved breeds. Ancestry analysis demonstrated some introgression between Scandinavian and Hungarian sheep breeds and vice versa. This gene flow may have occurred recently due to the widespread use of northern breeds such as Finnsheep and Romanov to enhance productivity, but it could also date back much further. Despite some limitations, our outcomes can contribute to future conservation plans, and a more comprehensive analysis of all native Hungarian sheep breeds should be highlighted to the relevant authorities in order to secure further funds.
Elucidating the genetic composition and the origin of local breeds is essential to maintaining global genetic biodiversity via appropriate management plans, particularly in light of the modern environmental crisis and rapidly changing climatic conditions. Several studies have described the global genetic diversity of sheep using genotype data (Ceccobelli et al., 2023; Wanjala et al., 2023; Zsolnai et al., 2021; Ciani et al., 2020; Deniskova et al., 2018; Kijas et al., 2012; Zhao et al., 2017), but research on breed origin and history in Central–Eastern Europe is still in its early stages and is mostly based on microsatellites or mitochondrial data (Loukovitis et al., 2023; Astuti et al., 2023a; Gáspárdy et al., 2021; Dudu et al., 2016; Kawęcka et al., 2022; Kusza et al., 2008; 2009; 2010; 2011; Rannamäe et al., 2023). Hungary’s vast pasturelands and moderate climate provide a favourable environment for animal production. Hence, agriculture plays an important economic role, especially in rural areas (Bakucs et al., 2020). Agriculture employs 4.8% of Hungary’s workforce, and in livestock production, sheep rank third in economic significance after poultry and pigs (Astuti et al., 2023b; https://www.ksh.hu/?lang=en). In this study, we analysed genotype data from four local Hungarian sheep breeds, namely, indigenous Tsigai, Hortobagyi Racka, Cikta, and Bábolna Tetra. All these breeds are long-tailed. The Indigenous Tsigai is a traditional sheep breed that belongs to the Tsigai group of breeds, indigenous to Eastern and Central Europe (Astuti et al., 2023a; Kusza et al., 2011; 2008). The Hortobagyi Racka sheep is an ancient Hungarian breed known for its striking spiral-shaped horns, which are present in both male and female sheep. This breed is well-adapted to harsh environmental conditions, thriving in both hot summers and cold winters. The breed comes in two distinct colour variants, namely, black and white. They are believed to be similar to the Zackel sheep of ancient Egypt, which were the origin of various migrations to the Middle East and Europe (Draganescu and Grosu, 2010). Phenotypically, breeds from the Zackel group are small to medium-sized, with long, coarse wool and spiralled horns (Kusza et al., 2008). Cikta historically belongs to the “mountain sheep breeds,” a group considered to be relics of the extinct “Zaupel or Neolithic Turf sheep,” which, in turn, was one of the original breeds of the European Alps (Gáspárdy et al., 2021; Heinz et al., 2021; Kovács et al., 2019). Bábolna Tetra is a white, prolific breed developed from the Hungarian Merino, the Romanov, and the Finnish Landrace, combining a “Nordic” background and Hungarian influence (Neubauer et al., 2015). During the last two centuries, Merino and Texel have contributed genetically to several Hungarian breeds (Beynon et al., 2015; Ciani et al., 2020). Following Mujitaba et al. (2024), the local risk status of these Hungarian sheep breeds is as follows: the Hungarian Hortobagyi Racka (both varieties) and Indigenous Tsigai are categorised as vulnerable, while Cikta was categorised as endangered. It is, therefore, essential to conserve the genetic resources of these important local sheep. The decline in the populations of these breeds was predominantly driven by a shift in breeders’ preference towards more productive and modern breeds. However, several conservation programmes, both in situ and ex situ, have already been developed to support their preservation and promote their restoration (Mujitaba et al., 2024). The primary objective of this study was to evaluate the genomic diversity of four indigenous Hungarian sheep breeds in comparison to other Central–Eastern European breeds using genotyping data, with the goal of clarifying their origins and filling the knowledge gap of this poorly explored group of breeds. The research further included local breeds from neighbouring regions to examine gene flow and historical admixture events that occurred during the Viking Age and early Middle Ages (800–1300 CE), a period marked by extensive Norse influence across Europe, spanning from Iceland to Russia (Ellis, 2021). These findings provide valuable insights into the population structure and genetic connections that have shaped contemporary Hungarian sheep breeds.
Due to the sampling framework of the SMARTER project, only the breeds included in the present analysis have been genotyped. However, in the FAO DAD-IS system (https://www.fao.org/dad-is/browse-by-country-and-species/en/), a total of 33 sheep breeds are listed as both local and commercial. Three Hungarian sheep breeds were newly genotyped for this study in the context of the SMARTER project (https://www.smarterproject.eu/): Hortobagyi Racka (28 animals from 2 farms; n = 16 white and n = 12 black), Indigenous Tsigai (34 animals from 2 farms), and Bábolna Tetra (27 animals from 1 farm). The total blood samples were collected from the jugular veins, preserved in tubes containing EDTA, and transferred to FTA cards. The dried FTA cards were kept at room temperature until they were shipped to the genotyping company for further processing. The entire workflow, including DNA isolation, genotyping, and the conversion of raw signals into Ovine 50K genotypes, was outsourced to Neogen Chemicals Limited (https://www.neogen.com). Neogen utilised the Infinium® OvineSNP50 BeadChip Array, which contains over 50,000 evenly distributed SNP-targeting probes. Genotype data for the remaining breeds were obtained from publicly available databases (see Supplementary Table 1 for details). We kept the new Hortobagyi Racka data separate from the previously genotyped Racka (Barbato et al., 2017) in order to assess their genetic structure and highlight any potential differences between these two populations. Genotype data for Cikta were also obtained from a publicly available database (Ciani et al., 2020). Sampling locations are detailed in Supplementary Figure S1.
To represent Merino and Texel influences, we included data from those breeds in the FST and individual ancestry (genetic components) estimations to identify “introgression” and estimate a genetic contribution score (GCS, see below).
In total, genotype data for 1,338 sheep were available. All data were genotyped using the Illumina OvineSNP50 or the Illumina Ovine HD SNP (Supplementary Table S1). After merging data using PLINK v1.9 (Chang et al., 2015), the SNP data were mapped against the genome assembly Oar_v3.1 using SNPchiMp v.3 (Nicolazzi et al., 2015) and a series of custom scripts developed in the context of the SMARTER project (https://smarter-database.readthedocs.io/en/latest/index.html). Quality control was performed using PLINK v2.0 (Chang et al., 2015), following the FAO guidelines for the genomic characterisation of animal genetic resources (Ajmone-Marsan et al., 2023). Variants with a call rate below 90% and a minor allele frequency of less than 1% were excluded from the analysis. Individuals with genotype calls of less than 95% markers were also removed. Additionally, individuals displaying extreme levels of observed heterozygosity were filtered out using the “--het flag” for obtaining the genotypic counts. Afterwards, we used R v4.1 (R Core Team, 2018) to first compute the observed heterozygosity and then identify those individuals having within-population extreme heterozygosity (>3 SD). Linkage disequilibrium (LD) pruning was performed using PLINK v1.9 with a window size of 50 SNPs, shifting the windows by five SNPs at each step, and an r^2 threshold of 0.5 to reduce ascertainment bias (Lachance and Tishkoff, 2013). Duplicate samples and closely related individuals were removed by applying the command “--king cat-off” in PLINK v2.0 (Chang et al., 2015). Unmapped variants and variants on the sex chromosomes were also excluded. After quality control, a total of 31,010 SNPs were used for genetic diversity and population structure analyses. To avoid the loss of information due to the LD pruning process when applying the analysis based on LD, we reintroduced the variants in linkage disequilibrium to our filtered dataset, resulting in a total of 35,688 SNPs for the runs of homozygosity (ROH) detection and
Population differentiation and genetic variability among and within breeds, as well as among groups of breeds, were assessed using Wright’s FST and Reynolds’ genetic distance (Supplementary Table S2; Supplementary Figure S2). For the visualisation of pairwise FST values, we used the R package pheatmap v. 1.0.12 (https://CRAN.R-project.org/package=pheatmap, R Core Team, 2018).
Software GONE (Santiago et al., 2020) was used to obtain contemporary and historical estimates of the effective population size with unphased genotypes, correcting for sample sizes, and using the default settings (except for the number of internal replicates, which was set to 100). The so-called contemporary
Runs of homozygosity were detected using the R package detectRUNS 0.9.5 (Biscarini et al., 2018) with default settings, except for the minimum length of homozygous segments and the minimum number of SNPs included in the runs (set to 1 Mb and 50 SNP, respectively). The genomic inbreeding coefficient (FROH) was computed as the total length of the genome covered by ROH divided by the total length of autosomes flanked by markers. Their distribution per classes of length (0–2, 2–4, 4–8, 8–16, and >16 Mb) was determined.
Clustering analysis was used to investigate population structure and genetic components at the individual level, applying the principal component analysis (PCA) using the SNPRelate package v. 1.38 (Zheng et al., 2012) and the unsupervised method in ADMIXTURE v.1.3 (Alexander et al., 2009), with K clusters ranging from 2 to 40. The cross-validation error function was computed for each value of K. K with the lowest cross-validation error was chosen to model the data. We used evalAdmix v. 0.95 (Garcia-Erill and Albrechtsen, 2020; available at https://github.com/GenisGE/evalAdmix) to evaluate the fit of admixture proportions. In brief, the method estimates the correlation of the residual difference between the true genotypes and the genotypes predicted by the model. In the case of poor fit of the admixture model, individuals with similar demographic histories exhibit a positive correlation with their residuals. The correlation of the residuals’ matrix was plotted using R v4.1 (R Core Team, 2018) and the visFuns.R script available with the program and BITE was used to plot the ADMIXTURE results. In addition to established traditional methods of population analysis, we applied the pipeline implemented in NetViewP (Neuditschko et al., 2012; Steinig et al., 2016) (https://github.com/esteinig/netviewr/tree/master/R) and used NetViewR as a visualisation tool. This programme offers a more comprehensive overview of the population structure at different levels of genetic similarity, allowing a focus on fine- or large-scale structure (high-resolution population structure). The programme NetViewP implements three machine learning methods to establish the mutual k-nearest neighbour threshold for network analysis. Since the dataset included several breeds with a critical conservation status (Supplementary Table 1, DAD-IS, https://www.fao.org/dad-is/browse-by-country-and-species/en/), we combined the high-definition network visualisation (NetView) and the model-based clustering results (ADMIXTURE) with the identification of “key contributors,” following the integrated three-step approach (Steinig et al., 2016). Steinig and collaborators define “key contributors” as individuals retaining the largest variation in the relevant genetic relationship structure within populations (the most informative). We ran the NetView analysis with 10,000 iterations to calculate the genetic contribution score (GCS). To further investigate phylogenetic relationships among the sheep breeds in our study, a network-building method (NeighbourNet) analysis based on the matrix of Reynolds’ unweighted distances previously calculated was first performed using SplitsTree v5.3.0 software (Huson and Bryant, 2006). NeighbourNet is a method for clustering taxa into hierarchically nested sets, similar to cladistics and Bayesian inference. However, it differs in that it does not strictly adhere to a branching model of descent. Instead, it allows for overlapping and intersecting sets, making it better suited for detecting conflicting signals in datasets, such as those resulting from borrowing and blending among evolutionary lineages. The key features of the method are as follows: i) relationships among taxa are inferred by calculating pairwise distances based on character data; ii) weighted splits are generated and combined using an agglomerative clustering algorithm; and iii) results are represented as a “split graph” or network diagram with a “tree-like structure” (consistent splits form a branching and tree-like appearance or “box-like structures”; inconsistent splits produce a latticed appearance, reflecting reticulation (blending)). More specifically, a network-like topology is a graphical representation that reflects relationships among populations, breeds, or taxa that go beyond a simple tree-like structure. Unlike a strictly branching phylogenetic tree, which assumes a single lineage of descent, a network-like topology accounts for non-hierarchical relationships such as gene flow, admixture events, and conflicting signals (situations where genetic data suggest multiple possible relationships between populations). In a network-like topology, these complex relationships are represented as reticulations (interconnected “box-like” structures) in the graph. These reticulations visually indicate genetic blending, admixture, or shared ancestry.
The level of reticulation in the network was quantified using the Delta score and Q-residual score (Grey et al., 2010; Holland et al., 2002). The Delta score and Q-residual score are metrics used to assess conflicting signals within a network, where conflicting signals are calculated first. Both measures evaluate path length discrepancies among pairs of taxa within “quartets” (groups of four taxa) sampled from the network. Quartet scores range from 0 to 1, with values closer to 0 indicating more tree-like splits and values closer to 1 indicating greater reticulation, suggesting complex relationships and possible admixture in the latter case.
A more concise description of the historical demographic relationships between populations can be derived by constructing admixture graphs, which are phylogenetic trees that include admixture events such as splits and mergers among internal nodes (i.e., populations) in their topology (Patterson et al., 2012). AdmixtureBayes is a Python-based programme designed to generate, analyse, and visualise posterior samples of admixture graphs. It provides a method for a more comprehensive exploration: the MCMC framework allows for a thorough search of possible admixture graph topologies, often identifying correct structures where other methods may fail. In addition, it allows for the quantification of uncertainty by providing measures of statistical uncertainty, reporting posterior probabilities of specific admixture events, and offering insights into the confidence of inferred relationships. We constructed admixture graphs to infer and analyse gene flow and admixture events that occurred in the past, applying a Bayesian approach implemented in AdmixtureBayes (Nielsen et al., 2023), available at https://github.com/avaughn271/AdmixtureBayes. Although the programme can generate different types of trees, we considered only the consensus trees. These are trees formed by combining all nodes whose posterior probability of appearing in an admixture graph exceeds a certain threshold. Three posterior probability thresholds were chosen (75%, 90%, and 95%). Analyses were performed running three independent chains, checking for convergence following Nielsen et al. (2023), and discarding 20% of each chain as a burn-in. Each chain was extended until the convergence was reached. More specifically, during the second step of the procedure, the convergence of the MCMC sampler was assessed first using the analyzeSamples.py script implemented in the programme, followed by examining the trace plots of the chain or the Gelman–Rubin convergence diagnostics of parallel chains using the included R script EvaluateConvergence.R. Thus, we reran the MCMC procedure five times, increasing the number of iterations each time, for a total number of 5,000,000 iterations, until all three chains had reached the same stationary distribution. For the construction of the consensus graphs, we analysed only chain1 using the following setting: -mcmc_results chain1. csv--burn_in_fraction 0.30 --thinning_rate 40 --slow.
After merging the datasets, there were initially 582,101 markers and 1,338 animals. However, filtering for minor allele frequency, poor genotyping, and autosomal location reduced the number of markers to 35,688. Additional filtering for LD reduced the dataset to 31,010 markers. The average genotyping rate was 0.998. After applying additional filters for low genotyping quality, excess heterozygosity, and presence of close relatives, 763 animals remained for the subsequent, final analyses.
Geographic information, label acronyms, names of breeds, number of animals per breed before and after the filtering steps, and origin of datasets are summarised in Supplementary Table S1; a geographical map illustrating the sample location is shown in Supplementary Figure S1. The data comprise genotypes obtained from 38 breeds from 16 countries (one transboundary breed, with samples from two countries). There were 5 to 30 animals per breed.
The pairwise FST matrix is shown as a heatmap in Supplementary Figure S2 and summarised in Supplementary Table S2. Among the Hungarian sheep, the two populations HRR and RAK are the most differentiated, with FST estimates of 0.09, compared to other Hungarian breeds. Meanwhile, BAT, CIK, and TSI are more similar to each other, and Merino has a slightly greater influence (FST = 0.05) on these last breeds compared to HRR and RAK (FST = 0.08). In contrast, VAL is the most distant member of the Eastern group. Comparing the distances between Hungarian breeds and the Northern group, estimates range from 0.07 to 0.1 with the Norwegian and Finnish breeds and from 0.1 to 0.17 with the Swedish breeds, whereas the Western group shows a range between 0.07 and 0.15. The observed (Ho) and expected heterozygosity (He) values and the genomic inbreeding coefficient (FROH) are shown in Supplementary Table S3 and Figure 1.
Figure 1. Histograms of descriptive statistics. The observed and expected heterozygosity (Ho and He) and genomic inbreeding (FROH) estimates for each breed are represented with vertical bars.
Within-breed levels of heterozygosity ranged from 0.36 to 0.4 for Ho and from 0.37 to 0.4 for He in the Hungarian breeds; considering all the breeds, OUE has the lowest values (Ho = 0.26 and He = 0.32), GOR has the highest (0.42 for both estimates), and the other breeds are in the range between 0.34 and 0.38 for both indices. Overall, the highest and lowest FROH values are 0.52 and 0.02, respectively. Specifically, the genomic inbreeding coefficient ranges from 0.045 to 0.09 for the Hungarian breed, with the exception of BAT (0.027). For the remaining breeds, the estimates are consistent with the previous studies (Beynon et al., 2015; Ciani et al., 2020; Rochus et al., 2020; Schönherz et al., 2020).
Exploring the distribution of ROH and the total coverage (sum of ROH or SROH) across the genome in each population allows us to quantify inbreeding levels and gain insights into aspects of populations’ demographic history. Following the distribution of length classes (Supplementary Figure S3), an initial survey revealed a common distribution pattern, with no ROH falling in the shortest class (0–2 Mb), and almost 50% of the homozygous segments in all breeds were between 4 and 8 Mb. In the Hungarian CIK and TSI breeds and Swedish FJA and KLO, almost 30% of segments are longer than 16 Mb. Analysing the SROH plot in Supplementary Figure S4 and focussing first on the Hungarian breeds, TSI is divided into two subgroups: one with individuals that have very long and more abundant ROH and another with individuals having a smaller fraction of the genome covered by ROH. BAT is the breed the least autozygosity, and the remaining breeds exhibit similar patterns (fewer and shorter ROH segments). Examining the other populations included in this study, some differences are discernible: NWI, FIN, TWM, DLW, BAT, ROM, and GOR breeds revealed the lowest number of autozygous segments in their respective groups (Northern, Western, and Eastern Europe). For ICL, SWH, NSP, SBF, GAL, and LLY breeds, we observed slightly longer and more ROH segments. FJA, BOR, and OUE have the greatest fraction of the genome covered by ROH.
The clustering analyses allowed us to infer the population structure and individual genetic components. We performed the principal component analysis (PCA) twice: first with all the breeds included (Supplementary Figure S5) and then again with the most divergent breeds excluded to avoid large-sample bias and provide a better representation of their relationship. Figures 2A, B shows the PCA for the first three PCs.
Figure 2. PCA of the dataset after excluding the most divergent breeds. (A) Scores for each individual for PC1–PC2 and (B) PC1–PC3. Colour of symbols indicates the country of origin (bright blue, Finland; blue, Sweden; green, Norway; darker blue, Iceland; light green, British Isles, red, Hungarian; dark red, remaining Eastern European breeds; black, O. orientalis. Symbols of the same colour indicate breeds within the country.
The total percentage of variance accounted for by the first three PCs is almost 7%. The first PC (PC1) divides the breeds on the basis of the types of tails (short- and long-tailed sheep), with the ancestral MIR significantly closer to the Swedish populations and the Icelandic breed constantly clustering with the Norwegian group. The second component (PC2) distinguishes the Swedish (except KLO), Hungarian (CIK, RAK, and HRR), and Central–Eastern European (VAL and GOR) breeds from Norwegian, Icelandic, and the British Isles sheep. The third PC (PC3) distinguishes the Swedish populations DAL and GUT from the remaining Swedish groups.
The connection among breeds depicted by the PCA was supported by the NeighbourNet analysis based on Reynolds’ distances and is shown in Figure 3.
Figure 3. Phylogeographic inference. (A) NeighbourNet graph for the target breeds in our study based on Reynolds’ distance matrix. The branches corresponding to Northern, Western, and Central–Eastern Europe are indicated in blue, green, and red, respectively, within different shades, according to their geographical origin. (B) Plot of Delta scores (red) and (C) Q-residual (green) plots. For the description of the sheep breeds, see Supplementary Table 1.
Despite the complexity of this exploratory analysis, the phylogenetic network (fit = 98.987, Figure 3A) approximately distinguishes the breeds by grouping the short- and long-tailed sheep. SOA, BOR, and SBF, archetypal of “primitive” breeds, are in a separate branch. The graph showed several clear clusters, but a closer inspection of the central part of the network revealed rectilinear webbing within clusters, with regions of conflicting signals for specific breeds. The Delta score and Q-residual score, which quantify the extent of reticulation for the whole network diagram and, thus, offer an accurate measure of departures from a strict tree, are summarised in Figures 3B, C. The estimates of the overall tree and non-tree-likeness of the network yielded an average Delta score of 0.3 and a Q-residual score of 7.457E-4. The highest scores belong to BAT and KAM (Delta score = 0.36 and Q-residual score = 0.001; blue arrows in Figures 3B, C). The measures of tree-likeness for HRR, RAK, GOR, and CIK are the lowest. An exception was the scores for Irish GAL and Danish HMF, where the Delta-score values were more tree-like, in contrast to the Q-residual values, which were slightly more network-like (Delta score = 0.26 and 0.27 and Q-residual score = 0.001; black arrows shown in Figures 3B, C). The results of the two metrics suggest a shift towards a slightly network-like topology, which is reflected in our phylogenetic reconstruction by the reticulated relationships that link the breeds analysed in this study. Individual admixture proportions were estimated using the model-based admixture clustering analysis and are shown in Figure 4.
Figure 4. Admixture results summarised in a circular pattern for different values of K (number of clusters) increasing from the centre (K = 2) to periphery (K = 40). Different genetic contributions are shown as different colours corresponding to different genetic components. For the full definition of breeds, see Supplementary Table 1.
The most probable number of K clusters (individual ancestry estimations) is 31 (Supplementary Figure S6), which is the estimation that exhibits the lowest log-likelihood value after applying the cross-validation method. In Supplementary Table S4, we reported the admixture proportions for the Hungarian breeds analysed in this study. We also evaluated the model fit using evalAdmix as the correlation of residuals (Supplementary Figure S6), and in our analysis, the residuals are uncorrelated among individuals, indicating a good model fit.
Overall, the analysis revealed that OUE, SOA, GUT, and HMF breeds maintained a distinct genetic profile compared to other European breeds. With K = 2, all the Hungarian breeds show both genetic components, as do the remaining breeds, except for SOA (red) and HMF (blue), which show only one component. At K = 4, the yellow component (most prominent in the Swedish GUT breed) was dominant in all populations, excluding the Danish breeds Galway, Ouessant, and Soay. At K = 8, the indigenous component of the Hungarian HRR is observed in all the Eastern European breeds, and from K = 20 onward, many breeds appear genetically well-characterised. Another noticeable consideration is the separation at K26 of TSI in two subpopulations, which is consistent with SROH analysis. Finally, at K = 32, all the breeds show a defined genetic identity except for some Eastern European (CIK, VAL, KAM, and GOR) and a few British Isles populations (BHC, DLW, and TWM).
Since our analyses revealed a complex population structure, we applied a deeper analytical method that allows for the identification of high-resolution population structure and sub-structure. As shown in Supplementary Figure S7, the three methods concur in detecting k = 8 as the starting point to evaluate the relationship between populations. Starting from the larger values of k (Supplementary Figure S7A, k = 45), we observed the connection of more distantly related samples, with some relationships emerging according to the model-based clustering analysis. The Hungarian breeds are closely related to all the Eastern European breeds, with the exception of ROM, which is linked to the Finnish and the short-tailed breeds instead. The Swedish GOT, GUT, and DAL breeds appear the most divergent, together with the primitive BOR and SOA. For k = 33 (Supplementary Figure S7B), the indigenous HRR and RAK breeds are now distinguished from the remaining Hungarian breeds. We also detect more outliers and divergent groups across the short-tailed sheep. Further substructures within both short- and long-tailed sheep are evident (Supplementary Figure S7C, k = 20), with Eastern European populations still showing some relationship to each other, Merino, and two Welsh populations. Moreover, there were some isolated individuals with no connection at the periphery of the graph. Finally, at k = 8 (Supplementary Figure S7D), we can observe a finer-scale genetic structure; in particular, HRR, RAK, and ICL breeds are divided into two groups, revealing a strong substructure. As a complementary analysis, we combined the network information with the individual admixture proportions specified by the ADMIXTURE analysis (Figure 5). This approach provided more detailed information and allowed for the identification of individuals that could be defined as “key contributors.”
Figure 5. Network representing the genetic relationship among breeds and the key contributors. The network was built combining the results of the GCS (the genetic contribution of individuals within population) analysis and the estimates of individual admixture proportions (ancestries). We plotted the results using the most probable number of clusters (K = 31, optimal K-value selected by the cross-validation method in Admixture). Every circle or node represents one individual, and the breed code is indicated within each circle. The size of the nodes is proportional to its GCS reported in Supplementary Table 5. The grey lines are the edges that connect individuals in the network, and their width is proportional to the genetic distance between individuals. Breed codes are shown in Supplementary Table 1.
The size of the circle or node (that represents one individual with its admixture proportion) is proportional to the individual genetic contribution to such population substructure. If we focus on the breeds with a greater proportion of admixture (e.g., VAL and DLW), no key contributors are identified (each animal exhibits a little contribution). The opposite occurs for the Swedish breed (e.g., DAL and FJA), with some individuals ranked amongst the top key contributors (Supplementary Table S5). The topology of the network additionally illustrates the slightly lower impact of admixed sheep on the formation of some populations (SBF, LLY, SSW, TWM, BWM, and ROM). Another observation is the high proportion of Merino-like ancestry in all the Eastern European breeds, except Tsigai, the two populations of Hungarian Hortobagyi Racka, and Romanov. Although this analysis has some limitations in terms of intuitiveness, as it is a two-dimensional (2D) representation of a three-dimensional 3D model, it still allowed us to identify some well-known relationships among our breeds such as BAT, FIN, and ROM. However, there are some noticeable differences compared with the admixture analysis.
One more important aspect in the investigation of breed origin and formation is observing the fluctuation in the effective population size over time through the assessment of the historical
After checking for convergence (Supplementary Figure S10), we built the graph. Figure 6 illustrates the consensus admixture graph at 90% posterior probability. In this graph, where the branch length is proportional to the distance among breeds, four main clusters can be identified, each labelled within dashed rectangles. The first cluster, shown in red, represents the Eastern European breeds. This cluster is the closest to the root and well-separated from the others (except Ouessant, OUE). The second cluster, shown in dark blue corresponds to the Swedish cluster; this cluster is connected to the root via two distant internal nodes (n25 and n24). The internal node where the OUE branches off (n21) connects to the whole British Isles group (third cluster, in green). A more internal, intermediate node (a14), which links both the second and third clusters, connects them to a fourth cluster (in light blue), which is represented by the Norwegian samples. Interestingly, the same node (a14) gives rise to the Icelandic and the primitive Scottish breeds, which appear to share a common ancestry. Separated by long branches and located at the end of the admixture graph, we retrieve the Danish populations. Figure 6 shows the admixture graph, which provides more information, visibly highlighting the different derivations of TSI, CIK, and HRR/RAK (red dashed square) and the ancient origin of the French OUE, followed by the Swedish DAL. In our graph, the position also suggests its role as the “ancient contributor” between the Nordic and the Western genetic components and a cryptic proximity to the Hungarian and the Polish breed GOR; this outcome should be interpreted with caution. In addition, while some admixture events (highlighted in purple in Figure 6) indicate recent introgressions (e.g., BAT and NWI), others are not documented or previously detected (e.g., SWH). Moreover, the ROM breed showed “Nordic” genetic contributions from Sweden and Finland. Supplementary Materialsl (datasheets 1 and 2) contains the consensus admixture graphs obtained at 75% (datasheet 1) and 95% (datasheet 2) posterior probability, which identify the same admixed populations.
Figure 6. Consensus admixture graph at 90% of posterior probability as inferred by AdmixtureBayes. Each leaf corresponds to a population/breed (breed codes as shown in Supplementary Table 1). Leaves, which are the product of an admixture event, are shown in purple. Otherwise, the leaves are light blue. Internal nodes are green. Shades of green show the posterior probability (numbers in parentheses) of the true graph having a node with the same descendants. Darker shades of green correspond to higher probabilities. The root is shown in black.
By integrating a multidisciplinary approach that connects archaeological findings, zooarchaeological remains, historical written sources, and modern genomic data, researchers can explore a broader range of aspects related to sheep domestication. This comprehensive approach allows for a deeper understanding of the intricate web of interactions and migrations between different populations and the underlying forces that prompted these movements. For instance, one such driving factor could be production objectives, as observed in the spread of woollier sheep breeds during the revolution of secondary products, when sheep began to be valued not only for meat but also for wool. This kind of analysis provides valuable insights into the complex processes and societal influences that shaped the development and movement of sheep populations across various regions and time periods (Macheridis, 2022; Sabatini et al., 2019). However, despite these advancements, the demographic history of sheep, particularly European breeds, remains an ongoing area of study, with uncertainties surrounding the precise timing and nature of key events. For instance, questions still remain about the origin and divergence of haplotypes, the distribution of various lineages, and whether the gene flow between populations was bidirectional or unidirectional (Miranda et al., 2023). This complexity is especially pronounced in the regions of Central and Eastern Europe, which have long served as a bridge between Europe and Asia. These areas have been shaped by the convergence of numerous cultures, economies, and trading networks and have seen the development and decline of empires, wars, and migrations over centuries. Such a rich historical backdrop has left a profound mark on the genetic landscape of sheep in these regions, making it an essential focal point for understanding the broader patterns of domestication and breed evolution. Therefore, in the present study, our main objective was to genetically characterise four indigenous Hungarian breeds, one of which was recently created, while the others have a more ancient background and complex ancestry, by analysing their population structure and admixture, as well as their relatedness to neighbouring breeds. Since Hungary has been the convergence point of different expansion and trading routes (such as “the Varangians to the Greek,” a mediaeval trade route that connected Scandinavia, Kievan Rus, and the Eastern Roman Empire), we explored the pattern of admixture, searching for some trace of Hungarian ancestry in Scandinavian sheep or vice versa. Hungary’s central position in these historical networks makes it a key region for studying potential genetic exchanges between distant populations.
Our results on descriptive statistics (heterozygosities, fixation index, and genomic inbreeding coefficient) for the dataset including all breeds align with previous SNP array studies. Hungarian sheep populations have predominantly been studied using mitochondrial DNA, cytochrome B and D-loop regions, and microsatellite markers (Gáspárdy et al., 2021; 2022; Kusza et al., 2008; 2011). Although research on SNPs to analyse the diversity of Hungarian indigenous sheep has been limited, with one notable study by Zsolnai et al. (2021), further publications on this topic are currently in progress. Zsolnai and colleagues (2021) examined the genetic status of lowland-type Racka sheep in both black and white colour variants. Our results from the ROH analysis for the two populations closely align with those reported for the white variant by Zsolnai et al. (2021). Additionally, their study highlighted a severe population bottleneck in the breed following World War II, with gradual recovery beginning after 1983, largely due to efforts by the association of Hungarian Racka breeders (Veress et al., 2002). Our historical analysis of
The significant reduction in the population size among Hungarian breeds and the introgression of foreign alleles is a situation common to many traditional breeds worldwide. This decline has primarily been driven by changes in breeding objectives and the need for higher production to meet increasing demand (Sabatini et al., 2019). Additionally, historical events such as the World Wars have further exacerbated this decline in Hungary, leading to a lasting impact on local sheep and other livestock populations. As a result, several Hungarian breeds, historically exploited for triple purposes and widespread across Europe since the early Middle Ages, were unable to compete with more productive British and Southern European breeds and were gradually replaced (Bodó, 1994; Deng et al., 2020; Gáspárdy et al., 2021). The preservation of these breeds now depends on non-commercial initiatives, with their survival maintained by small, fragmented populations in a few flocks. This situation contributes to the high levels of inbreeding detected in this study and previous studies. Specifically, the analysis of both contemporary and historical population sizes and the distribution of ROH across length classes for the Cikta (CIK) and Tsigai (TSI) breeds suggest a decrease in population size and subsequent inbreeding, which is consistent with historical bottlenecks, a trend already observed in previous studies based on STR and mtDNA data (Gáspárdy et al., 2021; 2022; Kovács et al., 2019). Recent or past demographic events, insufficient resolution due to the limitation of the dataset, and other factors can obscure signals of the gene flow. However, several interesting findings emerged from the admixture graph (Figure 6). First, the most striking result is the distinct separation of both studied Hortobagyi Racka populations from other Hungarian and Eastern European breeds, as supported by cluster analyses, network analyses, and the admixture graph itself. This analysis also highlights the different origins of the Cikta and Tsigai breeds, which reflect their evolution from distinct sheep types. Second, the analysis suggested a more ancient origin for the Ouessant and Swedish Dalapäls breeds, while other Swedish breeds appear to have more recent origins. The breeds labelled in purple on the admixture graph are “breeds with mixed origins,” as identified by the analysis. For instance, the Bábolna Tetra (BAT) is a synthetic breed developed relatively recently through the crossbreeding of Merino sheep with the hardy Romanov and Finnish Landrace breeds (Neubauer et al., 2015). The Norwegian white Spaelsau (NWI) breed, officially formed in 2000, was developed through the crossbreeding of several Northern European breeds (Oliveira et al., 2020). The Valachian, an autochthonous breed from Slovakia, was crossbred with English mutton breeds in the mid-20th century to improve productivity while retaining rustic traits (Mészárosová et al., 2022). These mixed origins are well-documented, and the analysis successfully identified them. Notably, the Romanov population also appears to have a mixed origin, descending from a Swedish ancestor and sharing a common ancestor with the Finnsheep. This finding is consistent with historical records, which describe the Romanov as an independent branch of Northern European short-tailed breeds (Ivanov, 1935). The shared ancestry between Romanov, Norwegian Spaelsau, and Finnsheep has been suggested in earlier studies (Deniskova et al., 2018; Kijas et al., 2012; Ryder, 1983; Tapio et al., 2006). The Romanov breed, known for its extraordinary prolificacy, is the only short, thin-tailed breed in Russia and is bred globally for its valuable traits (Deniskova et al., 2018; Veniaminov, 1984). Although the programme is a robust tool to infer and visualise quite complex admixture histories in populations, the results have to be carefully interpreted by taking into consideration some limitations associated with our dataset and those associated with the programme itself. For example, the limited number of populations/breeds in the dataset is a balance between the newly genotyped data, the genotypes already available from publicly available databases that meet our requirements, and the computational effort, time consumption, and scalability issues of the inference. When the number of populations increases, the complexity of the admixture graph increases exponentially, making the process computationally more expensive, slow, and difficult to infer. Despite these limitations, the programme performed well by generating a topology that correctly identified admixed breeds when choosing a more stringent threshold for the posterior probability (Supplementary Material, datasheets 1 and 2). Overall, the analyses of individual genetic components, such as NeighbourNet, NetView, and AdmixtureBayes, highlight the gene flow between Hungarian long-tailed sheep and Nordic short-tailed sheep. The NeighbourNet and the associated metrics, which quantified the degree of reticulation of the network, allowed us to identify signals of genetic exchange that cannot be captured by simple tree-based models. These reticulations visually suggest genetic blending, admixture events, shared ancestry, and other evolutionary dynamics, enabling a deeper understanding of breed history and the genetic impact of interbreeding. However, its complexity requires a careful interpretation since assumptions about the nature and timing of the gene flow or other demographic events, such as population decline or expansion, may impact results. We detected a weak but identifiable Finnish and Norwegian genetic influence in the Cikta breed (Figure 5) and traces of the older Hungarian Racka in both Finnsheep and Romanov breeds (Figure 4, K = 8), suggesting bidirectional genetic exchanges between these populations. Investigating the paternal origins of several sheep breeds, Deng et al. (2020) identified migratory events responsible for the development of five lineages (y-HA, y-HB1 and 2, y-HC, and y-HD). Haplotype y-HC, present in Scandinavia, Western Russia, England, and at low frequencies in Spain, most likely spread early with primitive Northern European sheep (with a hair coat). A later wave, associated with y-HC-carrying English breeds specialised in meat production, became transboundary in the last five centuries (Meadows et al., 2004; Kijas et al., 2012). Archaeological evidence supports this expansion model, suggesting that fat-tailed sheep carrying haplotype y-HB spread approximately 3,400 years ago (Deng et al., 2020).
Based on archaeological and genetic evidence, Sabatini et al. (2019) proposed another wave of sheep expansion during the Bronze Age, introducing woollier animals from the Middle East to continental Europe. Százhalombatta-Földvár, located on the right side of the Danube approximately 30 km south of Budapest, is one of the largest temperate settlements from the Bronze Age in Central Europe. The sheep remains discovered at this archaeological site, dated to 1500 BCE, indicate that Hungary was among the earliest regions for fine wool production. This finding could explain the presence of Hungarian alleles in some short-tailed sheep breeds, possibly resulting from the introduction of new, woollier animals. On the other hand, the Nordic genetic influence in Cikta may reflect an ancient gene flow, possibly linked to Varangian migrations across Eastern Europe. Another possibility is a more recent introgression of Finnsheep, a breed commonly used to enhance fertility in others (Branford Oltenacu and Boylan, 1981), although there is no recorded use of Finnsheep in Cikta breeding. Although these genetic remnants linked to human activity may be obscured by crossbreeding, bottlenecks, and selection (Gopalan et al., 2022), the study of human–animal relationships continues to shed light on unresolved historical events (Manunza et al., 2023; Miranda et al., 2023).
This study exemplifies the challenges of dating specific genetic events, such as bidirectional gene flow, admixture, and haplotype divergence, as well as the difficulty in providing a historical context for these findings. The lack of written documentation for certain historical periods often obscures the origins and evolution of local breeds. This is particularly true for Hungary’s autochthonous Hortobagyi Racka sheep, whose origins remain unknown due to limited historical records, especially during the Turkish occupation (1,552–1,693) (Gáspárdy et al., 2022, Yunusbayev et al., 2015). Overall, our results not only offer new insights into the history, evolution, and relationships of Hungarian breeds with neighbouring populations, but they also raise further questions about the cultural, political, and biological processes that have shaped human societies. New discoveries challenge certain historical assumptions, particularly regarding the influence of Scandinavians (mainly Varangians) on the formation of the Hungarian nation and their cultural exchanges with the Magyars, ancestors of present-day Hungarians. The Varangians–warriors, traders, and settlers primarily from present-day Sweden–also ventured into Russia, Ukraine, and the Byzantine Empire from the 10th to 14th centuries (Gleason, 2009; Mägi, 2018; Price et al., 2019). It is believed that these Varangians spread northern sheep breeds during their travels. Excavations at the Viking Age settlement of Birka in southeastern Sweden have uncovered artefacts of possible Magyar origin, suggesting deeper connections (Benedict, 2009; Hedenstierna-Jonson, 2009). Additionally, Hungarian coins found in Viking Age contexts in Eastern Scandinavia hint at possible trading contacts, suggesting a Varangian presence in Magyar territory (Tentiuc, 2020). Combined evidence from graves, settlement layers, folk traditions, and linguistic studies (Moravcsik, 1946; Hedenstierna-Jonson, 2009; 2012; Mägi, 2018) suggests that Magyar–Varangian cultural exchanges went beyond mere trade or warfare.
In the 9th century, the Varangians migrated to Eastern Europe and established Kievan Rus, an early East Slavic state. The term “Rus” is considered to derive from the Varangians, highlighting their significant role in its foundation. As the Rus integrated with local Slavic populations, they developed a unique cultural and political identity. They controlled vital trade routes connecting Scandinavia to the Byzantine Empire, facilitating economic exchange and cultural interactions with neighbouring regions, including Hungary. During the 10th and 11th centuries, the Hungarians were also forming their identity in the Carpathian Basin. The arrival of the Hungarians coincided with Rus’s expansion, leading to potential interactions. Overall, the connections between the Varangians, the Rus, and early Hungarian rulers underscore a complex interplay of trade, diplomacy, and cultural exchange that was vital in shaping the identities of both the Rus and the emerging Hungarian state.
Our study aimed to provide historical context for these genetic findings by discovering clues such as introgression events that may correlate with large-scale human movements or mechanisms like trade, tribute, or the introduction of breeding stock. Although our analysis of four local Hungarian sheep breeds provides valuable insights, future studies incorporating the remaining native breeds would be crucial for understanding the complete genetic diversity of sheep in Hungary as these breeds reflect distinct historical migration patterns and adaptations in the Carpathian Basin. Moreover, although it is tempting to relate these findings to known historical events and the resulting assumptions, more in-depth analyses are needed to formally test our hypothesis, possibly exploiting ancient DNA information and using more refined methods to better understand the historical demography of the breeds under study.
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: http://webserver.ibba.cnr.it/smarter/, https://doi.org/10.46471/gigabyte.139 (Cozzi et al., 2024).
The studies involving animal participants were reviewed and approved by the Committee for Animal Welfare at Work of the University of Debrecen 19/2023/DEMÁB. Written informed consent was obtained from the owners for the participation of their animals in this study.
JR-D: formal analysis, investigation, and writing–review and editing. TB: investigation, and writing–review and editing. BG: methodology, supervision, writing–review and editing. AAS: writing–review and editing. PC: data curation and writing–review and editing. SK: project administration and writing–review and editing. GS: writing–review and editing. AS: project administration, supervision, writing–original draft, and writing–review and editing. AM: conceptualization, methodology, project administration, supervision, writing–original draft, and writing–review and editing.
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This project was funded by the European Union’s Horizon 2020 Research and Innovation Programme under grant agreement no. 772787 (SMARTER).
In loving memory of Carole Moreno, the authors would like to thank István Monori and János Oláh for the collection of blood samples.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2025.1507315/full#supplementary-material
Ajmone-Marsan, P., Boettcher, P. J., Colli, L., Ginja, C., Kantanen, J., and Lenstra, J. A. (2023). Genomic characterization of animal genetic resources – practical guide. FAO Animal Prod. Health. doi:10.4060/cc3079en
Alexander, D. H., Novembre, J., and Lange, K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19 (9), 1655–1664. doi:10.1101/gr.094052.109
Alvarado Bremer, J. R., Viñas, J., Mejuto, J., Ely, B., and Pla, C. (2005). Comparative phylogeography of Atlantic bluefin tuna and swordfish: the combined effects of vicariance, secondary contact, introgression, and population expansion on the regional phylogenies of two highly migratory pelagic fishes. Mol. Phylogenetics Evol. 36 (1), 169–187. doi:10.1016/j.ympev.2004.12.011
Astuti, P. K., Bagi, Z., Bodrogi, L., Pintér, T., Skoda, G., Fajardo, R., et al. (2023a). Hungarian indigenous Tsigai, a promising breed for excellent heat tolerance and immunity. Saudi J. Biol. Sci. 30 (9), 103747. doi:10.1016/j.sjbs.2023.103747
Astuti, P. K., Gavojdian, D., Ilie, D. E., Wanjala, G., Monori, I., Bagi, Z., et al. (2023b). Genetic polymorphism in European and African sheep breeds reared in Hungary based on 48 SNPs associated with resistance to gastrointestinal parasite infection using KASP-PCR technique. Trop. Animal Health Prod. 55 (3), 197. doi:10.1007/s11250-023-03609-0
Bakucs, Z., Fertő, I., and Vígh, E. (2020). Crop productivity and climatic conditions: evidence from Hungary. Agric. Switz. 10 (9), 421–512. doi:10.3390/agriculture10090421
Barbato, M., Hailer, F., Orozco-Terwengel, P., Kijas, J., Mereu, P., Cabras, P., et al. (2017). Genomic signatures of adaptive introgression from European mouflon into domestic sheep. Sci. Rep. 7 (1), 7623. doi:10.1038/s41598-017-07382-7
Benedict, B. (2009). “Varangians and their origins,” in The Varangians of byzantium (Cambridge University Press), 1–14. doi:10.1017/CBO9780511562365
Beynon, S. E., Slavov, G. T., Farré, M., Sunduimijid, B., Waddams, K., Davies, B., et al. (2015). Population structure and history of the Welsh sheep breeds determined by whole genome genotyping. BMC Genet. 16 (1), 65. doi:10.1186/s12863-015-0216-x
Biscarini, F., Marras, G., Cozzi, P. A., and Gaspa, G. (2018). detectRUNS: an R package to detect runs of homozygosity and heterozygosity in diploid genomes. Available online at: https://cran.r-project.org/web/packages/detectRUNS/index.html.
Bodó, I. (1994). The Hungarian racka. Anim. Genet. Resources/Resources génétiques animales/Recursos genéticos Anim. 13, 75–82. doi:10.1017/S1014233900000304
Branford Oltenacu, E. A., and Boylan, W. J. (1981). Productivity of purebred and crossbred Finnsheep. I. Reproductive traits of ewes and lamb survival. J. Animal Sci. 52 (5), 989–997. doi:10.2527/jas1981.525989x
Ceccobelli, S., Landi, V., Senczuk, G., Mastrangelo, S., Sardina, M. T., Ben-Jemaa, S., et al. (2023). A comprehensive analysis of the genetic diversity and environmental adaptability in worldwide Merino and Merino-derived sheep breeds. Genet. Sel. Evol. 55 (1), 24. doi:10.1186/s12711-023-00797-z
Chang, C. C., Chow, C. C., Tellier, L. C. A. M., Vattikuti, S., Purcell, S. M., and Lee, J. J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4 (1), 7. doi:10.1186/s13742-015-0047-8
Ciani, E., Mastrangelo, S., Da Silva, A., Marroni, F., Ferenčaković, M., Ajmone-Marsan, P., et al. (2020). On the origin of European sheep as revealed by the diversity of the Balkan breeds and by optimizing population-genetic analysis tools. Genet. Sel. Evol. 52 (1), 25. doi:10.1186/s12711-020-00545-7
Cozzi, P., Manunza, A., Ramirez-Diaz, J., Tsartsianidou, V., Gkagkavouzis, K., Peraza, P., et al. (2024). SMARTER-database: a tool to integrate SNP array datasets for sheep and goat breeds. GigaByte 2024, gigabyte139. doi:10.46471/gigabyte.139
Deng, J., Xie, X. L., Wang, D. F., Zhao, C., Lv, F. H., Li, X., et al. (2020). Paternal origins and migratory episodes of domestic sheep. Curr. Biol. 30 (20), 4085–4095. doi:10.1016/j.cub.2020.07.077
Deniskova, T. E., Dotsev, A. V., Selionova, M. I., Kunz, E., Medugorac, I., Reyer, H., et al. (2018). Population structure and genetic diversity of 25 Russian sheep breeds based on whole-genome genotyping. Genet. Sel. Evol. 50 (1), 29. doi:10.1186/s12711-018-0399-5
Drăgănescu, C., and Grosu, H. (2010). Valachian (Zackel) heritagephiletic sheep group - a taxonomic problem. Scientific Papers. Rom. Acad. 1, 1–8.
Dudu, A., Ghiţă, E., Costache, M., and Georgescu, S. E. (2016). Origin and genetic diversity of Romanian Racka sheep using mitochondrial markers. Small Ruminant Res. 144, 276–282. doi:10.1016/j.smallrumres.2016.10.016
Ellis, C. (2021). Remembering the Vikings: ancestry, cultural memory and geographical variation. Hist. Compass 19 (4). doi:10.1111/hic3.12652
Garcia-Erill, G., and Albrechtsen, A. (2020). Evaluation of model fit of inferred admixture proportions. Mol. Ecol. Resour. 20 (4), 936–949. doi:10.1111/1755-0998.13171
Gáspárdy, A., Berger, B., Zabavnik-Piano, J., Kovács, E., Annus, K., Zenke, P., et al. (2021). Comparison of mtDNA control region among descendant breeds of the extinct Zaupel sheep revealed haplogroup C and D in Central Europe. Veterinary Med. Sci. 7 (6), 2330–2338. doi:10.1002/vms3.585
Gáspárdy, A., Zenke, P., Kovács, E., Annus, K., Posta, J., Sáfár, L., et al. (2022). Evaluation of maternal genetic background of two Hungarian autochthonous sheep breeds coming from different geographical directions. Animals 12 (3), 218. doi:10.3390/ani12030218
Gopalan, S., Smith, S. P., Korunes, K., Hamid, I., Ramachandran, S., and Goldberg, A. (2022). Human genetic admixture through the lens of population genomics. Philosophical Trans. R. Soc. B Biol. Sci. 377 (1852), 20200410. doi:10.1098/rstb.2020.0410
Gray, R. D., Bryant, D., and Greenhill, S. J. (2010). On the shape and fabric of human history. Philosophical Trans. R. Soc. B Biol. Sci. 365 (1559), 3923–3933. doi:10.1098/rstb.2010.0162
Hedenstierna-Jonson, C. (2012). “Magyar material culture in the Swedish viking Age context of Birka,” in Die Archäologie der frühen Ungarn. Chronologie, Technologie und Methodik. Internationaler Workshop des Archäologischen Instituts der Ungarischen Akademie der Wissenschaften und des Römisch-Germanischen Zentralmuseums Mainz am 4. und 5. Dezember 2009. Editor T. Bengeuz (RGZM-Tagungen 17. Mainz: Römisch-Germanisches Zentralmuseum), 29–48.
Heinz, M. N., Gáspárdy, A., and Prof, A. (2021). Comparison of two Zaupel Sheep descendants, Cikta and Bovec, based on their mtDNA CR sequences.
Holland, B. R., Huber, K. T., Dress, A., and Moulton, V. (2002). Delta plots: a tool for analyzing phylogenetic distance data. Mol. Biol. Evol. 19 (12), 2051–2059. doi:10.1093/oxfordjournals.molbev.a004030
Huson, D. H., and Bryant, D. (2006). Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 23 (2), 254–267. doi:10.1093/molbev/msj030
Kawęcka, A., Pasternak, M., Miksza-Cybulska, A., and Puchała, M. (2022). Native sheep breeds in Poland—importance and outcomes of genetic resources protection programmes. Animals 12 (12), 1510. doi:10.3390/ani12121510
Kijas, J. W., Lenstra, J. A., Hayes, B., Boitard, S., Neto, L. R., Cristobal, M. S., et al. (2012). Genome-wide analysis of the world’s sheep breeds reveals high levels of historic mixture and strong recent selection. PLoS Biol. 10 (2), e1001258. doi:10.1371/journal.pbio.1001258
Kovács, E., Tempfli, K., Shannon, A., Zenke, P., Maróti-Agóts, Á., Sáfár, L., et al. (2019). STR diversity of a historical sheep breed bottlenecked, the cikta. J. Anim. Plant Sci 29 (Issue 1).
Kusza, S., Gyarmathy, E., Dubravska, J., Nagy, I., Jávor, A., and Kukovics, S. (2009). Study of genetic differences among Slovak Tsigai populations using microsatellite markers. Czech J. Anim. Sci. 54 (10), 468–474. doi:10.17221/1670-cjas
Kusza, S., Ivankovic, A., Ramljak, J., Nagy, I., Jávor, A., and Kukovics, S. (2011). Genetic structure of Tsigai, ruda, pramenka and other local sheep in southern and eastern Europe. Small Ruminant Res. 99 (2–3), 130–134. doi:10.1016/j.smallrumres.2011.03.059
Kusza, S., Nagy, I., Németh, T., Molnár, A., Jávor, A., and Kukovics, S. (2010). The genetic variability of Hungarian Tsigai sheep. Arch. Tierz. 53 (3), 309–317. doi:10.5194/aab-53-309-2010
Kusza, S., Nagy, I., Sasvári, Z., Stágel, A., Németh, T., Molnár, A., et al. (2008). Genetic diversity and population structure of Tsigai and Zackel type of sheep breeds in the Central-Eastern- and Southern-European regions. Small Ruminant Res. 78 (1–3), 13–23. doi:10.1016/j.smallrumres.2008.04.002
Lachance, J., and Tishkoff, S. A. (2013). SNP ascertainment bias in population genetic analyses: why it is important, and how to correct it. BioEssays 35 (9), 780–786. doi:10.1002/bies.201300014
Lischer, H. E. L., and Excoffier, L. (2012). PGDSpider: an automated data conversion tool for connecting population genetics and genomics programs. Bioinformatics 28 (2), 298–299. doi:10.1093/bioinformatics/btr642
Loukovitis, D., Szabó, M., Chatziplis, D., Monori, I., and Kusza, S. (2023). Genetic diversity and substructuring of the Hungarian merino sheep breed using microsatellite markers. Anim. Biotechnol. 34 (4), 1701–1709. doi:10.1080/10495398.2022.2042307
Mägi, M. (2018). “The role of the eastern baltic in viking Age communication across the baltic sea,” in Austrvegr (Brill; Leiden: The Northern World).
Manunza, A., Ramirez-Diaz, J., Cozzi, P., Lazzari, B., Tosser-Klopp, G., Servin, B., et al. (2023). Genetic diversity and historical demography of underutilised goat breeds in North-Western Europe. Sci. Rep. 13 (1), 20728. doi:10.1038/s41598-023-48005-8
Meadows, J. R. S., Hawken, R. J., and Kijas, J. W. (2004). Nucleotide diversity on the ovine Y chromosome. Anim. Genet. 35 (5), 379–385. doi:10.1111/j.1365-2052.2004.01180.x
Mészárosová, M., Mészáros, G., Moravčíková, N., Pavlík, I., Margetín, M., and Kasarda, R. (2022). Within-and between-breed selection signatures in the original and improved valachian sheep. Animals 12 (11), 1346. doi:10.3390/ani12111346
Milanesi, M., Capomaccio, S., Vajana, E., Bomba, L., Fernando Garcia, J., Ajmone-Marsan, P., et al. (2017). BITE: an R package for biodiversity analyses. doi:10.1101/181610
Miranda, P. M., Soares, A. E. R., and Günther, T. (2023). Demographic reconstruction of the Western sheep expansion from whole-genome sequences. G3 Genes, Genomes, Genet. 13 (11), jkad199. doi:10.1093/g3journal/jkad199
Moravcsik, G. (1946). Byzantine christianity and the Magyars in the period of their migration. Am. Slav. East Eur. Rev. 5 (¾), 29–45. doi:10.2307/2492084
Mujitaba, M. A., Tokár, A., Balogh, E. E., Debnár, V. J., Javkhlan, A., Vásárhelyi, P. B., et al. (2024). In vitro gene conservation status and the quality of the genetic resources of native Hungarian sheep breeds. Vet. Sci. 11, 337. doi:10.3390/vetsci11080337
Neubauer, V., Vogl, C., Seregi, J., Sáfár, L., and Brem, G. (2015). Genetic diversity and population structure of Zackel sheep and other Hungarian sheep breeds. Arch. Tierz. 58 (2), 343–350. doi:10.5194/aab-58-343-2015
Neuditschko, M., Khatkar, M. S., and Raadsma, H. W. (2012). NetView: a high-definition network-visualization approach to detect fine-scale population structures from genome-wide patterns of variation. PLoS ONE 7 (10), e48375. doi:10.1371/journal.pone.0048375
Nicolazzi, E. L., Caprera, A., Nazzicari, N., Cozzi, P., Strozzi, F., Lawley, C., et al. (2015). SNPchiMp v.3: integrating and standardizing single nucleotide polymorphism data for livestock species. BMC Genomics 16 (1), 283. doi:10.1186/s12864-015-1497-1
Nielsen, S. V., Vaughn, A. H., Leppälä, K., Landis, M. J., Mailund, T., and Nielsen, R. (2023). Bayesian inference of admixture graphs on Native American and Arctic populations. PLoS Genet. 19 (2), e1010410. doi:10.1371/journal.pgen.1010410
Novo, I., Ordás, P., Moraga, N., Santiago, E., Quesada, H., and Caballero, A. (2023). Impact of population structure in the estimation of recent historical effective population size by the software GONE. Genet. Sel. Evol. 55 (1), 86. doi:10.1186/s12711-023-00859-2
Oliveira, H. R., McEwan, J. C., Jakobsen, J., Blichfeldt, T., Meuwissen, T., Pickering, N., et al. (2020). Genetic connectedness between Norwegian white sheep and New Zealand composite sheep populations with similar development history. Front. Genet. 11, 371. doi:10.3389/fgene.2020.00371
Patterson, N., Moorjani, P., Luo, Y., Mallick, S., Rohland, N., Zhan, Y., et al. (2012). Ancient admixture in human history. Genetics 192 (3), 1065–1093. doi:10.1534/genetics.112.145037
Price, T. D., Moiseyev, V., and Grigoreva, N. (2019). Vikings in Russia: origins of the medieval inhabitants of Staraya Ladoga. Archaeol. Anthropol. Sci. 11 (11), 6093–6109. doi:10.1007/s12520-019-00897-2
Prieur, V., Clarke, S. M., Brito, L. F., McEwan, J. C., Lee, M. A., Brauning, R., et al. (2017). Estimation of linkage disequilibrium and effective population size in New Zealand sheep using three different methods to create genetic maps. BMC Genet. 18 (1), 68. doi:10.1186/s12863-017-0534-2
Rannamäe, E., Saarma, U., Kantanen, J., and Bläuer, A. (2023). Maternal genetic diversity of ancient goats in Finland and Estonia and comparison with extant northern European goat breeds. Anim. Genet. 54 (2), 177–188. doi:10.1111/age.13281
R Core Team (2018). R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Available at: https://www.R-project.org/
Rochus, C. M., Jonas, E., and Johansson, A. M. (2020). Population structure of five native sheep breeds of Sweden estimated with high density SNP genotypes. BMC Genet. 21 (1), 27. doi:10.1186/s12863-020-0827-8
Ryman, N., Laikre, L., and Hössjer, O. (2019). Do estimates of contemporary effective population size tell us what we want to know? Mol. Ecol. 28 (8), 1904–1918. doi:10.1111/mec.15027
Sabatini, S., Bergerbrant, S., Brandt, L., Margaryan, A., and Allentoft, M. E. (2019). Approaching sheep herds origins and the emergence of the wool economy in continental Europe during the Bronze Age. Archaeol. Anthropol. Sci. 11 (9), 4909–4925. doi:10.1007/s12520-019-00856-x
Santiago, E., Caballero, A., Köpke, C., and Novo, I. (2024). Estimation of the contemporary effective population size from SNP data while accounting for mating structure. Mol. Ecol. Resour. 24 (1), e13890. doi:10.1111/1755-0998.13890
Santiago, E., Novo, I., Pardiñas, A. F., Saura, M., Wang, J., and Caballero, A. (2020). Recent demographic history inferred by high-resolution analysis of linkage disequilibrium. Mol. Biol. Evol. 37 (12), 3642–3653. doi:10.1093/molbev/msaa169
Schönherz, A. A., Szekeres, B. D., Nielsen, V. H., and Guldbrandtsen, B. (2020). Population structure and genetic characterization of two native Danish sheep breeds. Acta Agric. Scand. A Anim. Sci. 69 (1–2), 53–67. doi:10.1080/09064702.2019.1639804
Steinig, E. J., Neuditschko, M., Khatkar, M. S., Raadsma, H. W., and Zenger, K. R. (2016). Netview p: a network visualization tool to unravel complex population structure using genome-wide SNPs. Mol. Ecol. Resour. 16 (1), 216–227. doi:10.1111/1755-0998.12442
Tapio, M., Marzanov, N., Ozerov, M., Ćinkulov, M., Gonzarenko, G., Kiselyova, T., et al. (2006). Sheep mitochondrial DNA variation in European, caucasian, and central asian areas. Mol. Biol. Evol. 23 (9), 1776–1783. doi:10.1093/molbev/msl043
Tentiuc, I. (2020). On the Viking enclaves and their relations with the inhabitants of the Carpathian – dniester region between the 9th and the 11th centuries. Dacia N.S. LXII-LXIII, 2018-2019, 2020, 249–286.
Veress, L., Dunka, B., and Oláh, J. (2002). “Ősi magyar juhaink megmentése (Saving our ancient Hungarian sheep),” in Kutatási eredmények régi háziállatfajták értékeiről Gene conservation: research results on the values of old farm animals, 161–165.
Wanjala, G., Astuti, P., Bagi, Z., Kichamu, N., Strausz, P., and Kusza, S. (2023). Assessing the genomics structure of dorper and white dorper variants, and dorper populations in South Africa and Hungary. Biol. (Basel) 12 (3), 386–411. doi:10.3390/biology12030386
Yunusbayev, B., Metspalu, M., Metspalu, E., Valeev, A., Litvinov, S., Valiev, R., et al. (2015). The genetic legacy of the expansion of turkic-speaking nomads across Eurasia. PLoS Genet. 11 (4), e1005068. doi:10.1371/journal.pgen.1005068
Yurtman, E., Özer, O., Yüncü, E., Dağtaş, N. D., Koptekin, D., Çakan, Y. G., et al. (2021). Archaeogenetic analysis of Neolithic sheep from Anatolia suggests a complex demographic history since domestication. Commun. Biol. 4 (1), 1279. doi:10.1038/s42003-021-02794-8
Zhao, Y. X., Yang, J., Lv, F. H., Hu, X. J., Xie, X. L., Zhang, M., et al. (2017). Genomic reconstruction of the history of native sheep reveals the peopling patterns of nomads and the expansion of early Pastoralism in East Asia. Mol. Biol. Evol. 34 (9), 2380–2395. doi:10.1093/molbev/msx181
Zheng, X., Levine, D., Shen, J., Gogarten, S. M., Laurie, C., and Weir, B. S. (2012). A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28 (24), 3326–3328. doi:10.1093/bioinformatics/bts606
Keywords: admixture, demography, genetic structure, Hungarian sheep, local sheep breeds
Citation: Ramírez-Díaz J, Bobbo T, Guldbrandtsen B, Schönherz AA, Cozzi P, Kusza S, Sahana G, Stella A and Manunza A (2025) Exploring the complex population structure and admixture of four local Hungarian sheep breeds. Front. Genet. 16:1507315. doi: 10.3389/fgene.2025.1507315
Received: 07 October 2024; Accepted: 17 February 2025;
Published: 19 March 2025.
Edited by:
Jesús Fernández, Instituto Nacional de Investigación y Tecnología Agroalimentaria (INIA), SpainReviewed by:
Maria Cristina Cozzi, University of Milan, ItalyCopyright © 2025 Ramírez-Díaz, Bobbo, Guldbrandtsen, Schönherz, Cozzi, Kusza, Sahana, Stella and Manunza. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Szilvia Kusza, a3VzemFzekBob3RtYWlsLmNvbQ==; Alessandra Stella, YWxlc3NhbmRyYS5zdGVsbGFAaWJiYS5jbnIuaXQ=
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.