Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 28 March 2022
Sec. Evolutionary and Population Genetics

Genome-Wide Marker Data-Based Comparative Population Analysis of Szeklers From Korond, Transylvania, and From Transylvania Living Non-Szekler Hungarians

  • 1Department of Medical Genetics, Medical School, University of Pécs, Pécs, Hungary
  • 2Szentágothai Research Centre, University of Pécs, Pécs, Hungary
  • 3Department of Laboratory Medicine, Medical School, University of Pécs, Pécs, Hungary
  • 4National Institute of Oncology, Budapest, Hungary

Genome-wide genotype data from 48 carefully selected population samples of Transylvania-living Szeklers and non-Szekler Hungarians were analyzed by comparative analysis. Our analyses involved contemporary Hungarians living in Hungary, other Europeans, and Eurasian samples counting 530 individuals altogether. The source of the Szekler samples was the commune of Korond, Transylvania. The analyzed non-Szekler Hungarian samples were collected from villages with a history dating back to the era of the Árpád Dynasty. Population structure by principal component analysis and ancestry analysis also revealed a great within-group similarity of the analyzed Szeklers and non-Szekler Transylvanian Hungarians. These groups also showed similar genetic patterns with each other. Haplotype analyses using identity-by-descent segment discovering tools showed that average pairwise identity-by-descent sharing is similar in the investigated populations, but the Korond Szekler samples had higher average sharing with the Hungarians from Hungary than non-Szekler Transylvanian Hungarians. Average sharing results showed that both groups are isolated compared to other Europeans, and pointed out that the non-Szekler Transylvanian Hungarian inhabitants of the investigated Árpád Age villages are more isolated than investigated Szeklers from Korond. This was confirmed by our autozygosity analysis as well. Identity-by-descent segment analyses and 4-population tests also confirmed that these Hungarian-speaking Transylvanian ethnic groups are strongly related to Hungarians living in Hungary.

Introduction

Transylvania is a historical region located in today’s Romania, which is bordered on the east and south by the Carpathian Mountains and the Apuseni Mountains on the west. From the 1st century BC until the 2nd century AD, it was the center of the Kingdom of Dacia. Transylvania was under the control of various tribes residing in the Carpathian basin between the 3rd and 9th centuries AD. Right before it went under Hungarian control, Transylvania was ruled by the Bulgars, a Slavic ethnic group of the Balkans. After Hungarians conquered the Carpathian basin, it became under partial Hungarian control in 1003 AD (Engel, 2005). The settlement of the Hungarians in Transylvania occurred between the late 10th and 13th centuries, but it is still disputed by historians, since Hungarian artifacts were found in Transylvania also from the first half of the 10th century. This settlement processes resulted in the establishment of Transylvanian Hungarian settlements dating back to the era of the Arpad Dynasty. However, the origin of the Székelys (referred to in English and German as Szeklers, in Romanian Secui, in Latin Siculi) is uncertain and still disputed. After World War I, in 1920, Transylvania was annexed to Romania.

The Szeklers are a Hungarian-speaking, Catholic, Hungarian ethnic group. They declared themselves as a special cultural entity with special origin and history. The first written evidence of Szeklers originates from the 12th century, 1116 AD, and traditionally they derive themselves from the Huns (Hermann, 2004). The territory of the Szeklers was first governed under the leadership of the Count of the Székelys (in Latin: Comes Siculorum), who was at the beginning a royal appointee from the Hungarian nobility and was de facto a margrave, which was the military leader of the borderlands. From the 15th century, the military leaders of Transylvania, the voivodes, held the title themselves. The Szeklers were considered a separate ethnic group (natio Siculica) and were a part of the so-called Unio Trium Nationum (“Union of Three Nations”). This was a coalition of three Transylvanian factions. The official censuses with information on the population of Transylvania have been ad hoc conducted since the 18th century. In 1784, Emperor Joseph II called for the first census of the Habsburg Empire, which included Transylvania (Piaschka, 1975). According to that, which stands even nowadays, the population of Transylvania is mixed, including mainly Romanians, Hungarians, Saxons, and Szeklers (Seton-Watson and William, 1933; Stroschein, 2012).

Korond is a Szekler commune comprising five villages with the administrative central village named Korond. It lies in Hargitha County, Transylvania, Romania, deep in the heart of the Székely Land, and its roots go back with the history of the Szeklers to the 12th century. Nowadays, the Szeklers are one of the largest minorities of Europe, from which the majority, 500–700,000 people are living in the Székely Land, Transylvania (Eberhardt, 2003). Korond is traditionally a genetically isolated area, and according to a 2011 census, the commune had a population count of 6135, of which 94% were Hungarian ethnicities (mostly Szeklers), 2.6% were Roma, and 0.3% were Romanians (National Institute of Statistics, 2011). The Szeklers live in the highest population density in the Székely Land, and their name is possibly derived from a Hungarian expression meaning “frontier guards” since they stationed near the Eastern borders of the 1000 years historical Kingdom of Hungary. As mentioned before, historically, the Szeklers derive themselves from the Huns, as after the death of Attila, several of their tribes left Europe; meanwhile, some of them settled in Transylvania (Barta et al., 1994; Hermann, 2004; Egyed, 2013). The non-Szekler Hungarians are from an area located about 60 km from Korond, from the area of Szászrégen and these villages also are traditionally closed communities.

Here we analyzed 24 Szekler samples collected from Korond (in Romanian: Corund), and 24 samples from people identifying themselves as non-Szekler Hungarians living in Transylvanian villages with known history from the age of the Árpád Dynasty (1000–1301 AD). We used the high-density Illumina Infinium Global Screening Array v.3.0 platform to genotype the DNA samples collected from systematically selected individuals and extracted from whole blood samples. We conducted allele frequency correlation-based population stratification, ancestry estimation, and admixture analysis methods as well as haplotype-based ancestry source and homozygosity-by-descent analysis to lay down the foundations of the characterization of Transylvanian Hungarian ethnicities based on genome-wide autosomal marker data. The aim of this study was to assess the extent of isolation of these ethnic groups in order to characterize the population and also to determine the accuracy of our sampling strategy. We also investigated their relationship to contemporary Hungarians of Hungary.

Materials and Methods

Datasets

Samples from 24 Szekler individuals living in the Transylvanian village of Korond (Transylvania-living Szeklers—TLS) and another 24 samples obtained from Transylvanian villages (Transylvania-living Hungarians—TLH), having documented history since the age of the Árpád Dynasty (1000–1301 AD), were involved in this study. EDTA-anticoagulated whole blood samples were collected from the participants and served as the source of DNA. The samples were stored below −30°C until DNA isolation and genotyping. Sample collection was based on self-declaration, and was carried out by local community leaders and local pastors. The participants were unrelated, and both genders were represented approximately equally. Their age group was 40–80 years old. Samples were collected exclusively from those persons who could declare with high fidelity that they are Szeklers, or Transylvanian non-Szekler Hungarians. All subjects have documented records back to at least two generations, the parents and grandparents of each participant belong to the same ethnic group and were born in the same area as the respective study participant. The Szekler participants were from Korond, while the non-Szekler Hungarians derived from the Szászrégen (traditional Hungarian settlement name, in Romanian: Reghin) area, Maros (Mureş) county. Within a radius of 15 km, the following villages were the sources of the non-Szekler Hungarian samples: Disznajó (Vălenii de Mureş; 11 subjects), Fickópataka (Fiţcău; n = 5), Marosjára (Iara de Mureş, n = 6), and Magyaró (Comuna Aluniș, n = 2).

A total of 48 samples of Hungarian-speaking ethnic groups from Transylvania were genotyped on the Illumina Infinium Global Screening Array platform (725,831 SNPs). DNA isolation, genotyping, and preliminary quality control were carried out by a third-party service provider (Human Genomics Facility) on the University of Rotterdam in the Netherlands. Additional quality control and data preparation for population genetic analyses were performed domestically applying PLINK1.9 and 2.0 software packages (Purcell et al., 2007; Chang et al., 2015). Illumina identifiers were replaced by reference SNP cluster IDs, and SNPs showing missing genotypes were removed from the data applying the “geno” flag of PLINK1.9. The threshold of missing call frequency was set to 0.1. The SNPs that failed in the Hardy-Weinberg equilibrium tests (p = 0.05) were also removed from the data. After the preparation procedures, 665,073 SNPs remained in the dataset.

The participants gave their written informed consent to participate in this study. They all got personal verbal information prior to their signed consent. All samples were anonymized. The study belongs to a series of investigations that—along with the consent form—were approved by the National Ethics Board (ETT TUKEB), and by Regional Ethics Committee of Pécs. The research was conducted according to the principles expressed in the Declaration of Helsinki.

In addition to our data, we also used genome-wide autosomal marker data of populations from the Human Genome Diversity Project whole genomes to increase overlap (genotyped on Illumina 650Y platform) and also considered two datasets (genotyped on Illumina 610K platforms) obtained from the open genotype database of the Estonian Biocentre detailed in Supplementary Table S3 (Cavalli-Sforza, 2005; Rosenberg et al., 2005; Behar et al., 2010; Yunusbayev et al., 2012). We performed on these data the same quality control as described above in the case of TLH and TLS. From the HGDP data, we used the available populations of Europe, the Caucasus area, and South Asia. European groups were the French (n = 29), Orcadians (n = 16), French Basques (n = 24), North Italians (n = 13), Sardinians (n = 28), Tuscans (n = 8), and Russians (n = 25). A population from the Caucasus, the Adygei (n = 17) was also used. South Asian populations were the Balochi (n = 23), Brahui (n = 25), Burusho (n = 25), Hazara (n = 16), Kalash (n = 23), Makrani (n = 23), Pashtun (n = 23), and Sindhi (n = 23). We used the Uyghurs (n = 10), the Han Chinese (n = 44), and the Yoruba samples (n = 19) from the HGDP dataset as outgroups. From the data of the Estonian Biocentre, we selected Hungarian samples (n = 20) representing the Hungarians living in today’s Hungary (we will refer to them simply as Hungarians from now on) and Romanians (n = 16) of contemporary Romania. German samples were also selected from the Estonian data. All the data were tested in preliminary population structure and ancestry analyses such as PCA and ADMIXTURE. German data were filtered according to preliminary PCA, since we discovered that some of the German samples are outliers possessing significant non-West European genetic ancestry. After filtering, 10 German samples remained.

Population Structure, Ancestry Analysis, and Fst Calculations

Population structure analysis was carried out with the SMARTPCA software, which is part of the EIGENSOFT 6.01 software package (Patterson et al., 2006). In these investigations, we used a merged dataset containing all previously listed populations, which contained 578 individuals and 110,734 SNPs. The background linkage disequilibrium in genome-wide marker data can bias both PCA and maximum likelihood estimation methods based on expectation maximization. Therefore, SNPs with strong LD were removed from the data using the “indep-pairwise” pruning command of PLINK 1.9. The r2 threshold was set to 0.3, all other parameters were left at their default setting. After the pruning process, 80,056 SNPs were present in the data. We run SMARTPCA with its default settings. Sigma threshold of outlier samples was set to 6.0 and the option “altnormstyle” was set to enabled, which uses the normalization formula presented in Patterson et al. (2006). Besides the calculated eigenvectors and eigenvalues, SMARTPCA also outputs an Fst matrix containing the pairwise average allele frequency differentiation values between the analyzed populations. Therefore, Fst calculations were also carried out with the SMARTPCA algorithm.

Ancestry estimation was performed using ADMIXTURE 1.22 software package (Alexander et al., 2009). We calculated probable ancestries with K values from 2 to 10 and cross-validation was also performed in order to find the best K value that fits to our dataset.

DNA Segment Analyses

DNA segments of interest were identified using Refined IBD 1.02 (Browning and Browning, 2013). Both identical-by-descent (IBD) and homozygous-by-descent (HBD) segment detection were carried out with the same software. These analyses are haplotype-based analyses as opposed to the previous methods, which are allele frequency-based approaches. Therefore, they do not need to be pruned based on background LD. We used the unpruned version of the dataset created for the PCA and ADMIXTURE analysis (n = 578; 110,734 SNPs). Genotyping data in binary PLINK format was converted to VCF format using the PLINK/SEQ algorithm (Purcell, 2014). Refined IBD analysis was performed with an IBD scale parameter set to 2.4 according to the recommendations of the developers of the algorithm. Window size was set to 25,000 and window overlap value was 3000. All other parameters were left on its default setting.

Average pairwise IBD sharing was calculated from the output of refined IBD according to the method applied in Atzmon et al. (2010).

Averageshare=i=1nj=1mIBDijn×m

In the equation, IBDij is the length of the IBD segment shared between the individuals i and j from populations I and J, and n, m are the number of individuals in populations I and J.

HBD segment counts were also calculated from the output data of the same refined IBD run. Length and number of shared IBD segments and HBD segments were investigated by calculating averages and statistics in the form of boxplots.

The 4-Population Tests

Four-Population tests were applied to check if the relationship of investigated Hungarian-speaking groups, the Szeklers from Korond and Transylvanian Hungarians with Hungarians are well detectable. The purpose of this method was to test the assumed connection of TLH and TLS to Hungarians. It was carried out with the 4-population test algorithm of ADMIXTOOLS 4.1, which is implemented in this software package as D-statistics (Green et al., 2010; Patterson et al., 2012). In this investigation, the same unpruned dataset was used which we applied previously in the IBD analysis. We used the following four populations for the tests: Yoruba, the two groups in the focus of this study, Hungarians, and Romanians. We also checked the setups with Han Chinese and Adygei as outgroups instead of the Yorubas. The 4-population test uses a hypothetical unrooted phylogenetic tree [(W) (X)] [(Y) (Z)], where W, X and Y, Z groups are supposed to be related. If the Z-score of the 4-population test is not significant, then the results are not violating this assumption. If the Z-score is significantly positive (greater than or equal to 3), then W, Y or X, Z. If the Z-score happens to be significantly negative (less than or equal to −3), then W, Z or X, Y are related. Here the setup of the unrooted phylogenetic trees were: [(Yoruba) (Study focus groups) (Hungarians) (Romanians)] and [(Han Chinese) (Study focus groups) (Hungarians) (Romanians)] and [(Adygei) (Study focus groups) (Hungarians) (Romanians)].

Results

Relationship of the Transylvanian Samples to European and Worldwide Populations

Both PCA and ADMIXTURE analysis showed that the investigated Szeklers and Transylvanian Hungarians were relatively homogeneous and clustered tightly together and close to Hungarians and Romanians (Figure 1). These populations show a quite high similarity to each other on PCA and on the ADMIXTURE graph, referring to their very strong common history.

FIGURE 1
www.frontiersin.org

FIGURE 1. PCA results plotted on the first four eigenvectors. Eigenvalues of the first four principal components were 20.711, 14.184, 6.219, and 3.243, respectively. PCA featuring only 13 populations is the result of the same analysis plotting only the respective populations. TLH—Transylvania-living Hungarians; TLS—Transylvania-living Szeklers.

The investigated populations are clustered between South and East Europe with Hungarians and Romanians mirroring their actual geographical positions. Europe and the Caucasus are well separated from South Asian populations. Some of the South Asian ethnicities split into two groups depending on whether they have significant East Asian or African derived ancestry components. In this investigation, Han Chinese represented the Eastern component and Yoruba represented the African component, which are clearly separated from each other and from European and South Asian populations. On the contrary, Europeans do not show such a tendency. Han Chinese and Yoruba groups were not incorporated on the PCA graphs due to the large plotting distances, however, the complete PCA can be found in the Supplementary Figure S1.

Fst calculations also show that investigated populations are close to each other, with neighboring populations, and with Germans. This observed similarity with Germans might be formed through their admixture with the Transylvanian Saxon population and in the case of Hungarians with direct admixture with Swabian immigrants (Figure 2). It can also be observed on the PCA figure, since TLH, TLS, and Hungarian clusters are significantly overlapping with Germans. TLH, TLS, Hungarians, and Romanians have the highest Fst with geographically isolated groups like Orcadians and Sardinians and genetically isolated populations like French Basques. Fst values also reflect the geographical distances, since Adygei samples have a relatively high Fst with the four groups, the TLH, TLS, Hungarians, and Romanians. It can also be observed that TLH samples have the highest Fst with all European groups compared to TLS, Hungarians, and Romanians. Fst values of TLS is most similar to those of Romanians. Hungarians—along with TLH—tend to have lower Fst with West Europeans, Romanians have lower Fst with East and South Europeans. According to the average pairwise allele frequency differentiation values, it looks like that TLH is most similar to Hungarians in terms of Fst differences considering the various European populations. They share a same pattern and are just farther away from them genetically than Hungarians.

FIGURE 2
www.frontiersin.org

FIGURE 2. Fst calculation results.

The cross-validation error of the ADMIXTURE analysis was the lowest at K = 5 hypothetical ancestral groups (Supplementary Figure S2). The analysis supported the results of the PCA and showed also that French Basques and, in the case of the PCA, the Sardinians are separated from other European populations. This can be explained due to the well-known strong genetic isolation of the Basque and Sardinian populations (Figure 3).

FIGURE 3
www.frontiersin.org

FIGURE 3. Admixture analysis results of the investigated populations with K = 5 hypothetical ancestral groups.

Seeking for the Source of Ancestry

According to the average pairwise IBD sharing results, the relationship of TLS and TLH to Hungarians of contemporary Hungary is varied. TLS and TLH both show similarities to Hungarians in certain aspects. Our investigation showed that non-Szekler Transylvanian Hungarians have more similarity to Hungarians in the case of ancestry derived from West Europe. TLS, on the other hand, shares more similarities in South European-derived ancestry with Hungarians. Hungarians have a much more significant East European ancestry component than either TLS or TLH (Figure 4; Supplementary Table S1). These can also be observed on the boxplot diagrams of Supplementary Figure S3, visualizing the IBD segment length distribution of European populations.

FIGURE 4
www.frontiersin.org

FIGURE 4. Average pairwise IBD sharing between investigated populations.

Average sharing results show that Hungarians are somewhat closer to TLS than TLH. We know that Hungarians have been admixed heavily with immigrating populations from the age of the Árpád Dynasty until today. This might also suggest that TLH is a somewhat more isolated population and is less affected by admixture with other Transylvanian ethnic groups. Examinations regarding the average length of IBD segments shared between pairs of individuals do not show any recognizable pattern. Analyzing the average number of IBD segments between pairs of individuals pointed out the similarity of Hungarians, TLS, and TLH. However, it also shows that populations in the Transylvanian region, including Romanians, are in general less admixed with West Europeans and also with Russians (Supplementary Figures S4, S5).

Autozygosity Analysis

According to the HBD segment analysis, as expected, French Basques, Sardinians, and Orcadians showed the highest extent of autozygosity indicating their strongly isolated nature. Szeklers and Transylvanian Hungarians show a higher autozygosity relative to most European populations, therefore, also higher than the autozigosity observed in Hungarians and Romanians (Figure 5; Supplementary Table S2), which could indicate their relatively higher genetic isolation. Investigating the variation of HBD segment number per individual from all groups, we can observe that TLH and TLS have indeed higher numbers of HBD segments compared to Hungarians, Romanians, and most West European populations. HBD segment length examination per individual shows that HBD length is highly variable in the case of most groups, except of Hungarians, Romanians, Germans, and Tuscans (Supplementary Figure S6).

FIGURE 5
www.frontiersin.org

FIGURE 5. Homozygous-by-descent segment analysis results.

Relationship to Hungarians Living in Contemporary Hungary

The 4-population tests revealed that TLS and TLH are strongly related to contemporary Hungarians living in Hungary despite the many admixture events Hungarians went through during the centuries. The results were similar in all three cases applying the three distinct ethnic groups as outgroup populations. Z-scores showed significant negative values indicating that W,Z or X,Y groups are related to each other on the hypothetical unrooted phylogenetic tree (Table 1). In our case, the close relationship of X and Y, TLS and TLH with Hungarians is more likely. All calculations showed significantly high negative Z-scores, however, it was most significant in the case we used the Adygei from the Caucasus region as an outgroup.

TABLE 1
www.frontiersin.org

TABLE 1. 4-population test results.

Discussion

Population structure analysis by PCA and ancestry estimation with ADMIXTURE suggested that TLH and TLS groups are homogeneous, and they showed similar genetic ancestry. Placing these populations in a Eurasian perspective, they were shown to be a well-defined, tightly clustered group, suggesting no signs of recent admixture with populations from other regions considered in these tests.

The DNA segment analyses using identity-by-descent segment discovering tools pointed out that average pairwise IBD sharing results are similar in TLS, TLH, Hungarians, and Romanians. However, TLS samples had a higher average sharing with the Hungarians. Average sharing results also showed that the investigated Szeklers from Korond might be somewhat less isolated than the inhabitants of Árpád Age villages. This was confirmed by our autozygosity analysis as well. Our IBD calculations pointed out that Romanians differ from Hungarians, TLH, and TLS in the number of detected IBD segments between pairs of individuals, suggesting that Hungarians, TLH, and TLS have a common history. Although this tendency is much more significant in the case of TLH, they also show similar tendencies than of Hungarians regarding the relationship with West Europeans. This has been also indicated earlier by the fixation index values. Our investigations of haplotype-based DNA segment analysis concluded that these two areas, the commune of Korond and the people of the surrounding villages of Szászrégen, are noticeably isolated compared to most European populations, which could have helped to retain the strong relationship with the people of contemporary Hungary even though the contemporary Hungarian population is admixed, showing strong sources of ancestry from both West and East Europe. The source of West European ancestry might be primarily the Germans through the Swabian immigration mentioned before.

The 4-population tests also showed that these Hungarian-speaking ethnic groups, the Szeklers from Korond and Transylvanian Hungarians, are strongly related to Hungarians living in Hungary.

Earlier genetic studies involving Hungarians and Hungarian-speaking ethnicities outside of contemporary Hungary are based on Y chromosomal and mitochondrial DNA haplogroup analyses and gene frequency analysis of blood groups and red cell enzymes coding nuclear loci (Guglielmino et al., 2000; Brandstatter et al., 2007; Biro et al., 2015). The results of these investigations showed that Hungarians and Hungarian-speaking ethnicities of Romania have significant Middle Eastern ancestry from the Iranians and Turks besides the Slav and German derived ancestry components. Hungarian-speaking ethnicities of Transylvania also have a higher rate of Turkic ancestry than Hungarians living in Hungary. Certain studies pointed out that Hungarian ethnicities in Transylvania also have Central Asian genetic elements, with the highest extent in the case of the Szeklers, which was 7.3% (Biro et al., 2015). That could also imply that Hungarians from ancient Transylvanian villages and the Szeklers are a rather isolated population with less admixture between the neighboring ethnicities involved. The isolated nature of Szeklers and Transylvanian Hungarians could preserve these Central Asian and Turkic roots. This Central Asian genetic heritage needs further investigations in order to shed light on its origin, which could point out indirect transmission through Slavic groups, but also could strengthen the Central Asian origin of Szeklers on a genetic basis. One of our previous studies investigated Csango samples from Bacau county, Romania (Ádám et al., 2020). Csangos are a Transylvanian Hungarian ethnic group similar to Szeklers with an even more isolated nature and showed to have a unique Turkic ancestry from Central European and Siberian Turkic populations. These results, regarding the significant Turkic ancestry estimated in Csangos, are in line with the previously described Central Asian Turkic ancestry and is even more significant than the Turkic ancestry proportion reported previously in the Szeklers. Investigating the Szeklers in a similar manner with subsequent studies could help to discover their nature. We could get closer to learning about their exact origins and could shed light on their connection with other Hungarian ethnic groups of Transylvania such as the Csangos.

In summary, we can conclude that Szeklers from the area of Korond and non-Szekler Transylvanian Hungarians are homogeneous, and more isolated than the continental population of Europe, especially in the case of the Western Europeans. Our analyses show that their genetic connection to Hungarians is still well detectable not only using methods based on allele frequency calculations, but also results of haplotype-based methods suggest this conclusion.

Our study is the first genome-wide autosomal marker-based analysis of Szekler population samples, and of samples from Transylvania living non-Szekler Hungarian population living in settlements established in the age of the Árpád Dynasty. With our investigation, we described the genetic makeup of Szeklers, assessed their degree of isolation, and found that they can be separated from Transylvanian Hungarians using genome-wide data. We also described their connection with Hungarians, Romanians, and European populations. Subsequent research should aim at the study of their assumed Central Asian and East Asian Turkic ancestry.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries regarding the Szekler data can be directed to the corresponding authors.

Ethics Statement

The studies involving human participants were reviewed and approved by the National Ethics Board (ETT TUKEB) and Regional Ethics Committee of Pécs. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

All authors have materially participated in this work. VÁ and ZB conceived and designed the investigations; ZB, GB, and AS evaluated and interpreted the results of the tests carried out. KS, AS, and LM contributed to the data collection and genotyping. VÁ, ZB, KS, and LM co-wrote the manuscript and BM, AM, and MK revised the initial draft of the manuscript.

Funding

This study was supported by the National Scientific Research Program (NKFI) K 138669 and K 119540. This study was supported by the Institute of Hungarian Research, Budapest, Hungary. This study was supported by the Human Resources Development Operational Program, Ministry of Human Resources, Hungary, and by the Medical School of University of Pécs; EFOP-3.6.1-16-2016-00004.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The present scientific contribution is dedicated to the 650th anniversary of the foundation of the University of Pécs, Hungary.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.841769/full#supplementary-material

References

Ádám, V., Bánfai, Z., Maász, A., Sümegi, K., Miseta, A., and Melegh, B. (2020). Investigating the Genetic Characteristics of the Csangos, a Traditionally Hungarian Speaking Ethnic Group Residing in Romania. J. Hum. Genet. 65, 1093–1103. doi:10.1038/s10038-020-0799-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Alexander, D. H., Novembre, J., and Lange, K. (2009). Fast Model-Based Estimation of Ancestry in Unrelated Individuals. Genome Res. 19, 1655–1664. doi:10.1101/gr.094052.109

PubMed Abstract | CrossRef Full Text | Google Scholar

Atzmon, G., Hao, L., Pe'er, I., Velez, C., Pearlman, A., Palamara, P. F., et al. (2010). Abraham's Children in the Genome Era: Major Jewish Diaspora Populations Comprise Distinct Genetic Clusters with Shared Middle Eastern Ancestry. Am. J. Hum. Genet. 86, 850–859. doi:10.1016/j.ajhg.2010.04.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Barta, G., Bóna, I., Köpeczi, B., Makkai, L., Miskolczy, A., Mócsy, A., et al. (1994). History of Transylvania. Budapest: Akadémiai Kiadó.

Google Scholar

Behar, D. M., Yunusbayev, B., Metspalu, M., Metspalu, E., Rosset, S., Parik, J., et al. (2010). The Genome-wide Structure of the Jewish People. Nature 466, 238–242. doi:10.1038/nature09103

PubMed Abstract | CrossRef Full Text | Google Scholar

Bíró, A., Fehér, T., Bárány, G., and Pamjav, H. (2015). Testing Central and Inner Asian Admixture Among Contemporary Hungarians. Forensic Sci. Int. Genet. 15, 121–126. doi:10.1016/j.fsigen.2014.11.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Brandstätter, A., Egyed, B., Zimmermann, B., Duftner, N., Padar, Z., and Parson, W. (2007). Migration Rates and Genetic Structure of Two Hungarian Ethnic Groups in Transylvania, Romania. Ann. Hum. Genet 71, 791–803. doi:10.1111/j.1469-1809.2007.00371.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Browning, B. L., and Browning, S. R. (2013). Improving the Accuracy and Efficiency of Identity-By-Descent Detection in Population Data. Genetics 194, 459–471. doi:10.1534/genetics.113.150029

PubMed Abstract | CrossRef Full Text | Google Scholar

Cavalli-Sforza, L. L. (2005). The Human Genome Diversity Project: Past, Present and Future. Nat. Rev. Genet. 6, 333–340. doi:10.1038/nrg159610.1038/nrg1579

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, C. C., Chow, C. C., Tellier, L. C., Vattikuti, S., Purcell, S. M., and Lee, J. J. (2015). Second-generation PLINK: Rising to the challenge of Larger and Richer Datasets. GigaSci 4, 7. doi:10.1186/s13742-015-0047-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Eberhardt, P. (2003). Ethnic Groups and Population Changes in Twentieth Century Eastern Europe. London, UK: Routledge.

Google Scholar

Egyed, Á. (2013). A Székelyek Rövid Története a Megtelepedéstől 1989-ig. Pallas-Akadémiai kiadó.

Google Scholar

Engel, P. (2005). “The Realm of St Stephen: A History of Medieval Hungary,” in International Library of Historical Studies. Editor A. Ayton (London, UK: I.B. Tauris), 895–1526.

Google Scholar

Green, R. E., Krause, J., Briggs, A. W., Maricic, T., Stenzel, U., Kircher, M., et al. (2010). A Draft Sequence of the Neandertal Genome. Science 328, 710–722. doi:10.1126/science.1188021

PubMed Abstract | CrossRef Full Text | Google Scholar

Guglielmino, C. R., Silvestri, A., and Beres, J. (2000). Probable Ancestors of Hungarian Ethnic Groups: an Admixture Analysis. Ann. Hum. Genet 64, 145–159. doi:10.1046/j.1469-1809.2000.6420145.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Hermann, G. M. (2004). Székely Történeti Kistükör 1848-ig [Small Mirror of Székely History until 1848]. Székelyudvarhely: Litera könyvkiadó.

Google Scholar

National Institute of Statistics (2011). Recensământul Populației Și Locuințelor, Rezultate definitive_RPL_2011: Tab8. Populaţia Stabilă După Etnie – Judeţe, Municipii, Oraşe, Comune. Available at: http://www.recensamantromania.ro/rezultate-2/ (Accessed June 2, 2021).

Google Scholar

Patterson, N., Moorjani, P., Luo, Y., Mallick, S., Rohland, N., Zhan, Y., et al. (2012). Ancient Admixture in Human History. Genetics 192, 1065–1093. doi:10.1534/genetics.112.145037

PubMed Abstract | CrossRef Full Text | Google Scholar

Patterson, N., Price, A. L., and Reich, D. (2006). Population Structure and Eigenanalysis. Plos Genet. 2, e190. doi:10.1371/journal.pgen.0020190

PubMed Abstract | CrossRef Full Text | Google Scholar

Piaschka, R. (1975). Austrian Policy towards the Balkans in the Second Half of the Eighteenth Century: Maria Theresa and Josef II. East. Eur. Q. 9.

Google Scholar

Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., et al. (2007). PLINK: a Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am. J. Hum. Genet. 81, 559–575. doi:10.1086/519795

PubMed Abstract | CrossRef Full Text | Google Scholar

Purcell, S. (2014). PLINK/SEQ: A Library for the Analysis of Genetic Variation Data. Available at: https://atgu.mgh.harvard.edu/plinkseq.

Google Scholar

Rosenberg, N. A., Mahajan, S., Ramachandran, S., Zhao, C., Pritchard, J. K., and Feldman, M. W. (2005). Clines, Clusters, and the Effect of Study Design on the Inference of Human Population Structure. Plos Genet. 1, e70. doi:10.1371/journal.pgen.0010070

PubMed Abstract | CrossRef Full Text | Google Scholar

Seton-Watson, R. W. (1933). The Problem of Treaty Revision and the Hungarian Frontiers. Int. Aff. 12, 481–503. doi:10.2307/2603603

CrossRef Full Text | Google Scholar

Stroschein, S. (2012). Ethnic Struggle, Coexistence, and Democratization in Eastern Europe. London, UK: Cambridge University Press.

Google Scholar

Yunusbayev, B., Metspalu, M., Jarve, M., Kutuev, I., Rootsi, S., Metspalu, E., et al. (2012). The Caucasus as an Asymmetric Semipermeable Barrier to Ancient Human Migrations. Mol. Biol. Evol. 29, 359–365. doi:10.1093/molbev/msr221

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: population genetics, identical by descent, Transylvania, Szeklers, genome-wide data

Citation: Ádám V, Bánfai Z, Sümegi K, Büki G, Szabó A, Magyari L, Miseta A, Kásler M and Melegh B (2022) Genome-Wide Marker Data-Based Comparative Population Analysis of Szeklers From Korond, Transylvania, and From Transylvania Living Non-Szekler Hungarians. Front. Genet. 13:841769. doi: 10.3389/fgene.2022.841769

Received: 22 December 2021; Accepted: 23 February 2022;
Published: 28 March 2022.

Edited by:

Peter Poczai, University of Helsinki, Finland

Reviewed by:

Szilvia Kusza, University of Debrecen, Hungary
Consuelo Quinto, UGA—LANGEBIO, Mexico

Copyright © 2022 Ádám, Bánfai, Sümegi, Büki, Szabó, Magyari, Miseta, Kásler and Melegh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zsolt Bánfai, YmFuZmFpLnpzb2x0QHB0ZS5odQ==; Béla Melegh, bWVsZWdoLmJlbGFAcHRlLmh1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.