Corrigendum: The maternal U1 haplogroup in the Koraga tribe as a correlate of their North Dravidian linguistic affinity
- 1Department of Applied Zoology, Mangalore University, Mangalore, Karnataka, India
- 2Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, Karnataka, India
- 3Institut für Sprachwissenschaft, Universität Bern, Bern, Switzerland
Introduction: The Koraga tribe are an isolated endogamous tribal group found in the southwest coastal region of India. The Koraga language shares inherited grammatical features with North Dravidian languages. To seek a possible genetic basis for this exceptionality and understand the maternal lineage pattern, we have aimed to reconstruct the inter-population and intra-population relationships of the Koraga tribal population by using mtDNA markers for the hypervariable regions along with a partial coding region sequence analysis.
Methods and Results: Amongst the 96 individuals studied, we observe 11 haplogroups, of which a few are shared and others are unique to the clans Soppu, Oṇṭi and Kuṇṭu. In addition to several deep rooted Indian-specific lineages of macrohaplogroups M and U, we observe a high frequency of the U1 lineage (∼38%), unique to the Koraga. A Bayesian analysis of the U1 clade shows that the Koraga tribe share their maternal lineage with ancestral populations of the Caucasus at the cusp of the Last Glacial Maximum.
Discussion: Our study suggests that the U1 lineage found in the Indian subcontinent represents a remnant of a post-glacial dispersal. The presence of West Asian U1 when viewed along with historical linguistics leads us to hypothesise that Koraga represents a mother tongue retained by a vanquished population group that fled southward at the demise of the Indus civilisation as opposed to a father tongue, associated with a particular paternal lineage.
1 Introduction
The Indian subcontinent acted as a major corridor for early human migrations. Indian population structure has been shaped by various waves of migration, some of which followed a southern coastal route (Thangaraj et al., 2005). The most remarkable feature of Indian populations is the clear stratification between castes and tribes (Cordaux et al., 2003; Cordaux et al., 2004). The immense cultural, linguistic and ethnic diversity amongst Indian populations offers tremendous scope for genetic diversity studies. The tribal populations of India considered to be ‘aboriginal’ represent 8.6% of the total population (Registrar General and Census Commissioner of India, 2011). There are more than 700 tribes in India whose languages are either affiliated with the Austroasiatic, Indo-European, Dravidian and Trans-Himalayan languages families or represent linguistic isolates. In the broadest of geographical terms, Indian tribes tend to be classified as Southern, Northern, Eastern and Western tribal populations. Each ethnic tribal population is unique with respect to language, lifestyle and social customs. Studies have demonstrated that the populace of the Indian subcontinent is comprised of numerous small endogamous populations as a consequence of strict endogamy and social customs, resulting in the great complexity observed in the genetics of Indian populations (Cordaux et al., 2003; 2004; Thangaraj et al., 2005; 2006; Thanseem et al., 2006; Basu et al., 2016; Mustak et al., 2019). Several studies have highlighted that due to the founder effect, recessive diseases manifest themselves in Indian populations (e.g., Reich et al., 2009). The southern tribes have an interesting maternal haplotype distribution pattern with some tribes showing very high frequency of Indian-specific M haplogroup and others showing higher frequency of West Eurasian haplogroups (Anthropological Survey of India, 2021a). The Koraga tribe belongs to the latter group.
The Koraga represent a small endogamous population with extremely low social status in Indian society. They are mainly concentrated in parts of Dakṣiṇa Kannaḍa and Uḍupi districts of Karnāṭaka, Kāsaragoḍ district of Kerala, and also found in small numbers in the adjoining districts of Śivamogga (Shimoga) and Koḍagu (Coorg) in Uttara Kannaḍa (Figure 1). As per the Census 2011, the Koraga population in these states tallies at 16,376. Between 1991 and 2011, a 10% decrease was observed in the number of individuals in Karnataka. The Koraga are amongst the poorest and most marginalised populations of South Kanarā. They weave baskets, cradles and winnowing trays, collect firewood and honey from nearby forests and work as seasonal labourers for a daily wage. Koraga people are classified into three endogamous groups, i.e., Soppu (ಸೊಪ್ಪು “lettuce”) or Tappu (ತಪ್ಪು “leaves”), Kuṇṭu (ಕುಂಟು “cloth”) and the Oṇṭi “earring” (Thurston and Rangachari, 1909). These divisions were based on the tribal dress they wore (Sherring, 1881). Bhat (1971) identified three dialect communities within the Koraga, which apparently coincide with the clan divisions, i.e., Tappu, ‘Mudu’ and Oṇṭi. The Koraga are traditionally classified within the Caṇḍāla caste. Progeny from the union of a higher caste female and a lower caste male are also classified within the untouchable Caṇḍāla category (Thurston and Rangachari, 1909). A folklore tale recounts a clash between Habāśika, a Koraga chieftain and his Caṇḍāla army from the Ghats, with the Kadamba rulers of Banavāsī (Walhouse, 1875).
FIGURE 1. Map showing U1 haplogroup spatial distribution, geographical location of studied populations and North Dravidian languages.
The Koraga speak a Dravidian language, the precise phylogenetic propinquity of which within the language family remains unresolved. Bhat (1971) and McAlpin (1981) grouped Koraga together with Kurukh and Malto under the North Dravidian branch. Zvelebil (1990) proposed to treat Koraga as an independent branch of Dravidian under its own node in the tree, like Brahui. The Koraga language has been influenced for centuries by surrounding Tuḷu speakers, and many Koraga are bilingual in Tuḷu. Krishnamurti (2003) therefore grouped Koraga as close to Tuḷu and opined that Koraga look “like an offshoot of Tuḷu at a recent past.” However, he conceded that such a phylogenetic assignment would be problematic: “The location of Tuḷu in the family tree is doubtful and Koraga needs to be appropriately located in the subgrouping scheme.” At variance with Krishnamurti’s conjecture and in accordance with the phylogenetic assignments proposed by Bhat (1971), McAlpin (1981) and Zvelebil (1990), Koraga forms the past tense with the suffix <-k>, a grammatical feature which Koraga shares uniquely with North Dravidian and which is not found in South Dravidian languages.
The prehistory of Dravidian can only be unravelled against the backdrop of the caste system which arose in the aftermath of the advent of speakers of Indo-Aryan languages to the subcontinent at the beginning of the second millennium BC. The original Śūdras comprising the fourth and lowest caste or varṇa “colour” represented the vanquished indigenous pre-Aryan people who populated the Indus Valley civilisation before the Aryan invasion. However, the development of the caste system was a complex process, as were the patterns of migration set into motion by demographic changes connected to the decline of Harappan civilisation (van Driem, 2021). In this regard, Caldwell (1856) noted the significance of the fact that the majority of people in the Dravidian south were categorised as Śūdras. The vertical stratification of society determined the direction of migrations and linguistic assimilation but also the direction of hypergamy practised by women in communities of low status. Historically, the Koraga were considered to be amongst the lowliest of the untouchables. After Indian independence, positive measures were taken to provide social justice to historically underprivileged groups. Yet the legacy of low status outside of the caste system lingers on as a social stigma which endures to the present day.
Somatological impressionism in view of the physical phenotype prompted Stuart, in his report for the 1891 census, to opine that the “Koragas are an indigenous race. They speak Tuḷu language for communication with other people, but it is supposed they use a separate language at home quite distinct with any other main language family” (Stuart, 1893). Today the Koraga feel ashamed of their native language and immediately switch to Tuḷu if they notice that they are being observed by outsiders, even at a distance. The Koraga language has come to be perceived by the speakers themselves as an overt mark of their inferior socio-economic status in Indian society.
Various assumptions have been made about their origin by anthropologists, linguistics and historians, and origin stories exist in local folklore. Earlier genetic studies grouped the Koraga tribe under the South Indian tribes (Forster et al., 2002; Cordaux et al., 2003; Debnath et al., 2011; Gupta et al., 2012; Palanichamy et al., 2015; Anthropological Survey of India, 2021a). A recent genomic survey of over 2,000 samples from 75 Indian communities, mostly tribes, included Koraga genomes (Anthropological Survey of India, 2021b). The findings of these studies, although noteworthy, were confined to the broader objective of overall genetic diversity. A focused study interlinking linguistics and the maternal or paternal lineages of Koraga does not exist to date. The present study therefore aims to reconstruct the intra- and inter-population relationships of the Koraga tribe and provide an age estimate for the arrival of their ancestors on the southwestern coast of India, thereby correlating the findings with historical linguistics.
2 Materials and methods
The present study was approved by the Institutional Human Ethical Committee Mangalore University, Mangaluru (MU/AZ/349/IHEC/2014-2015 dated 15/07/2014). The purpose of the study was explained to all volunteers in vernacular language, and a written consent was obtained before sample collection. During the in-person interaction, their family history and medical history was obtained. Only those healthy individuals who were above the age of 18 and unrelated for at least three generations were included for blood sample collection. About 5–9 mL of intravenous blood sample was collected from 100 individuals from the Koraga tribal population belonging to different clans (Soppu, Kuṇṭu and Oṇṭi) and residing on the southwest coast of Karnataka and Kerala. This study was conducted in accordance with the Declaration of Helsinki. DNA was extracted using phenol-chloroform method described in Thangaraj et al. (2002). The extracted DNA was amplified using Applied Biosystems™ Veriti™ 96-Well Thermal Cycler with the following PCR conditions—95°C for 5 min followed by 35 cycles of 95°C for 30 s (denaturation), 52°C for 30 s (annealing) and 72°C for 60 s, and 72°C for 7 min (elongation). PCR products were sequenced with mitochondrial markers covering the control region (HVR-I and HVR-II) and partial coding region (see Supplementary Table S1). Sequences were compared with the revised Cambridge Reference Sequence (r-CRS), and haplogroups were assigned using Haplogrep 3 (Schönherr et al., 2023).
The mtDNA haplogroup analysis is the initial measure for identification of maternal lineage. Haplogroup frequency helps us to understand intra- and inter-population differences. In order to elucidate genetic differentiation within the Koraga population groups, Fst was estimated based on mtDNA haplogroup frequencies using Arlequin software (Excoffier and Lischer, 2010). Furthermore, PCA (principal component analysis) was performed using prcomp package in R software to understand the clustering pattern amongst selected populations based on the haplogroup frequencies. Median-joining network tree analysis was performed using the POPART software to assess haplotype sharing between the clans (Leigh and Bryant, 2015). Both PCA and median joining network analysis were performed with default parameters.
Bayesian analysis was performed using BEAST v2.7.4 software (Bouckaert et al., 2019) to measure the divergence time for U1 clades found in the Koraga and other global populations. We also included high-coverage ancient samples from the AmtDB database (Ehler et al., 2019) as calibration priors. Bayesian phylogenetic analysis was performed as per the protocol described by Connell et al. (2022). As an outgroup, the L2c2 mitogenome of a Moreno person (PaMOR16007) was employed. The sequences from the non-coding region were used in BEAST runs. Runs were performed using the HKY substitution model, and trees were connected. For the HVS1 and HVS2 regions, rigorous molecular clocks with mutation rates of 1.292 and 0.369 mutations/site/million years, respectively, were used (Connell et al., 2022). The following settings were used in Tree annotator to create a consensus tree: Common Ancestor Heights, 50% burning (produced higher posterior probability values). The tree was rerooted to the L2c2 outgroup in Figtree. In order to determine TMRCAs, median heights with 95% HPD (high probability densities) were used. All runs were carried out using 5,000,000 burning and 50,000,000 iterations, taking samples at intervals of 10,000 MCMC (Markov chain Monte Carlo) steps (Olivieri et al., 2017; Brandini et al., 2018; Capodiferro et al., 2021).
3 Results
3.1 Maternal haplogroup diversity within the Koraga tribe
In the present study, mitochondrial markers were used to trace the maternal ethnic origin and population structure of the Koraga tribe. An earlier study by Cordaux et al. (2003) recorded only three major haplogroups, viz. U1a, M3 and U2a (lower frequency), with little diversity within the Koraga tribal population, whereas the present study identified the presence of 19 haplogroups, viz. L3e’i’k’x, U1, U1a, U1a1a, U2, U2a1, H2a*, H33b, M1a3, M1a3b1, M2a1a2, M3, M3a2a, M7a, M30, M40, N, N9b, D4k. However, the haplogroups U1a, U1a1a, U2a1, M3a2a and M30 accounted for 72% of the total variation. The previous study did not investigate clan-wise categorisation amongst the Koraga (Cordaux et al., 2003), whereas the present study categorised the Koraga into the three major clans Soppu, Kuṇṭu and Oṇṭi (Figure 2).
The haplogroups observed among the Koraga were nested in the lineages of macrohaplogroups M, N, R and U. Amongst the quantified mtDNA haplogroups, U1a1a (27.1%), M3a2a (16.7%), M30 (11.5%), U2a1 (9.4%) and U1a (7.3%) were the most common among the Koraga (Table 1). The majority of Koraga samples fell into known sub-lineages of mtDNA haplogroup U, viz. U1, U1a, U1a1a, U2 and U2a1, which altogether account for about 51% of the variation observed in West-Eurasian-specific lineages. Interestingly, U1 subclades uncommon to the Indian subcontinent (Kivisild et al., 1999) contributed to two thirds of the total U haplogroup distribution observed in the Koraga (Figure 1). The haplogroup U1a is characterised by an HVS1 back mutation at np16189 (T16189C!), most of the subclusters of which are defined by A385G (U1a1a) and C16400T in the non-coding region. C16400T has been reported earlier for the L, A, H and K subclades (van Oven and Kayser, 2009). We report C16400T for the U1a haplogroup in the Koraga population along with T199C. Autochthonous subclades of haplogroup U have been reported earlier for South Indian tribes (Ingman and Gyllensten, 2003; Sylvester et al., 2019) including the Koraga, i.e., U1a1a4, dated at 17,900 ± 6800 YBP (Palanichamy et al., 2015). In our study, the Koraga tribe show an unusually high frequency for U1 with a very low nucleotide diversity (π = 0.00064) and a relatively high haplotype diversity (Hd = 0.957), suggesting an early bottleneck, possibly towards the end of Late Glacial Maximum (∼18kya) (Silva et al., 2017) with the subsequent accumulation of variations (Grant and Bowen, 1998).
Other than the West-Eurasian-specific mtDNA lineages, Indian-specific mtDNA lineages of haplogroup M and its subclades were also observed at a frequency of 41.7% amongst the Koraga (Table 1). We report the mtDNA haplogroups M2a1a2, M3, M3a2a, M7a, M30 and M40 in the Koraga in the current study. The approximate age estimates for these M haplogroups range from between 15 and 44 YBP (Chandrasekar et al., 2009), suggesting that the ancestors of the modern Koraga population admixed with individuals carrying Indian-specific haplogroups along their ancestral route of migration. M3 subclusters with a 17.7% frequency in the Koraga could indicate that admixture took place in the western or northwestern portion of the Indian subcontinent, since this haplogroup is predominant in these areas (Metspalu, 2004). Another interesting finding is the presence of traces of the African L3e’i’k’x, M1a3 and M1a3b1 haplogroups in the Koraga population. Although retained only as traces, these haplogroups represent unambiguous artefacts of the earliest human migrations out of Africa (Olivieri et al., 2006).
Some mitochondrial haplogroups were shared amongst the three clans, whilst some were unique to a particular clan. The mtDNA haplogroup U1a and its subclades are shared by all three Koraga clans and distributed almost equally amongst them (5.3%–31.6%). Haplogroup M3a2a was the second most common haplogroup, likewise shared amongst all three Koraga clans (Table 1). The haplogroup U2a1 was found in the highest frequency in the Kuṇṭu at 21.1%, at a lower frequency in the Oṇṭi at 8.8%, and completely absent in the Soppu Koraga. The haplogroup M2a1a2 was found only in the Soppu, at a frequency of 20%, and absent in the Kuṇṭu and Oṇṭi. The haplogroup M30 was absent in the Soppu, whereas M40 was found to be more frequent in the Soppu as compared to the Kuṇṭu and Oṇṭi. Overall, the Kuṇṭu reported a higher frequency of West Eurasian U1 subclusters (42.1%), whilst the Soppu reported a higher frequency of Indian-specific M (60%) (Table 1).
We performed network analysis to understand the haplotype sharing pattern between the clans. In the 96 samples studied, 48 haplotypes were reported. The Oṇṭi and Soppu clans exhibit more unique haplotypes, whereas most of the haplotypes in the Kuṇṭu are shared (Figure 3). The lowest haplotype diversity was observed in the Oṇṭi and the highest in the Soppu. The African-specific haplogroups M1 and N occur uniquely in the Soppu clan. The Indian-specific haplogroups M2 and M40 are found at higher frequencies in the Soppu than in the other two clans, whilst the haplogroups U1 and M3 are evenly present in all the three clans, all of this suggesting a population stratification within the tribe.
FIGURE 3. Median joining network showing haplotype sharing pattern between the clans. Each node represents a unique mtDNA haplotype. Size of the node represents relative haplotype frequency. Each hatch mark/bar represents a nucleotide change. Black nodes represent unsampled or extinct ancestral sequence.
The pairwise Fst values for the clans lie between 0.02 and 0.12, suggesting that the Koraga clans are less differentiated within the population (Supplementary Figure S1). The Soppu clan is less closely related to the Kuṇṭu than to the Oṇṭi (Supplementary Figure S1). When the Koraga population is compared with other populations, Fst was in the range of 0.10–0.45 (Figure 4), suggesting that the Koraga population is more isolated. Such isolation may have preserved the original genetic variation more faithfully, as is evident from the diversity indices (Table 2). Because of a strong founder effect, a reduced nucleotide diversity is observed within this population (Tournebize et al., 2022). This founder effect is also reflected in their higher genetic distance as well as their higher average number of pairwise differences (Supplementary Figure S2). The Fst analysis indicates that the Koraga tribe are a drifted population due to higher pairwise differences with the neighbouring tribes (Cordaux et al., 2003).
FIGURE 4. Pairwise Fst variation in the Koraga tribe and other populations spread along areas of U1 haplogroup prevalence. MN, Munda; NWI, North West Indian; AED, Dubai; UAE, United Arab Emirates; YEM, Yemen; MLD, Maldives; NP, North Pakistan; SINB, Sindhī-Balocī; IRA, Iran; BK, Bëṭṭʉ Kuṟumba; VED, Vedda; CAU, Caucasus; UK, Ūrāḷi Kuruman; MK, Malekuḍiya; KER, Kerala; JK, Jēnu Kuṟumba.
3.2 Koraga in comparison with West Asian, Caucasian and South Indian tribes
In the inter-population pairwise Fst analysis, we observe that the average pairwise distances between Koraga and other populations (Supplementary Figure S2 above the diagonal) are comparable with those of the Ūrāḷi Kuruman (UK) and Malekuḍiya (MK) tribes. The population pairwise distances are the lowest in the Koraga, followed by the Ūrāḷi Kuruman. Two populations, namely, the Bëṭṭʉ Kuṟumba (BK, cf. Zvelebil, 1982) and the Sindhī-Balocī (SINB), stand out with their higher genetic distances. Interestingly, the Nei’s distance between the Koraga and the Sindhī-Balocī, a northwestern population, is less than between the Koraga and the South Indian Bëṭṭʉ Kuṟumba tribe. Similarly, the Fst value between the Koraga and Caucasian populations is 0.01 (Figure 4), which is much less than between the Koraga and the Bëṭṭʉ Kuṟumba (0.36), suggesting that the Koraga exhibit a greater maternal affinity with populations of the Caucasus and West Asia than with Indian tribes rich in the M2 haplogroup.
Furthermore, in order to understand the clustering pattern with other populations, Principal Component Analysis (PCA) was performed using the haplogroup frequencies (Figure 5). Populations were chosen based on prior information concerning their regional affiliation. The Koraga formed a separate cluster, closer to the northwestern populations. The only other southern population closer were the Jēnu Kuṟumba. The biplot showed that the formation of this cluster was mainly due to the U1 and M3 haplogroups present in these tribes.
FIGURE 5. PCA showing clustering pattern in the Koraga tribe and other populations from the Indian subcontinent. Inset has a biplot showing haplogroups with highest contribution to the variance. Koraga cluster is encircled in a dotted ellipse.
Earlier studies observed that the influence of West Eurasian mtDNA haplogroups is greater amongst high-caste populations such as the Brahmins, and amongst Muslims, whereas the frequencies of West Eurasian haplogroups were reportedly lower amongst tribal populations (Bamshad et al., 2001; Roychoudhury et al., 2001; Kivisild et al., 2003). In contrast to previous studies, the present study shows the prevalence of West Eurasian mitochondrial haplogroup to be high in the Koraga. Although haplogroup U1 is responsible for this contrast, the contribution of M3 in both the Koraga and Jēnu Kuṟumba (a.k.a. Kāṭṭu Nāyakkar, Kaṭṭunāyakan, cf. Zvelebil, 1988) should not be ignored, as this haplogroup is widely present in northwestern populations carrying the Ancestral North Indian component (Reich et al., 2009). With the absence of Y haplogroups associated with recent migration and admixture in both these tribes (Anthropological Survey of India, 2021c), it is unlikely that the gene pool of these tribes was in any way influenced by recent demographic changes. Therefore, the movement of M3 or U1 must have resulted from an earlier wave of migration. Previous studies have, in fact, suggested that the spread of Neolithic agriculture was associated with the pre-Bronze-Age migration of West Eurasian haplogroups into South India (Kivisild et al., 1999; Palanichamy et al., 2004). Others have associated the presence of HV and U1 haplogroups in South India with a proto-Dravidian migration (Palanichamy et al., 2015). We too, observed a close relationship with the Iranian U1 sequences and the Koraga U1 sequences in the Neighbour Joining tree (Supplementary Figure S3).
3.3 Divergence time estimate for U1 haplogroup in the Koraga tribe
In order to date the U1 cluster found in the Koraga tribe, we performed Bayesian analysis. In the Bayesian phylogenetic tree constructed for the maternal U1 haplogroup (Figure 6), all Koraga individuals clustered under the single clade U1a, a maternal lineage shared with populations of the Caucasus, dating from ∼28,000 years ago. The next divergence happened in the LGM recovery period, resulting in a TMRCA dating from ∼16,000 years ago. All the U1 subclades of the Koraga tribe can be presumed to have arisen from a founder whose descendants reached the southwestern coast of the Indian peninsula. These assumptions are based on the spatial distribution of U1 haplogroup (Figure 1) and the divergence time gradient observed in U1 clades from the Caucasus to South India. The TMRCA measured in our study is consistent with earlier results (Supplementary Table S2).
FIGURE 6. Bayesian phylogenetic tree for U1 haplogroup. Divergence time estimates are shown for the U1 samples found in the Koraga tribe (red branches) and Caucasus and Middle Eastern samples (purple branches).
4 Discussion
As with the Hungarians, whose language provides unambiguous evidence of the provenance of the language community notwithstanding the complexity of the Hungarian gene pool (van Driem, 2021), so too the Koraga language clearly indicates a North Dravidian provenance of the community. The genetic profile of the Koraga reveals not just the complex history of the language community but also sheds a new light on the ethnolinguistic prehistory of the Indian subcontinent. Uniparental markers may reveal different pasts of a single language community. Poloni et al. (1997), Poloni and Nicolas Ray (2000) reported that globally the distribution of language families patterned geographically with the prevalent paternal lineages of language communities. Subsequent studies showed how ubiquitously ethnolinguistic phylogeography correlated with Y chromosomal haplogroups, but also stressed that these correlations were neither perfect nor universal (van Driem, 2013; van Driem, 2021). The Bodish language communities of Baltistan in Pakistan-occupied Kashmir (Zerjal et al., 1997; Quintana-Murci et al., 2001; Qamar, 2002; van Driem, 2014) and the Laccadives (Mustak et al., 2019) both represent salient cases of a mother tongue correlation at variance with the globally more frequent father tongue correlation. Palanichamy et al. (2015) suggested that maternal molecular markers might serve as tracer dyes for the spread of Dravidian, and we argue that their proposal makes sense for the Koraga in light of the known sociolinguistic history of the Indian subcontinent.
Paternal lineages of a community afflicted with low social status are less likely to fare well over time. The Brahui are a North Dravidian language community in Beluchistan, who are conventionally regarded as an in situ linguistic remnant of the Dravidian populace of the Indus civilisation. It was proposed that the paternal lineage L might serve as a molecular tracer for the ancient spread of Elamo-Dravidian (van Driem, 2012). However, the Beluch exhibit haplogroup L at a higher frequency than any other group in Pakistan, more so than the Brahui (Qamar, 2002; Kivisild et al., 2003; Trivedi, 2008; Haber et al., 2012; Lacau et al., 2012). This paradoxical finding represents an expected outcome of the social stigma connected with the pre-Indo-Aryan ethnolinguistic identity, which rendered the Brahui prone to genetic contribution from Aryan males through hypergamy practised by succeeding generations of Brahui women (van Driem, 2012; 2021). As in the case of Hungarian, the Brahui language community represents a purely linguistic retention (Pagani et al., 2017). On the other hand, the paternal lineage L (M76) preserved in the Liṅgāyat, Okkaliga and other agriculturalist population groups trace an ancient pathway of southward migration from the Indus basin along the southwestern coast.
As with Brahui, the language of the Koraga served as the most conspicuous marker of inherited low social status, and an earlier study on the Koraga argued that the low haplotype and nucleotide diversity in the tribe indicated strong genetic drift (Cordaux et al., 2003). The mitochondrial lineage U1 observed in the Koraga clusters phylogenetically with populations of Western Asia and the Caucasus (Palanichamy et al., 2015). Tribal groups in South India exhibit reduced diversity and large genetic distances, both among themselves and as compared with other groups. These results reveal no signals of prehistoric demographic expansion and instead reflect enhanced genetic drift, to which these groups were subjected due to small population size and/or bottlenecks.
The Y haplogroup analysis from an earlier study revealed that the Koraga differ markedly from other tribes and caste populations. Y-chromosomal haplogroup D, characterised by the Y-chromosome Alu insertion polymorphism (YAP) at locus DYS287, occurs in a trace amount in the Koraga tribe (Cordaux et al., 2004). This paternal lineage represents a remnant of a very early spread eastward across South Asia, with areas of retention today on the Andamans, in the Himalayas, parts of Southeast Asia and the Japanese archipelago, especially in the Ainu and the Ryūkyūans (Chandrasekar et al., 2007; van Driem, 2021). More salient is the exceedingly high frequency of Y-chromosomal haplogroup H1 (M82) in the Koraga (Anthropological Survey of India, 2021c), a paternal lineage also found in high frequency in Gōṇḍ tribes (Sharma, 2009), in the Kātkarī and Jēnu Kuṟumba and other population groups categorised as untouchable or of low status (Anthropological Survey of India, 2021c). The frequency of Y chromosomal haplogroup F is particularly high in the Sōliga and Mādiga, two untouchable ‘scheduled’ castes who have assimilated to the Dravidian language communities surrounding them and who now each speak their own dialect of Kannaḍa and Telugu respectively. The age of haplogroup F has been estimated at ∼25,000 YBP, reflecting an older stratum of population, whilst the expansion of haplogroup H in the Indian subcontinent has been estimated at ∼7,000 YBP (Poznik et al., 2016).
The only other paternal haplogroup found in the Koraga tribe is R2 (M124), which may reach a frequency of up to 40% in some other Dravidian tribal groups. The Siddi tribe, which is of historically comparatively recent African origin and shows about 40% YAP polymorphism (Thangaraj et al., 1999), is found in geographical proximity to the Koraga population. Some gene flow between the two seems possible, however, the African-specific haplogroups like M1 observed in the Koraga have not been reported in Siddi tribes before (Shah et al., 2011).
The mitochondrial clade U1 in the Koraga tribe ultimately originates in West Asia, and a close affinity between the Caucasian U1 and the Koraga U1 cluster is observed. Based on our Bayesian estimate, the movement of U1 maternal ancestors began at the time of the Last Glacial Maximum. Similar West Eurasian founder lineages are reported in the Ūrāḷi Kuruman, Malekuḍiya, Lakṣadvīp Islanders and other tribes on the southwestern coast (Forster et al., 2002; Palanichamy et al., 2015; Mustak et al., 2019; Sylvester et al., 2019). This maternal lineage may have been borne southward into the Indian subcontinent in the aftermath of the demise of the Harappan civilisation. The presence of West Eurasian mitochondrial haplogroups U1, HV and U7 elsewhere in the Dravidian heartland suggests different waves of incomers to South India at different time depths. The Koraga did not experience recent gene flow from neighbouring tribes, but the two sets of uniparental markers in the language community capture two divergent facets of the Koraga past.
In conclusion, the untouchable status of the Koraga language community, early somatological impressionism based on physical phenotype, the septentrional phylogenetic position of the language within the Dravidian language family in combination with the contrast between our mitochondrial findings and the paternal lineages borne by the population allow us to present the hypothesis that Koraga is a mother tongue retained by a vanquished population group that fled southward at the demise of the Indus civilisation. The original Koraga migrant group encountered other Dravidian populations whose linguistic ancestors had preceded them as part of a pre-Bronze Age southward dispersal of Elamo-Dravidian languages. The reviled social status of the Koraga language community doomed the long-term survival prospects of the original Koraga paternal lineages and enabled their replacement by paternal lineages introduced into the community from local untouchable populations, whereas only the Koraga maternal lineage retained an ancestral correlation with the linguistic affiliation of the language community. This situation has two parallels in the Brahui and Kurukh, where the native Y chromosomes were lost or reduced through hypergamy practised by Dravidian women and Munda women respectively, marrying local men of the Indo-Aryan (i.e., Beluch) and Austroasiatic language communities (Chaubey et al., 2011; van Driem, 2012). This pattern permits us to infer the original low status of Northern Dravidian speakers as a consequence of their subjugation during the demise of the Indus civilisation.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by the Mangalore University Human Ethical Committee. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
JS: Writing–original draft, Data curation, Formal Analysis, Methodology, Software, Writing–review and editing. KV: Writing–original draft, Investigation. RD: Conceptualization, Data curation, Supervision, Validation, Writing–review and editing. GvD: Writing–review and editing. MM: Conceptualization, Project administration, Supervision, Validation, Writing–review and editing, Methodology.
Funding
The authors declare that no funding support was received for the research, authorship and/or publication of this work.
Acknowledgments
The authors thank Dr. Kumarasamy Thangaraj from the Centre for Cell and Molecular Biology (CCMB) at Hyderabad for his support and guidance. The authors acknowledge the participants, lab technicians and research staff involved in the study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2023.1303628/full#supplementary-material
References
Anthropological Survey of India (2021a). Genomic diversity in people of India: focus on mtDNA and Y-chromosome polymorphism. Singapore: Springer Singapore. doi:10.1007/978-981-16-0163-7
Anthropological Survey of India (2021b). “Mitochondrial DNA phylogeny of M haplogroup in Indian population,” in Genomic diversity in people of India (Singapore: Springer Singapore), 11–81. doi:10.1007/978-981-16-0163-7_2
Anthropological Survey of India (2021c). “Y-chromosome phylogeny in Indian population,” in Genomic diversity in people of India (Singapore: Springer Singapore), 145–169. doi:10.1007/978-981-16-0163-7_6
Bamshad, M., Kivisild, T., Watkins, W. S., Dixon, M. E., Ricker, C. E., Rao, B. B., et al. (2001). Genetic evidence on the origins of Indian caste populations. Genome Res. 11, 994–1004. doi:10.1101/gr.gr-1733rr
Basu, A., Sarkar-Roy, N., and Majumder, P. P. (2016). Genomic reconstruction of the history of extant populations of India reveals five distinct ancestral components and a complex structure. Proc. Natl. Acad. Sci. U. S. A. 113, 1594–1599. doi:10.1073/pnas.1513197113
Bhat, D. N. S. (1971). The Koraga Language. Pune: Deccan College Postgraduate and Research Institute.
Bouckaert, R., Vaughan, T. G., Barido-Sottani, J., Duchêne, S., Fourment, M., Gavryushkina, A., et al. (2019). BEAST 2.5: an advanced software platform for Bayesian evolutionary analysis. PLOS Comput. Biol. 15, e1006650. doi:10.1371/journal.pcbi.1006650
Brandini, S., Bergamaschi, P., Cerna, M. F., Gandini, F., Bastaroli, F., Bertolini, E., et al. (2018). The paleo-Indian entry into South America according to mitogenomes. Mol. Biol. Evol. 35, 299–311. doi:10.1093/molbev/msx267
Caldwell, R. (1856). A Comparative Grammar of the Dravidian or South-Indian Family of Languages. London: Harrison and Sons.
Capodiferro, M. R., Aram, B., Raveane, A., Rambaldi Migliore, N., Colombo, G., Ongaro, L., et al. (2021). Archaeogenomic distinctiveness of the Isthmo-Colombian area. Cell 184, 1706–1723. doi:10.1016/j.cell.2021.02.040
Chandrasekar, A., Kumar, S., Sreenath, J., Sarkar, B. N., Urade, B. P., Mallick, S., et al. (2009). Updating phylogeny of mitochondrial DNA macrohaplogroup M in India: dispersal of modern human in South asian corridor. PLoS One 4, e7447. doi:10.1371/journal.pone.0007447
Chandrasekar, A., Saheb, S. Y., Gangopadyaya, P., Gangopadyaya, S., Mukherjee, A., Basu, D., et al. (2007). YAP insertion signature in South Asia. Ann. Hum. Biol. 34, 582–586. doi:10.1080/03014460701556262
Chaubey, G., Metspalu, M., Choi, Y., Mägi, R., Romero, I. G., Soares, P., et al. (2011). Population genetic structure in indian austroasiatic speakers: the role of landscape barriers and sex-specific admixture. Mol. Biol. Evol. 28, 1013–1024. doi:10.1093/molbev/msq288
Connell, J. R., Benton, M. C., Lea, R. A., Sutherland, H. G., Chaseling, J., Haupt, L. M., et al. (2022). Pedigree derived mutation rate across the entire mitochondrial genome of the Norfolk Island population. Sci. Rep. 12, 6827. doi:10.1038/s41598-022-10530-3
Cordaux, R., Aunger, R., Bentley, G., Nasidze, I., Sirajuddin, S. M., and Stoneking, M. (2004). Independent origins of Indian caste and tribal paternal lineages. Curr. Biol. 14, 231–235. doi:10.1016/j.cub.2004.01.024
Cordaux, R., Saha, N., Bentley, G. R., Aunger, R., Sirajuddin, S. M., and Stoneking, M. (2003). Mitochondrial DNA analysis reveals diverse histories of tribal populations from India. Eur. J. Hum. Genet. 11, 253–264. doi:10.1038/sj.ejhg.5200949
Debnath, M., Palanichamy, M. G., Mitra, B., Jin, J.-Q., Chaudhuri, T. K., and Zhang, Y.-P. (2011). Y-chromosome haplogroup diversity in the sub-Himalayan Terai and Duars populations of East India. J. Hum. Genet. 56, 765–771. doi:10.1038/jhg.2011.98
Ehler, E., Novotný, J., Juras, A., Chyleński, M., Moravčík, O., and Pačes, J. (2019). AmtDB: a database of ancient human mitochondrial genomes. Nucleic Acids Res. 47, D29–D32. doi:10.1093/nar/gky843
Excoffier, L., and Lischer, H. E. L. (2010). Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol. Ecol. Resour. 10, 564–567. doi:10.1111/j.1755-0998.2010.02847.x
Forster, L., Forster, P., Lutz-Bonengel, S., Willkomm, H., and Brinkmann, B. (2002). Natural radioactivity and human mitochondrial DNA mutations. Proc. Natl. Acad. Sci. 99, 13950–13954. doi:10.1073/pnas.202400499
Grant, W. A. S., and Bowen, B. W. (1998). Shallow population histories in deep evolutionary lineages of marine fishes: insights from sardines and anchovies and lessons for conservation. J. Hered. 89, 415–426. doi:10.1093/jhered/89.5.415
Gupta, R., Ratan, A., Rajesh, C., Chen, R., Kim, H. L., Burhans, R., et al. (2012). Sequencing and analysis of a South Asian-Indian personal genome. BMC Genomics 13, 440. doi:10.1186/1471-2164-13-440
Haber, M., Platt, D. E., Ashrafian Bonab, M., Youhanna, S. C., Soria-Hernanz, D. F., Martínez-Cruz, B., et al. (2012). Afghanistan’s ethnic groups share a Y-chromosomal heritage structured by historical events. PLoS One 7, 34288. doi:10.1371/journal.pone.0034288
Ingman, M., and Gyllensten, U. (2003). Mitochondrial genome variation and evolutionary history of Australian and New Guinean aborigines. Genome Res. 13, 1600–1606. doi:10.1101/gr.686603
Kivisild, T., Bamshad, M. J., Kaldma, K., Metspalu, M., Metspalu, E., Reidla, M., et al. (1999). Deep common ancestry of Indian and Western-Eurasian mitochondrial DNA lineages. Curr. Biol. 9, 1331–1334. doi:10.1016/S0960-9822(00)80057-3
Kivisild, T., Rootsi, S., Metspalu, M., Mastana, S., Kaldma, K., Parik, J., et al. (2003). The genetic heritage of the earliest settlers persists both in Indian tribal and caste populations. Am. J. Hum. Genet. 72, 313–332. doi:10.1086/346068
Lacau, H., Gayden, T., Regueiro, M., Chennakrishnaiah, S., Bukhari, A., Underhill, P. A., et al. (2012). Afghanistan from a Y-chromosome perspective. Eur. J. Hum. Genet. 20, 1063–1070. doi:10.1038/ejhg.2012.59
Leigh, J. W., and Bryant, D. (2015). POPART: full-feature software for haplotype network construction. Methods Ecol. Evol. 6 (9), 1110–1116. doi:10.1111/2041-210X.12410
McAlpin, D. W. (1981). “Proto Elamo Dravidian: The evidence and its implications,” in Transactions of the American Philosophical Society (Philadelphia: American Philosophical Society), 71. 3.
Metspalu, M., Cui, Y., and Wu, R. (2004). A multilocus likelihood approach to joint modeling of linkage, parental diplotype and gene order in a full-sib family. BMC Genet. 5, 20. doi:10.1186/1471-2156-5-20
Mustak, M. S., Rai, N., Naveen, M. R., Prakash, S., Carlus, S. J., Pasupuleti, N., et al. (2019). The peopling of Lakshadweep archipelago. Sci. Rep. 9, 6968–6977. doi:10.1038/s41598-019-43384-3
Olivieri, A., Achilli, A., Pala, M., Battaglia, V., Fornarino, S., Al-Zahery, N., et al. (2006). The mtDNA legacy of the levantine early Upper Palaeolithic in Africa. Science 314 (80-), 1767–1770. doi:10.1126/science.1135566
Olivieri, A., Sidore, C., Achilli, A., Angius, A., Posth, C., Furtwängler, A., et al. (2017). Mitogenome diversity in Sardinians: a genetic window onto an island’s past. Mol. Biol. Evol. 34, 1230–1239. doi:10.1093/molbev/msx082
Pagani, L., Colonna, V., Tyler-Smith, C., and Ayub, Q. (2017). An ethnolinguistic and genetic perspective on the origins of the Dravidian-speaking Brahui in Pakistan. Man In India 97, 267–278.
Palanichamy, M. G., Mitra, B., Zhang, C. L., Debnath, M., Li, G. M., Wang, H. W., et al. (2015). West Eurasian mtDNA lineages in India: an insight into the spread of the Dravidian language and the origins of the caste system. Hum. Genet. 134, 637–647. doi:10.1007/s00439-015-1547-4
Palanichamy, M. G., Sun, C., Agrawal, S., Bandelt, H.-J., Kong, Q.-P., Khan, F., et al. (2004). Phylogeny of mitochondrial DNA macrohaplogroup N in India, based on complete sequencing: implications for the peopling of South Asia. Am. J. Hum. Genet. 75, 966–978. doi:10.1086/425871
Poloni, E. S., and Nicolas Ray, S. S. (2000). “Languages and genes: modes of transmission observed through the analysis of male-specific and female-specific genes,” in Proceedings: evolution of language, 3rd international conference 3-6 april. Editors D. Jean-Louis,, and G. Laleh (Paris: École Nationale Supérieure des Télécommunications), 185–186.
Poloni, E. S., Semino, O., Passarino, G., Santachiara-Benerecetti, A. S., Dupanloup, I., Langaney, A., et al. (1997). Human genetic affinities for Y-chromosome P49a,f/TaqI haplotypes show strong correspondence with linguistics (cf. The erratum published in 1998 in the American journal of human genetics, 62: 1267. Am. J. Hum. Genet. 61, 1015–1035. doi:10.1086/301602
Poznik, G. D., Xue, Y., Mendez, F. L., Willems, T. F., Massaia, A., Wilson Sayres, M. A., et al. (2016). Punctuated bursts in human male demography inferred from 1,244 worldwide Y-chromosome sequences. Nat. Genet. 48, 593–599. doi:10.1038/ng.3559
Qamar, R., Ayub, Q., Mohyuddin, A., Helgason, A., Mazhar, K., Mansoor, A., et al. (2002). Y-chromosomal DNA variation in Pakistan. Am. J. Hum. Genet. 70, 1107–1124. doi:10.1086/339929
Quintana-Murci, L., Krausz, C., Zerjal, T., Sayar, S. H., Hammer, M. F., Mehdi, S. Q., et al. (2001). Y-chromosome lineages trace diffusion of people and languages in southwestern Asia. Am. J. Hum. Genet. 68, 537–542. doi:10.1086/318200
Registrar General & Census Commissioner of India (2011). “C-16 population by mother tongue”, Census of India. Delhi: Ministry of Home Affairs.
Reich, D., Thangaraj, K., Patterson, N., Price, A. L., and Singh, L. (2009). Reconstructing Indian population history. Nature 461, 489–494. doi:10.1038/nature08365
Roychoudhury, S., Roy, S., Basu, A., Banerjee, R., Vishwanathan, H., Usha Rani, M. V., et al. (2001). Genomic structures and population histories of linguistically distinct tribal groups of India. Hum. Genet. 109, 339–350. doi:10.1007/s004390100577
Schönherr, S., Weissensteiner, H., Kronenberg, F., and Forer, L. (2023). Haplogrep 3 - an interactive haplogroup classification and analysis platform. Nucleic Acids Res. 51, W263–W268. doi:10.1093/nar/gkad284
Shah, A. M., Tamang, R., Moorjani, P., Rani, D. S., Govindaraj, P., Kulkarni, G., et al. (2011). Indian siddis: African descendants with Indian admixture. Am. J. Hum. Genet. 89 (1), 154–161. doi:10.1016/j.ajhg.2011.05.030
Sharma, S. (2009). The Indian origin of paternal haplogroup R1a1* substantiates the autochthonous origin of Brahmins and the caste system (cf. the erratum published in 2014 in the Journal of Human Genetics, 59:7, 415. J. Hum. Genet. 54–416. doi:10.1038/jhg.2008.2
Silva, M., Oliveira, M., Vieira, D., Brandão, A., Rito, T., Pereira, J. B., et al. (2017). A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals. BMC Evol. Biol. 17, 88. doi:10.1186/s12862-017-0936-9
Stuart, H. A. (1893). Census of India 1891 Volume XIII Madras - The Report of the Census. Madras: Government Press.
Sylvester, C., Krishna, M. S., Rao, J. S., and Chandrasekar, A. (2019). Maternal genetic link of a South Dravidian tribe with native Iranians indicating bidirectional migration. Ann. Hum. Biol. 46, 175–180. doi:10.1080/03014460.2019.1599067
Thangaraj, K., Chaubey, G., Singh, V. K., Vanniarajan, A., Thanseem, I., Reddy, A. G., et al. (2006). In situ origin of deep rooting lineages of mitochondrial Macrohaplogroup “M” in India. BMC Genomics 7, 151. doi:10.1186/1471-2164-7-151
Thangaraj, K., Chaubey, G., Kivisild, T., Reddy, A. G., Singh, V. K., Rasalkar, A. A., et al. (2005). Reconstructing the origin of Andaman Islanders. Sci. (80) 308, 996. doi:10.1126/science.1109987
Thangaraj, K., Joshi, M. B., Reddy, A. G., Gupta, N. J., Chakravarty, B., and Singh, L. (2002). CAG repeat expansion in the androgen receptor gene is not associated with male infertility in Indian populations. J. Androl. 23 (6), 815–818. doi:10.1002/j.1939-4640.2002.tb02338.x
Thangaraj, K., Ramana, G. V., and Singh, L. (1999). Y-chromosome and mitochondrial DNA polymorphisms in Indian populations. Electrophoresis 20, 1743–1747. doi:10.1002/(SICI)1522-2683(19990101)20:8<1743::AID-ELPS1743>3.0.CO;2-V
Thanseem, I., Thangaraj, K., Chaubey, G., Singh, V. K., Vks Bhaskar, L., Reddy, B. M., et al. (2006). Genetic affinities among the lower castes and tribal groups of India: inference from Y chromosome and mitochondrial DNA. BMC Genet. 7, 42. doi:10.1186/1471-2156-7-42
Thurston, E., and Rangachari, K. (1909). Castes and Tribes of Southern India vol-III. Madras: Government Press.
Tournebize, R., Chu, G., and Moorjani, P. (2022). Reconstructing the history of founder events using genome-wide patterns of allele sharing across individuals. PLOS Genet. 18, e1010243. doi:10.1371/journal.pgen.1010243
Trivedi, R., Sahoo, S., Singh, A., Bindu, G. H., Banerjee, J., Tandon, M., et al. (2008). Genetic imprints of Pleistocene origin of Indian populations: a comprehensive phylogeographic sketch of Indian Y-chromosomes. Int. J. Hum. Genet. 8, 97–118. doi:10.1080/09723757.2008.11886023
van Driem, G. (2013). East asian ethnolinguistic phylogeography. Bull. Chin. Linguist. 7, 135–188. doi:10.1163/2405478x-90000111
van Driem, G. (2014). “A prehistoric thoroughfare between the Ganges and the Himalayas,” in 50 Years after Daojali-Hading: Emerging Perspectives in the Archaeology of Northeast India. Editors T. Jamir, and M. Hazarika (New Delhi: Research India Press), 60–98.
van Driem, G. (2012). “Etyma, shouldered adzes and molecular variants,” in Methods in Contemporary Linguistics. Editors E. Andrea, L. Adrian, and W. Bernhard (Berlin: Mouton de Gruyter), 335–361. doi:10.1515/9783110275681.335
van Oven, M., and Kayser, M. (2009). Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum. Mutat. 30, E386–E394. doi:10.1002/humu.20921
Vishwanathan, H., Deepa, E., Cordaux, R., Stoneking, M., Usha Rani, M. V., and Majumder, P. P. (2004). Genetic structure and affinities among tribal populations of southern India: a study of 24 autosomal DNA markers. Ann. Hum. Genet. 68, 128–138. doi:10.1046/j.1529-8817.2003.00083.x
Walhouse, M. J. (1875). Some account of a leaf-wearing tribe on the western coast of India. J. Anthropol. Inst. G. B. Irel. 4, 369–376. doi:10.2307/2840981
Zerjal, T., Dashnyam, B., Pandya, A., Kayser, M., Roewer, L., Santos, F. R., et al. (1997). Genetic relationships of Asians and Northern Europeans, revealed by Y-chromosomal DNA analysis. Am. J. Hum. Genet. 60, 1174–1183.
Zvelebil, K. (1990). Dravidian Linguistics: An Introduction. Pondicherry Institute of Linguistics and Culture.
Zvelebil, K. V. (1982). Bëṭṭʉ Kuṟumba: first report on a tribal language. J. Am. Orient Soc. 102, 523. doi:10.2307/602307
Keywords: mtDNA, North Dravidian, Koraga tribe, Indus civilisation, tribes of India, Caucasus, last glacial maximum
Citation: Sequeira JJ, Vinuthalakshmi K, Das R, van Driem G and Mustak MS (2024) The maternal U1 haplogroup in the Koraga tribe as a correlate of their North Dravidian linguistic affinity. Front. Genet. 14:1303628. doi: 10.3389/fgene.2023.1303628
Received: 28 September 2023; Accepted: 31 December 2023;
Published: 07 February 2024.
Edited by:
Sankar Subramanian, University of the Sunshine Coast, AustraliaReviewed by:
Niraj Rai, Birbal Sahni Institute of Palaeobotany, IndiaEaaswarkhanth Muthukrishnan, Georgia Institute of Technology, United States
Magdalena Gayà-Vidal, Centro de Investigacao em Biodiversidade e Recursos Geneticos (CIBIO-InBIO), Portugal
Copyright © 2024 Sequeira, Vinuthalakshmi, Das, van Driem and Mustak. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mohammed S. Mustak, bXNtdXN0YWtAZ21haWwuY29t; George van Driem, Z2VvcmdlLnZhbmRyaWVtQHVuaWJlLmNo
†These authors have contributed equally to this work and share first authorship