ORIGINAL RESEARCH article

Front. Plant Sci., 02 April 2025

Sec. Plant Symbiotic Interactions

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1539151

This article is part of the Research TopicEnhancing Plant Stress Resilience: Innovative Roles of Plant Growth-Promoting Bacteria (PGPB) in Sustainable Agriculture and Crop ProtectionView all 3 articles

Pangenome analysis indicates evolutionary origins and genetic diversity: emphasis on the role of nodulation in symbiotic Bradyrhizobium

Leonardo Araujo Terra&#x;Leonardo Araujo Terra1†Milena Serenato Klepa&#x;Milena Serenato Klepa1†Marco Antonio Nogueira,&#x;Marco Antonio Nogueira1,2†Mariangela Hungria,*&#x;Mariangela Hungria1,2*†
  • 1CNPq, Brasília, Brazil
  • 2Soil Biotechnology Laboratory, Embrapa Soja, Londrina, Paraná, Brazil

The Bradyrhizobium genus is widely known for encompassing many species capable of forming nodules and establishing the biological nitrogen fixation process with several legumes, significantly contributing to agriculture and environmental sustainability. Despite its importance, questions about the evolution, pangenome, and symbiotic genes of Bradyrhizobium are still poorly understood. In this study, we analyzed the pangenome of a set of Bradyrhizobium symbiotic species using the Roary and GET_HOMOLOGUES tools in strains originated from the Northern and Southern Hemispheres. We also investigated the presence and correlation of the fix, nif, nod, Type III secretion system (T3SS) and their effector proteins, and T4SS genes, trying to find differences between clades, hosts, and biogeographic origin. Pangenome analysis of Bradyrhizobium species from the Northern and Southern Hemispheres provided valuable insights into their diversity, biogeography, origin, and co-evolution with their legume host plants. The genus possesses a relatively small core genome compared to the expanded accessory genome, a key feature that facilitates genetic exchange and acquisition of new genes, allowing adaptation to a variety of environments. Notably, the presence or absence of T3SS effector proteins varied significantly according to the geographic location, suggesting specific environmental adaptations, as well as a direct relationship with nodulation genes. Comparative analysis indicated that symbiotic Bradyrhizobium species originated in the Northern Hemisphere and present a greater diversity of orthologous groups than those from the Southern Hemisphere. These results contribute to our understanding of the evolutionary history of these symbiotic bacteria.

1 Introduction

Legumes represent a family of plants that are widely distributed around the world and have coevolved with nitrogen-fixing symbiotic bacteria for millions of years (Doyle, 2011; Sprent et al., 2017). The wide variety of adaptations resulting from this coevolution between plants and bacteria has allowed the occupation of diverse ecosystems, from arid regions to tropical forests (Werner et al., 2014). The geographic distribution of legumes and their close relationships with symbiotic-nodulating bacteria, such as those of the Bradyrhizobium genus, reveal patterns related to evolutionary history and biogeography, adapting to different environmental conditions and playing important roles in ecosystems and agriculture (Parker, 2012; Sprent et al., 2017).

Species belonging to the genus Bradyrhizobium are cosmopolitan bacteria that occupy diverse ecosystems and habitats around the world. The biogeography of these bacteria plays a fundamental role in adapting to a variety of global environments (Hungria et al., 2015; Sprent et al., 2017; Bouznif et al., 2019), contributing to the maintenance of ecosystems, sustainability, and agricultural productivity (Hungria et al., 2015).

Within the genus Bradyrhizobium, several species depend on nodulation (nod) genes that encode Nod factors, signaling molecules essential for host plant recognition and nodule formation in legume roots (Ormeño-Orrillo and Martínez-Romero, 2019). This genus is also an important target for studies on nif (nitrogen fixation) and fix genes that encode proteins essential for biological nitrogen fixation (BNF). The nif genes are responsible for the synthesis and functioning of nitrogenase, an enzyme complex that converts molecular nitrogen (N2) into ammonia (NH3+), while the fix genes act in the regulation and maintenance of nitrogenase activity (Menna and Hungria, 2011). Furthermore, the Bradyrhizobium genus is also studied in relation to nodulation genes (nod), Nod Factors, the Type III Secretion System (T3SS) and its effector proteins (T3E), which play fundamental roles in nodule formation (Giraud et al., 2007; Teulet et al., 2022).

With the establishment of symbioses, biological nitrogen fixation takes place, and it is recognized as the second most important process for plant growth after photosynthesis (Hungria and Nogueira, 2023). In general, phylogenetic and genomic analyses of rhizobial strains have provided valuable information about their evolution and geographic distribution (Stępkowski et al., 2007; Avontuur et al., 2019).

Advances in DNA sequencing technologies have enabled the generation of data from sequenced bacterial genomes, enabling pangenome analyses, which represent a valuable tool for exploring genetic diversity within bacterial species (McInerney et al., 2017; Brockhurst et al., 2019). The concept of pangenome encompasses all genes identified in a set of strains or representants of a given taxonomic group. It considers genes shared by all strains or species, known as the core genome and represented by genes predicted to encode common and vital functions, as well as genes that occur exclusively in a few organisms, representing the accessory genome and conferring the ability to adapt to different lifestyles. This approach reveals the intrinsic genomic diversity of a bacterial species or genus and the ability to adapt to different ecological environments (Tettelin et al., 2008; Brockhurst et al., 2019).

Some studies have explored the pangenomes of strains of the same species, for example, Escherichia coli (Rasko et al., 2008) and Pseudomonas aeruginosa (Mosquera-Rendón et al., 2016), of species within a genus, e.g., Burkholderia (Bach et al., 2022), Rhizobium (González et al., 2019), or even at the order level, for example, Rhizobiales (Rosselli et al., 2021), and the results obtained have advanced the understanding of these microorganisms. An analysis of the Burkholderia pangenome showed the presence of genes acquired via HGT from other microorganisms, contributing to their ability to survive in the rhizosphere (Bach et al., 2022). Similarly, a pangenome analysis of Bradyrhizobium revealed genetic traits associated with nitrogen fixation, highlighting the role of genetic diversity in shaping symbiotic efficiency (Zhong et al., 2024). Furthermore, the modular nature of symbiotic genes in Bradyrhizobium allows for evolutionary stability and flexibility, with horizontal gene transfer (HGT) playing a key role in the maintenance and reorganization of symbiotic traits (Weisberg et al., 2022).

In this study, we investigated the genetic diversity of the nif/fix, nod and T3SS genes and their effector proteins, as well as the biogeographic distribution of nod-dependent of Bradyrhizobium species isolated from legume nodules around the world. Through pangenome analysis, we sought to understand how these genes are organized and how their variations are associated with the different environments of the Northern and Southern Hemispheres. Our results provide new insights into the genetic diversity and evolution of these bacteria, highlighting the differences associated with the biogeography of the species in each hemisphere.

2 Materials and methods

2.1 Selection of symbiotic Bradyrhizobium species

A total of 54 genomes from different Bradyrhizobium species were retrieved from the GenBank database on 02/04/2023, based on well-documented geographic identification. Genome selection followed the following criteria: (1) presence of Bradyrhizobium strains isolated from root nodules of wild or cultivated legumes; (2) presence of nif/fix genes; and (3) presence of nod genes. In addition, genomes from phylogenetically distinct strains, unusually numerous single genes, and reduced number of genes in the accessory genome were excluded from the analysis. Based on these criteria, a total of 44 symbiotic nitrogen-fixing Bradyrhizobium genomes dependent on nod factors were selected. Of these, 11 were completely sequenced, while 33 were draft genomes (Supplementary Table S1).

2.2 Pangenome

2.2.1 Pangenome construction

To construct the pangenome, we used two programs: Roary, with option “-i 90” (v3.13.0) (Page et al., 2015), and GET_HOMOLOGUES, with option “-M -t 0 -i90” (v15052022) (Contreras-Moreira and Vinuesa, 2013). The GFF3 and gbk files of annotated genomes from Prokka (v1.14.6) (Seemann, 2014) were used as inputs for Roary and GET_HOMOLOGUES, respectively. Genome completeness and contamination were assessed via CheckM v.1.2.1 (Parks et al., 2015) and assembly quality parameters were analyzed via QUAST v.5.2.0 (Gurevich et al., 2013).

2.2.2 Core genome phylogeny

The phylogenetic tree was built on the basis of the core clusters identified via the CoreCruncher tool (Harris et al., 2021). Protein sequences with 90% identity, sizes of 80%, and frequencies of 100% were aligned via the –MAFFT (v7.490) program (Katoh and Standley, 2013) and concatenated. Multiple sequence alignment was used to construct phylogenetic trees via the LG+G+I+F model in MEGA 11 software (v11.0.13) (Tamura et al., 2021), with a bootstrap value of 100.

For phylogeographic analysis, alignments obtained from the core genome were analyzed using the BEAST tool (v1.10.4). Inference was performed with the Blosum62 substitution model with the Strict Clock molecular clock model and Symmetric CTMC (Continuous-Time Markov Chain) phylogeographic model. The population structure was modeled using a coalescent model, while geographic states were defined as Northern Hemisphere (N) and Southern Hemisphere (S). Bayesian inference was conducted with a Markov Chain Monte Carlo (MCMC) of 10 million iterations and Effective Sample Size (ESS) values greater than 200 (Supplementary Figure S1).

2.3 Prediction of Fix/Nif, Nod, T3SS, Type III effectors and T4SS clusters

The protein identification data were obtained from the GenBank database via the National Center for Biotechnology Information (NCBI) website in April 2023. The sequences of proteins involved in biological nitrogen fixation (Fix/Nif) (Supplementary Table S2), nodulation (Nod) (Supplementary Table S3), and Type III effector proteins were searched in the NCBI database, extracted from the reference genomes of B. diazoefficiens USDA110, B. elkanii USDA61, and Sinorhizobium fredii HH103 and used to identify homologous proteins via the BLASTP Local tool (Altschul et al., 1990). We set the parameters of 50% coverage and 50% identity. The MacsyFinder (v2.1) (Abby et al., 2014) tool with default settings and hmm TXSS profiles were used to annotate the Type III and Type IV secretion systems (Supplementary Table S4).

2.4 Enrichment analysis of gene clusters

The Scoary v. 1.6.16 (Brynildsrud et al., 2016) tool with the options “–collapse - and 100” was used to perform genome-wide association study (GWAS) analysis of the gene content present in the B. jicamae, B. elkanii, and B. japonicum clades regarding biogeography. The output files with 100% specificity were used as criteria to filter the proteins.

Accessory genome sequences were extracted via the Seqkit (v2.3.0) tool (Shen et al., 2016). The functional annotation of pangenomes was performed with the EggNOG-mapper (Huerta-Cepas et al., 2017). COG and KEGG annotations were derived from the EggNOG mapper results. We considered a function enriched if the p value and q value were less than 0.05, which controlled the expected proportion of false positives at 0.05.

Insertion sequences present in the genome were assessed via genome annotations and ISESCAN 1.7.2.3 (Xie and Tang, 2017).

2.5 Statistical analyses

Plots of the core and accessory genomes were generated via the R package Pagoo (Ferrés and Iraola, 2021). Histograms were generated via the native R package. UpSet graphs were generated via the UpSetR package (Conway et al., 2017). Gene presence/absence heatmaps were generated via the R package pheatmap. The map was generated via the matplotlib library, and the interactive map was generated via the folium library via the Python language.

3 Results

3.1 Phylogeny of the core genome and genomic distribution of the fix, nif, nod, T3SS, and T3E genes

The phylogenetic tree constructed from 163 concatenated core proteins separated the Bradyrhizobium species under study into two superclades, one with a unique clade, B. japonicum, and the second with two clades, B. elkanii and B. jicamae (Figure 1).

Figure 1
www.frontiersin.org

Figure 1. Representation of the absence and presence of genes involved in the metabolism of biological nitrogen fixation, nodulation, the T3SS, and candidate genes for T3SS and T3E effectors in symbiotic Bradyrhizobium. The phylogenetic tree is classified according to the clades: B. japonicum (purple), B. elkanii (green), and B. jicamae (orange). Red upward arrows indicate bacteria isolated in the Northern Hemisphere, whereas yellow downward arrows indicate bacteria isolated in the Southern Hemisphere. The colors in the heatmap represent the number of proteins found in each strain, following a variation scale. The scale bar represents the number of amino acid substitutions per site.

We found conserved genes associated with the Bradyrhizobium genome: fix, nif, nod, T3SS, and their effector proteins. It is worth to mention that from the 44 genomes studied here, 34 belong to at least 10 different symbiovars (Martinez-Romero et al., 2024) (Supplementary Table S1). A detailed analysis of these genes revealed that 11% of the Bradyrhizobium strains did not exhibit T3SS genes in their genomic repertoire (Figure 1).

The presence and absence of fix, nif, nod and T3SS genes, as well as their effector proteins and T4SS were correlated with clade, geographic distribution and/or host plant (Supplementary Figures S2–S16). In genes related to biological nitrogen fixation, a low occurrence of the FixK protein, positive regulator that controls nitrogenase activity, was observed in the B. elkanii clade, whereas its presence was confirmed in all strains of the B. jicamae and B. japonicum clades, except for the strain B. vignae LMG 28791 of the B. japonicum clade. The FixR protein, oxidoreductase activity, was abundant in all clades with multiple copies, except for B. rifense with only one copy. Among the Nod proteins, NodF protein, acyl carrier, was detected only in some members of the B. jicamae clade,. while the NodX protein, O-acetyltransferase, was found only in the B. elkanii clade. The NodL protein, O-acetyltransferase, is absent in the B. japonicum clade. Regarding T3SS proteins and their effector proteins, NopA protein and the T3E proteins Bel2-5, ErnA, NopA, NopAG, NopAJ, NopAC, NopC, NopD, NopF, and NopJ were not identified in the B. jicamae clade, whereas InnB and NopL were widely found in this clade.

Regarding the geographic distribution and hosts, Bradyrhizobium isolated from Chamaecytisus prolifer, Cytisus villous and Lupinius angustifolius, plants located in North Africa, do not present T3SS genes and their effectors. The only exception is B. diversitatis, isolated from soybean (Glycine max) in Australia, which also does not present these genes. Interestingly, all these strains belong to the B. japonicum clade. The absence of T3SS and its effector proteins indicates that some Bradyrhizobium species can establish symbiosis exclusively through Nod Factors. Furthermore, the presence of T3SS can activate immunity caused by effects on certain host plants, which would result in the infection of the symbiosis by the plant.

With respect to the genes related to their hosts and the geographic distribution of the T3SS proteins and their effector proteins, Bel2-5 and NopD proteins were exclusive to the soybean plant (Glycine max), and the NopAG and NopF proteins were predominant in this host plant. The NopE, NopH, NopAA, and NopT proteins were predominant in the Northern Hemisphere. In the Southern Hemisphere, the NopX, NopA, InnB, and NopP proteins were predominant. The NopP2 protein is commonly distributed in all clades and hemispheres.

The transcriptional regulator gene ttsL, which is important for T3SS activation in the nodulation process, was identified in all the genomes containing T3SS genes. The absence of both ttsL and T3SS genes may indicate simultaneous horizontal transfer of these elements. The exception was B. diversitatis CNPSo 4019, which lacks the T3SS but carries the ttsL gene, which may indicate another biological function for this gene (Figure 1).

In particular, among the proteins associated with the T3SS structure, strains that possess the NopX protein do not present NopE or NopH, and vice versa. The presence of NopX is strongly correlated with that of NopA, suggesting a possible biological interaction. Furthermore, NopX is absent in the B. jicamae clade. On the other hand, NopE and NopH proteins are predominant in the B. japonicum clade and present a strong correlation with the SctN protein (Figure 1).

Interestingly, our results indicate that the presence of NopE, NopH and NopX proteins is related to host specificity, indicating that different legume hosts have a preference for these proteins. Furthermore, the NopX protein occurs exclusively in species from Brazil, South Africa, and Colombia and in some species from Central America, the USA and China. This protein is absent in species from Canada, North Africa, and Oceania. The NopE and NopH proteins are predominant in North America, Asia and Oceania (See Interactive Map, Supplementary File 3).

3.2 Pangenome of symbiotic Bradyrhizobium species

3.2.1 Selection of organisms

The set of 44 Bradyrhizobium genomes consisted of incomplete genome assemblies with contigs and scaffolds and completely sequenced genomes. All other genomes had completeness higher than 97.17% and contamination below 3.11%, ensuring that the genomes were of adequate quality for pangenome analysis (Supplementary Table S1).

3.2.2 Pangenome analysis

Pangenome analysis identifies whether a given taxon has an open or closed genome. The central genes represent the core genome, which are the genes shared by all strains analyzed. This curve shows a decline as new genes are added, indicating a small set of genes strictly conserved in this taxon (Figure 2A). On the other hand, the accessory genome includes variable genes present in some strains and absent in others. The continuous growth of this curve indicates that new accessory genes continue to be added to the pangenome as more genomes are incorporated (Figure 2B). Our results indicate that the total number of gene sets increased without reaching a plateau, characterizing an open pangenome.

Figure 2
www.frontiersin.org

Figure 2. Pangenome characteristics of 44 species of symbiotic Bradyrhizobium. (A) Core genome curve (B) Accessory genome curve.

Analysis of the Roary pangenome revealed a total of 127,128 gene families, close to the 133,100 gene families detected with the GET_HOMOLOGUES. Table 1 presents the number of genes in the pangenome categorized into core, soft-core, shell, and cloud categories via sequence alignment with 90% identity. The results revealed similarities in the absolute distribution values between the core, soft-core, and shell categories.

Table 1
www.frontiersin.org

Table 1. Distribution of pangenomes classified by Roary and GET_HOMOLOGUES.

3.2.3 Northern and Southern Hemisphere pangenomes

3.2.3.1 Genetic interaction of the accessory genome

Genome separation between the Northern (latitude ≥ 0°) and Southern (latitude < 0°) Hemispheres was performed to investigate the impact of biogeography on the evolution and genetic diversity of Bradyrhizobium species dependent on nod factors. Although the exact origin of Bradyrhizobium is still uncertain, comparing genomes between hemispheres may provide insights into its diversification and dispersal.

In the Northern Hemisphere, 9 complete and 15 incomplete genomes were analyzed, with an average completeness of 99.69%. In the Southern Hemisphere, 2 complete and 18 incomplete genomes were identified, with an average of 99.51%. Although the proportion of incomplete genomes is higher in the Southern Hemisphere, the high average completeness in both groups indicates that the analyses of orthologous groups remain reliable and representative.

The geographic distribution of the genomes analyzed spans several continents in different hemispheres. In North Africa, 5 genomes were analyzed, while in South Africa, 4 genomes were analyzed. In North America, 7 genomes were distributed between Canada and the United States. In Central America, 2 genomes were analyzed, distributed in Honduras and Costa Rica.

In South America, a total of 12 genomes were analyzed, with the majority of records originating from Brazil and some from Peru and Colombia. In Australia, 5 genomes were analyzed, reinforcing the representativeness of Oceania. In Asia, the number of genomes identified was 7, covering different regions of China and Japan. Finally, 2 genomes were counted, with samples from Spain.

In the Northern Hemisphere, despite having only four more species than in the Southern Hemisphere did, the accessory genomes presented 1.8 times more orthologous groups, totaling 4,128 clusters between North America, North Africa, and Asia, suggesting high genetic interaction between these species. Notably, species from North Africa share 1,652 orthologous groups, indicating greater genetic interaction than species from South Africa (Figure 3).

Figure 3
www.frontiersin.org

Figure 3. Global distribution of orthologous clusters of the Bradyrhizobium accessory genome by geographic region and between hemispheres. The main graph highlights the hemisphere groupings and specific regions. The secondary graph highlights the African-specific accessory genome clusters, highlighting the specificity of accessory genomes between North Africa and South Africa.

Genetic interactions related to the accessory genome were investigated among species from regions between the Northern and Southern Hemispheres. We identified a robust genetic relationship between species from North America and Asia and between species from Spain, the Canary Islands (a Spanish archipelago in the Atlantic Ocean close to North Africa), and North Africa (Figure 4). In contrast, genetic interactions between species from Asia, the Canary Islands, and North America were more tenuous (Supplementary Figure S17). In the Southern Hemisphere, strong genetic interactions were detected between species from Brazil and South Africa. However, the interaction between Oceania and South America was moderate (Supplementary Figure S18).

Figure 4
www.frontiersin.org

Figure 4. Global distribution and genetic interaction among Bradyrhizobium species, according to the accessory genome. White boxes indicate the sharing of accessory genome gene clusters. Each species is identified by a number: 1. Bradyrhizobium elkanii; 2. B. frederickii; 3. B. ottawaense; 4. B. septentrionale; 5. B. quebecense; 6. B. baranii; 7. B. diazoefficiens; 8. B. jicamae; 9. B. pachyrhizi; 10. B. canariense; 11. B. rifense; 12. B. cytisi; 13. B. valentinum; 14. B. retamae; 15. B. algeriense; 16. B. hipponense; 17. B. nanningense; 18. B. zhengyangense; 19. B. guangzhouense; 20. B. zhanjiangense; 21. B. lablabi; 22. B. daqingense; 23. B. japonicum; 24. B. embrapense; 25. B. paxllaeri; 26. B. icense; 27. B. forestalis; 28. B. manauense; 29. B. tropiciagri; 30. B. centrolobii; 31. B. neotropical; 32. B. uaiense; 33. B. brasilense; 34. B. stylosanthis; 35. B. viridifuturi; 36. B. vignae; 37. B. altum; 38. B. acaciae; 39. B. australafricanum; 40. B. diversitatis; 41. B. archetypum; 42. B. australiense; 43. B. murdochi; 44. B. agreste.

Genetic interactions between the hemispheres that cross the equator were investigated. The B. jicamae clade revealed a genetic interaction pattern connecting North Africa, Oceania, and South America, which indicates a potential gene flow route in this geographical location. In the B. elkanii clade, it is possible to observe a gene flow path in North America, extending through Central and South America and reaching South Africa. The B. japonicum clade included a genetic pathway extending from Asia, North Africa, and South America. Notably, few genetic interactions were detected between species from North Africa and South Africa (Supplementary Figure S19).

3.2.3.2 Orthologous clusters

Comparative analysis of clusters of orthologous groups (COGs) between Bradyrhizobium species from the Northern and Southern Hemispheres revealed different distributions of gene clusters in the accessory genome. A predominance of clusters associated with functions such as replication, recombination and repair (L), transcription (K), amino acid metabolism and transport (E), and signal translation mechanisms (T) was detected in Northern Hemisphere species. In the Southern Hemisphere, transcription (K) and metabolism and transport of amino acids (E) are highlighted, as are groups involved in energy conversion and production (C) (Figure 5). It is important to highlight that the species from the Northern Hemisphere presented a high quantity of transposases within the functional group L.

Figure 5
www.frontiersin.org

Figure 5. Functional distribution of accessory genes in the pangenome according to COG categories: (A) Northern and Southern Hemisphere groups. (B) Specific subgroups of species isolated in North America, South America, Africa, Asia, and Oceania.

A detailed analysis of the clusters showed that species from North America and Oceania presented a reduced number of gene clusters. Notably, species from North America showed a high presence of replication, recombination and repair (L) clusters, suggesting an evolutionary strategy to maintain genomic plasticity. In contrast, species from South America, North Africa and Asia exhibited a greater richness of genes associated with transcription (K), amino acid metabolism and transport (E) and energy conversion and production (C) (Figure 5).

A detailed analysis of the insertion elements corroborated the COG category findings in replication, recombination and repair (L) (Supplementary Figure S20).

4 Discussion

In this study, specific representatives of each Bradyrhizobium species capable of forming nodules from different continental regions were selected, allowing us to investigate the influence of biogeography on the evolution and genetic diversity of these microorganisms. The genetic framework of these genomic representatives includes genes involved in adaptations to different water stresses, changes in pH and temperature, and interactions with their host plants (Freitas et al., 2022; Zhang et al., 2024). These characteristics impose selective pressures on these microorganisms, capturing genomic signatures that are direct results of the environmental pressure from which the organisms were isolated. Pangenome analysis revealed that species isolated in the Northern Hemisphere showed greater sharing of accessory genes compared to the Southern Hemisphere, indicating greater genetic interaction between species in this region. This difference may be associated with ecological factors such as environmental and host diversity, which may have favored greater gene exchange. The abundance of transposases and insertion elements in species from the Northern Hemisphere suggests greater genomic plasticity, which may have facilitated the adaptation of these species to different environments. Furthermore, by selecting genomes from different biogeographic environments, it was possible to understand the conservation of genes critical for local adaptation, evidencing the evolutionary pressure on the diversification of Bradyrhizobium.

Our pangenome study revealed a reduced core genome and a large accessory genome, evidencing broad genetic variability and the ability to adapt to various environments. Our results are in line with the studies by Weisberg et al. (2022) and Zhong et al. (2024) who showed that Bradyrhizobium has a small core genome and an extensive accessory genome, which characterizes an open pangenome. This genomic profile favors the acquisition of new genes via HGT, favoring adaptation to different environmental conditions. The genomic plasticity shown in these studies demonstrates the ability of Bradyrhizobium to colonize a wide variety of hosts.

Molecular evidence from fossils indicates that legumes emerged approximately 60 million years ago (Ma) between the Cretaceous and Cenozoic periods (Herendeen, 1992). At that time, Laurasia comprised a continuous and extensive landmass in the Northern Hemisphere. On the other hand, paleogeographic evidence indicates that during the fragmentation of Gondwana, South America remained connected to Antarctica. These facts disagree with the hypothesis of an African origin of legumes, favoring the boreotropic theory that suggests a Northern Hemisphere origin of legumes, with subsequent southward dispersal (see Figure 4 of Doyle and Luckow, 2003). This interpretation is supported by terrestrial connectivity between Africa and Eurasia during the Eocene and the subsequent union of the Americas (3-4 Ma), offering a more coherent overview of migration routes.

The analysis of the Bradyrhizobium accessory genome revealed a broad interaction of orthologous clusters between Asia and North America, suggesting intense genetic exchange in the Northern Hemisphere (Figure 4). Furthermore, our results revealed a putative gene flow route from B. elkanii from North America through Central America and South America that extends to southern Africa (Figure 4). According to the boreotropic hypothesis, legume dispersion occurs via continental masses, and these plants act as biotic vectors for rhizobia dispersal (Doyle and Luckow, 2003). Furthermore, despite the geological separation of continents before the emergence of legumes, it is possible to observe genetic interactions between Bradyrhizobium species from southern Africa, Oceania, and South America (Figure 4). According to Vinuesa and collaborators (Vinuesa et al., 2005), when studying the migration and speciation of B. canariense and B. japonicum, the most likely means for the transcontinental and hemispheric dispersal of Bradyrhizobium cells are by abiotic vectors, such as dust masses. Soil carrying Bradyrhizobium, which is transported by wind currents, has been reported in Brazil, and dust storms can reach large distances, favoring bacterial dispersal (Ferreira and Hungria, 2002). This hypothesis is reinforced by the results of our studies, which revealed genetic interactions over long distances, covering isolates from Asia, North Africa, and North America. In parallel, we observed short-range genetic interactions between Bradyrhizobium species from the B. jicamae and B. japonicum clades, which were specifically isolated from legumes of the Genistae tribe in the regions of North Africa, the Canary Islands, and Spain (See Interactive Map, Supplementary File 3).

We identified limited genetic interactions between species from North Africa and South Africa, possibly because they contain different biomes, raising significant questions about the dispersal routes of these species across landmasses over time and challenging the two hypotheses presented.

Schrire et al. (2005) carried out extensive work on the biogeography of legumes, revealing many differences between the hemispheres. In the Northern Hemisphere, the distribution of biomes is more homogeneous, favoring greater gene flow and facilitating dispersal and interbreeding between populations. In contrast, the Southern Hemisphere has a greater diversity of biomes, including tropical forests, savannas, and succulent and temperate biomes. This fragmentation of biomes promoted more specific selective pressures, reducing gene flow between populations. Furthermore, their work mentioned that the first legumes evolved in a semiarid region north of the Tethys Sea route, which separated the supercontinents Laurasia and Gondwana.

Our pangenome results for Bradyrhizobium indicate greater genetic sharing of the accessory genome in the Northern Hemisphere than in the Southern Hemisphere, corroborating the biogeographic patterns observed in legumes. These results indicate a possible origin of Bradyrhizobium in the Northern Hemisphere and close and adaptive coevolution with legumes over time (Figure 3, 4, Supplementary Figure S1). Bradyrhizobium has been suggested as the ancestor of all rhizobia (Provorov and Vorob’ev, 2000; Lloret and Martínez-Romero, 2005; Hungria et al., 2015); therefore, a Northern Hemisphere origin would also indicate that bacterial symbiosis originated in this region of the world.

The geological connection between Australia and Antarctica, as part of Gondwana, allowed the migration of plant species between Australia and South America. This connection is evidenced by the presence of the Brongniartieae tribe, a monophyletic clade present on both continents (Sprent et al., 2013). This relationship may clarify the genomic interactions between members of the B. jicamae clade inhabiting Australia and South America.

Some studies have reported that several legume species found in South America, previously considered archaic, are native to North America (Taylor, 1990; Lavin and Luckow, 1993), indicating that legume species in South America are evolutionarily more recent. These studies align with the phylogenetic analysis (Supplementary Figure S1), which revealed that most ancestral nodes are associated with the Northern Hemisphere, supporting the idea that Bradyrhizobium species likely originated in this region before dispersing to the Southern Hemisphere.

The genomes of Hyphomicrobiales (=Rhizobiales), including Bradyrhizobium, are marked by an abundance of insertion elements (Minamisawa et al., 1998; Barros-Carvalho et al., 2019; Bender et al., 2022). These elements are fundamental for the movement of DNA segments, contributing to genomic plasticity and diversity. Our analysis of the pangenome revealed the widespread presence of transposases and insertion elements in the Northern Hemisphere, suggesting progress in the evolutionary process and genomic adaptation compared with those in the Southern Hemisphere. Furthermore, components of the T4SS genetic framework were more abundant in the Northern Hemisphere. The identification of these genetic elements more concentrated in the Northern Hemisphere strains supports the hypothesis that they may have greater nodulation capacity through horizontal gene transfer, a key mechanism for the adaptation and evolution of these symbiotic organisms (Sprent, 1994).

The α-rhizobial T3SS genes and nod genes are coregulated, and their activation is triggered by plant and bacterial flavonoids. In the presence of flavonoids, bacterial NodD activates nod genes, and the regulator TtsL promotes the expression of T3E proteins (Dong and Song, 2020). Studies with nodule isolates belonging to the genus Bradyrhizobium have indicated that 90% of the T3SS genes are conserved (Teulet et al., 2020), confirming the identification rate of 89% of these genes in the species of this study. Species lacking the T3SS are specific to the B. japonicum clade. Furthermore, the presence of up to three noncanonical T3SS clusters in some species suggests a functional complexity that still needs to be understood (Tampakaki, 2014), and further studies are needed to elucidate these multiple T3SS clusters. Symbiovars are groups of strains that share similar symbiotic capabilities, supported by nodC and nifH phylogenies (Rogel et al., 2011). Here, species belonging to the same symbiovar generally exhibit the same T3SS and nod gene repertoire (Figure 1; Supplementary Table S1), supporting Teulet et al. (2020)’s suggestion of a common evolutionary origin for these genes.

The NopA and NopB proteins form needle-shaped extracellular filaments (Teulet et al., 2020), whereas NopX, NopH, and NopE act as translocons anchored inside the cell, facilitating the entry of effector proteins into host cells. Although NopE is recognized as a translocon (Wenzel et al., 2010), the exact function of NopH still requires further study, although this protein has been found in the secretome of B. diazoefficiens USDA110T (Staehelin and Krishnan, 2015). Teulet et al. (2022) proposed that the NopH and NopE proteins perform functions similar to those of NopX in T3E translocation. Our results indicate that the presence of NopX is related to the absence of NopH and NopE, suggesting that these two proteins may have compensatory functions in effector translocation, acting as alternatives when NopX is not present.

Bradyrhizobium species associated with Genisteae legumes include populations from the Canary Islands, North Africa and Spain, which exhibit different profiles regarding the type III secretion system (T3SS) and the TtsL regulator. We observed that species lacking both the T3SS and the TtsL regulator or those that simultaneously present the T3SS with the NopE and NopB proteins are associated with Genisteae legumes (see Interactive Map). In addition, these species, B. canariense, B. hipponense, B. rifense, and B. cytisi, are grouped in the symbiovar genistearum (Martinez-Romero et al., 2024). Pangenome studies with Bradyrhizobium confirmed that most isolates from Genisteae legume tribe lack the T3SS (Zhong et al., 2024). Interestingly, the species lacking the T3SS in this group possess the T4SS (see Interactive Map). A notable example is B. canariense, which lacks the T3SS and TtsL regulator but is able to nodulate several Genisteae and Loteae legumes (Vinuesa et al., 2005), suggesting an evolutionary advantage in these geographic regions. Studies involving the type IV (T4SS) and type VI (T6SS) secretion systems in Bradyrhizobium demonstrated the ability to induce nodulation in legumes, possibly compensating for the absence of the T3SS in the establishment of symbiosis (Banerjee et al., 2019; Tighilt et al., 2022; Wangthaisong et al., 2023).

The T3SS of Bradyrhizobium is encoded by a gene cluster known as Rhizobium-conserved (Rhc), whose evolution is characterized by several gains and losses, indicating specific adaptations (Teulet et al., 2020). The interaction between the NopA and NopX proteins suggests that a specific biological relationship was absent in the B. jicamae clade. These components are predominantly found in strains from the Southern Hemisphere, especially in Brazil, South Africa, and Colombia, whereas NopX is absent in strains from Canada, North Africa, and Oceania. The distribution patterns of the RhcIa and RhcIb clusters, which are associated with NopE/NopH and NopX/NopA, respectively, highlight the dynamic evolution of Bradyrhizobium (Teulet et al., 2020), suggesting that adaptations are influenced by symbiosis with host plants and are related to biogeography.

Variations in the composition of T3SS effector proteins may represent fundamental adaptations for bacteria to avoid plant immune responses or establish specific interactions and play a role in the transport and translocation of bacterial effector proteins into host plant cells (Staehelin and Krishnan, 2015; Miwa and Okazaki, 2017). The T3E proteins ErnA, NopAC, NopM, NopP, NopP2, and NopT are present in most of the genomes analyzed in our study. The NopM, NopP, and NopT proteins are found in other rhizobial genera (Kimbrel et al., 2013), and Teulet et al. (2020) proposed that these proteins were acquired horizontally by the ancestors of Bradyrhizobium lineages together with the T3SS cluster and then lost in some lineages.

In our study, NopAG and NopF proteins were predominantly found in Bradyrhizobium isolated from soybean (Glycine max), whereas the Bel2-5 and NopD proteins were exclusive to this plant. The NopAG, Bel2-5 and NopD proteins play important roles in inhibiting plant defenses, facilitating symbiosis, and influencing the processing of small plant ubiquitin proteins (Xiang et al., 2020).

The analysis of genomic relationships revealed that B. japonicum and B. daqigense, which were isolated in Asia, indicated that they have accessory genomes related to B. diazoefficiens, B. barrani, and B. ottawaense from North America. The presence of common T3E in all these genomes, with the exception of B. barranii, which lost the Bel2-5 and ErnA proteins, highlights the importance of these effectors in symbiotic interactions. These T3Es are essential to suppress plant defense responses and allow effective colonization (Teulet et al., 2022). The absence of these effectors in B. barranii suggests that this lineage may have adopted other strategies to survive and develop in the environment, possibly compensating for the loss with other symbiotic mechanisms or adjusting to different ecological niches.

5 Conclusion

Our study expands our understanding of the biogeography and evolution of symbiotic species of the genus Bradyrhizobium, providing evidence on their geographic origin and gene flow. Phylogeographic analysis of the core genome revealed that most ancestral lineages were associated with the Northern Hemisphere, indicating this region as the likely origin of these organisms.

In addition, pangenome data indicated a more intense gene flow between species from the Northern Hemisphere compared to the Southern Hemisphere. Our investigation also revealed a diversity of genes associated with symbiotic functions, including fix, nif, nod, and the T3SS and its effector proteins, demonstrating that these genes are correlated with the phylogenetic clades, hosts, and biogeographic conditions in which the species occur.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

LA: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. MK: Data curation, Methodology, Writing – original draft, Writing – review & editing. MN: Funding acquisition, Investigation, Project administration, Writing – review & editing. MH: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. Partially financed by INCT - Plant Growth Promoting Microorganisms for Agricultural Sustainability and Environmental Responsibility (CNPq 465133/2014-4, Fundação Araucária-STI 043/2019, CAPES).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1539151/full#supplementary-material

Supplementary File 1 | (Tables S1, S2, S3 and S4) | Description of the selected genomes, including the identification and selection of fix/nif and nod genes related to nitrogen fixation and symbiosis, genes associated with the type III secretion system (T3SS) and their effector proteins, and the type IV secretion system (T4SS).

Supplementary File 2 | (Figures S1, S2, S3, S4, S5, S6, S7, S8, S9, S10, S11, S12, S13, S14, S15, S16, S17, S18 and S19) | Heat map representing the distribution of fix/nif, nod, T3SS and their effector proteins and T4SS genes in comparison to the different hemispheres, clades (Bradyrhizboium jicamae, B. elkanii and B. japonicum) and their respective hosts. UpSet plots demonstrating the sharing of accessory genome gene clusters between species from the Northern and Southern Hemispheres. Analysis of the distribution of insertion elements in Bradyrhizobium analyzed by different geographic regions and comparisons between hemispheres.

Supplementary File 3 | Instruction guide for using the interactive map.

References

Abby, S. S., Néron, B., Ménager, H., Touchon, M., Rocha, E. P. C. (2014). MacSyFinder: A program to mine genomes for molecular systems with an application to CRISPR-Cas systems. PloS One 9, e110726. doi: 10.1371/journal.pone.0110726

PubMed Abstract | Crossref Full Text | Google Scholar

Altschul, S. F., Gish, W., Miller, W., Myers, E. W., Lipman, D. J. (1990). Basic local alignment search tool. J. Mol. Biol. 215, 403–410. doi: 10.1016/S0022-2836(05)80360-2

PubMed Abstract | Crossref Full Text | Google Scholar

Avontuur, J. R., Palmer, M., Beukes, C. W., Chan, W. Y., Coetzee, M. P. A., Blom, J., et al. (2019). Genome-informed Bradyrhizobium taxonomy: Where to from here? Systematic Appl. Microbiol. 42, 427–439. doi: 10.1016/j.syapm.2019.03.006

PubMed Abstract | Crossref Full Text | Google Scholar

Bach, E., Sant’Anna, F. H., Seger, G. D., dos, S., Passaglia, L. M. P. (2022). Pangenome inventory of Burkholderia sensu lato, Burkholderia sensu stricto, and the Burkholderia cepacia complex reveals the uniqueness of Burkholderia catarinensis. Genomics 114, 398–408. doi: 10.1016/j.ygeno.2021.11.011

PubMed Abstract | Crossref Full Text | Google Scholar

Banerjee, G., Basak, S., Roy, T., Chattopadhyay, P. (2019). Intrinsic role of bacterial secretion systems in phylogenetic niche conservation of Bradyrhizobium spp. FEMS Microbiol. Ecol. 95, fiz165. doi: 10.1093/femsec/fiz165

PubMed Abstract | Crossref Full Text | Google Scholar

Barros-Carvalho, G. A., Hungria, M., Lopes, F. M., Van Sluys, M.-A. (2019). Brazilian-adapted soybean Bradyrhizobium strains uncover IS elements with potential impact on biological nitrogen fixation. FEMS Microbiol. Lett. 366, fnz046. doi: 10.1093/femsle/fnz046

PubMed Abstract | Crossref Full Text | Google Scholar

Bender, F. R., Nagamatsu, S. T., Delamuta, J. R. M., Ribeiro, R. A., Nogueira, M. A., Hungria, M. (2022). Genetic variation in symbiotic islands of natural variant strains of soybean Bradyrhizobium japonicum and Bradyrhizobium diazoefficiens differing in competitiveness and in the efficiency of nitrogen fixation. Microbial Genomics 8, 795. doi: 10.1099/mgen.0.000795

PubMed Abstract | Crossref Full Text | Google Scholar

Bouznif, B., Guefrachi, I., Rodríguez de la Vega, R. C., Hungria, M., Mars, M., Alunni, B., et al. (2019). Phylogeography of the Bradyrhizobium spp. associated with peanut, Arachis hypogaea: Fellow travelers or new associations? Front. Microbiol. 10. doi: 10.3389/fmicb.2019.02041

PubMed Abstract | Crossref Full Text | Google Scholar

Brockhurst, M. A., Harrison, E., Hall, J. P. J., Richards, T., McNally, A., MacLean, C. (2019). he ecology and evolution of pangenomes. Curr. Biol. 29, R1094–R1103. doi: 10.1016/j.cub.2019.08.012

PubMed Abstract | Crossref Full Text | Google Scholar

Brynildsrud, O., Bohlin, J., Scheffer, L., Eldholm, V. (2016). Rapid scoring of genes in microbial pan-genome-wide association studies with Scoary. Genome Biol. 17, 238. doi: 10.1186/s13059-016-1108-8

PubMed Abstract | Crossref Full Text | Google Scholar

Contreras-Moreira, B., Vinuesa, P. (2013). GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis. Appl. Environ. Microbiol. 79, 7696–7701. doi: 10.1128/AEM.02411-13

PubMed Abstract | Crossref Full Text | Google Scholar

Conway, J. R., Lex, A., Gehlenborg, N. (2017). UpSetR: An R package for the visualization of intersecting sets and their properties. Bioinformatics 33, 2938–2940. doi: 10.1093/bioinformatics/btx364

PubMed Abstract | Crossref Full Text | Google Scholar

Dong, W., Song, Y. (2020). The significance of flavonoids in the process of biological nitrogen fixation. Int. J. Mol. Sci. 21, 5926. doi: 10.3390/ijms21165926

PubMed Abstract | Crossref Full Text | Google Scholar

Doyle, J. J. (2011). Phylogenetic perspectives on the origins of nodulation. Mol. Plant-Microbe Interactions: MPMI 24, 1289–1295. doi: 10.1094/MPMI-05-11-0114

PubMed Abstract | Crossref Full Text | Google Scholar

Doyle, J. J., Luckow, M. A. (2003). The rest of the iceberg. Legume diversity and evolution in a phylogenetic context. Plant Physiol. 131, 900–910. doi: 10.1104/pp.102.018150

PubMed Abstract | Crossref Full Text | Google Scholar

Ferreira, M. C., Hungria, M. (2002). Recovery of soybean inoculant strains from uncropped soils in Brazil. Field Crops Res. 79, 139–152. doi: 10.1016/S0378-4290(02)00119-3

Crossref Full Text | Google Scholar

Ferrés, I., Iraola, G. (2021). Protocol for post-processing of bacterial pangenome data using Pagoo pipeline. STAR Protoc. 2, 100802. doi: 10.1016/j.xpro.2021.100802

PubMed Abstract | Crossref Full Text | Google Scholar

Freitas, V. F., Cerezini, P., Hungria, M., Nogueira, M. A. (2022). Strategies to deal with drought-stress in biological nitrogen fixation in soybean. Appl. Soil Ecol. 172, 104352. doi: 10.1016/j.apsoil.2021.104352

Crossref Full Text | Google Scholar

Giraud, E., Moulin, L., Vallenet, D., Barbe, V., Cytryn, E., Avarre, J.-C., et al. (2007). Legumes symbioses: Absence of nod genes in photosynthetic bradyrhizobia. Sci. (New York N.Y.) 316, 1307–1312. doi: 10.1126/science.1139548

PubMed Abstract | Crossref Full Text | Google Scholar

González, V., Santamaría, R. I., Bustos, P., Pérez-Carrascal, O. M., Vinuesa, P., Juárez, S., et al. (2019). Phylogenomic Rhizobium species are structured by a continuum of diversity and genomic clusters. Front. Microbiol. 10. doi: 10.3389/fmicb.2019.00910

PubMed Abstract | Crossref Full Text | Google Scholar

Gurevich, A., Saveliev, V., Vyahhi, N., Tesler, G. (2013). QUAST: Quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075. doi: 10.1093/bioinformatics/btt086

PubMed Abstract | Crossref Full Text | Google Scholar

Harris, C. D., Torrance, E. L., Raymann, K., Bobay, L.-M. (2021). CoreCruncher: Fast and robust construction of core genomes in large prokaryotic data sets. Mol. Biol. Evol. 38, 727–734. doi: 10.1093/molbev/msaa224

PubMed Abstract | Crossref Full Text | Google Scholar

Herendeen, P. S. (1992). “The fossil history of the Leguminosae: Phylogenetic and biogeographic implications,” in Advances in Legume Systematics, part 4. The Fossil Record (Kew: Royal Botanic Gardens), 303–316.

Google Scholar

Huerta-Cepas, J., Forslund, K., Coelho, L. P., Szklarczyk, D., Jensen, L. J., von Mering, C., et al. (2017). Fast genome-wide functional annotation through orthology assignment by eggNOG-Mapper. Mol. Biol. Evol. 34, 2115–2122. doi: 10.1093/molbev/msx148

PubMed Abstract | Crossref Full Text | Google Scholar

Hungria, M., Menna, P., Delamuta, J. R. M. (2015). “Bradyrhizobium, the ancestor of all rhizobia: Phylogeny of housekeeping and nitrogen-fixation genes,” in Biological Nitrogen Fixation (New Jersey: John Wiley & Sons, Ltd, Inc.), 191–202. doi: 10.1002/9781119053095.ch18

Crossref Full Text | Google Scholar

Hungria, M., Nogueira, M. A. (2023). “Nitrogen fixation,” in Marschner’s Mineral Nutrition of Plants, 4th ed. Eds. Rengel, Z., Cakmak, I., White, P. J., P., J. (Academic Press, Elsevier, London), 615–650.

Google Scholar

Katoh, K., Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772–780. doi: 10.1093/molbev/mst010

PubMed Abstract | Crossref Full Text | Google Scholar

Kimbrel, J. A., Thomas, W. J., Jiang, Y., Creason, A. L., Thireault, C. A., Sachs, J. L., et al. (2013). Mutualistic co-evolution of Type III effector genes in Sinorhizobium fredii and Bradyrhizobium japonicum. PloS Pathog. 9. doi: 10.1371/journal.ppat.1003204

PubMed Abstract | Crossref Full Text | Google Scholar

Lavin, M., Luckow, M. (1993). Origins and relationships of tropical north america in the context of the boreotropics hypothesis. Am. J. Bot. 80, 1–14. doi: 10.1002/j.1537-2197.1993.tb13761.x

Crossref Full Text | Google Scholar

Lloret, L., Martínez-Romero, E. (2005). Evolution and phylogeny of rhizobia. Rev. Latinoamericana Microbiologia 47, 43–60.

Google Scholar

Martinez-Romero, E., Peix, A., Hungria, M., Mousavi, S. A., Martinez-Romero, J., Young, P. (2024). Guidelines for the description of rhizobial symbiovars. Int. J. Systematic Evolutionary Microbiol. 74, 6373. doi: 10.1099/ijsem.0.006373

PubMed Abstract | Crossref Full Text | Google Scholar

McInerney, J. O., McNally, A., O’Connell, M. J. (2017). Why prokaryotes have pangenomes. Nat. Microbiol. 2. doi: 10.1038/nmicrobiol.2017.40

PubMed Abstract | Crossref Full Text | Google Scholar

Menna, P., Hungria, M. (2011). Phylogeny of nodulation and nitrogen-fixation genes in Bradyrhizobium: Supporting evidence for the theory of monophyletic origin, and spread and maintenance by both horizontal and vertical transfer. Int. J. Systematic Evolutionary Microbiol. 61, 3052–3067. doi: 10.1099/ijs.0.028803-0

PubMed Abstract | Crossref Full Text | Google Scholar

Minamisawa, K., Isawa, T., Nakatsuka, Y., Ichikawa, N. (1998). New Bradyrhizobium japonicum strains that possess high copy numbers of the repeated sequence RS alpha. Appl. Environ. Microbiol. 64, 1845–1851. doi: 10.1128/AEM.64.5.1845-1851.1998

PubMed Abstract | Crossref Full Text | Google Scholar

Miwa, H., Okazaki, S. (2017). How effectors promote beneficial interactions. Curr. Opin. Plant Biol. 38, 148–154. doi: 10.1016/j.pbi.2017.05.011

PubMed Abstract | Crossref Full Text | Google Scholar

Mosquera-Rendón, J., Rada-Bravo, A. M., Cárdenas-Brito, S., Corredor, M., Restrepo-Pineda, E., Benítez-Páez, A. (2016). Pangenome-wide and molecular evolution analyses of the Pseudomonas aeruginosa species. BMC Genomics 17, 45. doi: 10.1186/s12864-016-2364-4

PubMed Abstract | Crossref Full Text | Google Scholar

Ormeño-Orrillo, E., Martínez-Romero, E. (2019). A Genomotaxonomy view of the bradyrhizobium genus. Front. Microbiol. 10. doi: 10.3389/fmicb.2019.01334

PubMed Abstract | Crossref Full Text | Google Scholar

Page, A. J., Cummins, C. A., Hunt, M., Wong, V. K., Reuter, S., Holden, M. T. G., et al. (2015). Roary: Rapid large-scale prokaryote pan genome analysis. Bioinformatics 31, 3691–3693. doi: 10.1093/bioinformatics/btv421

PubMed Abstract | Crossref Full Text | Google Scholar

Parker, M. A. (2012). Legumes select symbiosis island sequence variants in Bradyrhizobium. Mol. Ecol. 21, 1769–1778. doi: 10.1111/j.1365-294X.2012.05497.x

PubMed Abstract | Crossref Full Text | Google Scholar

Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P., Tyson, G. W. (2015). CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055. doi: 10.1101/gr.186072.114

PubMed Abstract | Crossref Full Text | Google Scholar

Provorov, N. A., Vorob’ev, N. I. (2000). Evolutionary genetics of nodule bacteria: Molecular and population aspects. Russian J. Genet. 36, 1323–1335. doi: 10.1023/A:1009029703470

Crossref Full Text | Google Scholar

Rasko, D. A., Rosovitz, M. J., Myers, G. S. A., Mongodin, E. F., Fricke, W. F., Gajer, P., et al. (2008). The pangenome structure of Escherichia coli: Comparative genomic analysis of E. coli commensal and pathogenic isolates. J. Bacteriology 190, 6881–6893. doi: 10.1128/JB.00619-08

PubMed Abstract | Crossref Full Text | Google Scholar

Rogel, M. A., Ormeño-Orrillo, E., Martinez-Romero, E. (2011). Symbiovars in rhizobia reflect bacterial adaptation to legumes. Systematic Appl. Microbiol. 34, 96–104. doi: 10.1016/j.syapm.2010.11.015

PubMed Abstract | Crossref Full Text | Google Scholar

Rosselli, R., La Porta, N., Muresu, R., Stevanato, P., Concheri, G., Squartini, A. (2021). Pangenomics of the symbiotic Rhizobiales. Core and accessory functions across a group endowed with high levels of genomic plasticity. Microorganisms 9. doi: 10.3390/microorganisms9020407

PubMed Abstract | Crossref Full Text | Google Scholar

Schrire, B., Lewis, G., Mackinder, B., Lock, M. (Eds.) (2005). Legumes of the world (Kew: Royal Botanic Gardens), 21–54.

Google Scholar

Seemann, T. (2014). Prokka: Rapid prokaryotic genome annotation. Bioinf. (Oxford England) 30, 2068–2069. doi: 10.1093/bioinformatics/btu153

PubMed Abstract | Crossref Full Text | Google Scholar

Shen, W., Le, S., Li, Y., Hu, F. (2016). SeqKit: A Cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PloS One 11, e0163962. doi: 10.1371/journal.pone.0163962

PubMed Abstract | Crossref Full Text | Google Scholar

Sprent, J. I. (1994). Evolution and diversity in the legume-rhizobium symbiosis: Chaos theory? Plant Soil 161, 1–10. doi: 10.1007/BF02183080

Crossref Full Text | Google Scholar

Sprent, J. I., Ardley, J. K., James, E. K. (2013). From North to South: A latitudinal look at legume nodulation processes. South Afr. J. Bot. 89, 31–41. doi: 10.1016/j.sajb.2013.06.011

Crossref Full Text | Google Scholar

Sprent, J. I., Ardley, J., James, E. K. (2017). Biogeography of nodulated legumes and their nitrogen-fixing symbionts. New Phytol. 215, 40–56. doi: 10.1111/nph.14474

PubMed Abstract | Crossref Full Text | Google Scholar

Stępkowski, T., Hughes, C. E., Law, I. J., Markiewicz, Ł., Gurda, D., Chlebicka, A., et al. (2007). Diversification of Lupine Bradyrhizobium Strains: Evidence from nodulation gene trees. Appl. Environ. Microbiol. 73, 3254–3264. doi: 10.1128/AEM.02125-06

PubMed Abstract | Crossref Full Text | Google Scholar

Staehelin, C., Krishnan, H. B. (2015). Nodulation outer proteins: Double-edged swords of symbiotic rhizobia. Biochem. J. 470, 263–274. doi: 10.1042/BJ20150518

PubMed Abstract | Crossref Full Text | Google Scholar

Tampakaki, A. (2014). Commonalities and differences of T3SSs in rhizobia and plant pathogenic bacteria. Front. Plant Sci. 5. doi: 10.3389/fpls.2014.00114

PubMed Abstract | Crossref Full Text | Google Scholar

Tamura, K., Stecher, G., Kumar, S. (2021). MEGA11: molecular evolutionary genetics analysis version 11. Mol. Biol. Evol. 38, 3022–3027. doi: 10.1093/molbev/msab120

PubMed Abstract | Crossref Full Text | Google Scholar

Taylor, D. W. (1990). Paleobiogeographic relationships of angiosperms from the Cretaceous and early Tertiary of the North American area. Botanical Rev. 56, 279–417. doi: 10.1007/BF02995927

Crossref Full Text | Google Scholar

Tettelin, H., Riley, D., Cattuto, C., Medini, D. (2008). Comparative genomics: The bacterial pan-genome. Curr. Opin. Microbiol. 11, 472–477. doi: 10.1016/j.mib.2008.09.006

PubMed Abstract | Crossref Full Text | Google Scholar

Teulet, A., Camuel, A., Perret, X., Giraud, E. (2022). The versatile roles of Type III Secretion Systems in Rhizobium-legume symbioses. Annu. Rev. Microbiol. 76, 45–65. doi: 10.1146/annurev-micro-041020-032624

PubMed Abstract | Crossref Full Text | Google Scholar

Teulet, A., Gully, D., Rouy, Z., Camuel, A., Koebnik, R., Giraud, E., et al. (2020). Phylogenetic distribution and evolutionary dynamics of nod and T3SS genes in the genus Bradyrhizobium. Microbial Genomics 6, mgen000407. doi: 10.1099/mgen.0.000407

PubMed Abstract | Crossref Full Text | Google Scholar

Tighilt, L., Boulila, F., De Sousa, B. F. S., Giraud, E., Ruiz-Argüeso, T., Palacios, J. M., et al. (2022). The bradyrhizobium sp. LmicA16 type VI secretion system is required for efficient nodulation of lupinus spp. Microbial Ecol. 84, 844–855. doi: 10.1007/s00248-021-01892-8

PubMed Abstract | Crossref Full Text | Google Scholar

Vinuesa, P., León-Barrios, M., Silva, C., Willems, A., Jarabo-Lorenzo, A., Pérez-Galdona, R., et al. (2005). Bradyrhizobium canariense sp. nov., an acid-tolerant endosymbiont that nodulates endemic genistoid legumes (Papilionoideae: Genisteae) from the Canary Islands, along with Bradyrhizobium japonicum bv. genistearum, Bradyrhizobium genospecies alpha and Bradyrhizobium genospecies beta. Int. J. Systematic Evolutionary Microbiol. 55, 569–575. doi: 10.1099/ijs.0.63292-0

PubMed Abstract | Crossref Full Text | Google Scholar

Wangthaisong, P., Piromyou, P., Songwattana, P., Wongdee, J., Teamtaisong, K., Tittabutr, P., et al. (2023). The type IV secretion system (T4SS) mediates symbiosis between bradyrhizobium sp. SUTN9-2 and legumes. Appl. Environ. Microbiol. 89, e00040–e00023. doi: 10.1128/aem.00040-23

PubMed Abstract | Crossref Full Text | Google Scholar

Weisberg, A. J., Rahman, A., Backus, D., Tyavanagimatt, P., Chang, J. H., Sachs, J. L. (2022). Pangenome evolution reconciles robustness and instability of rhizobial symbiosis. mBio 13, e00074–e00022. doi: 10.1128/mbio.00074-22

PubMed Abstract | Crossref Full Text | Google Scholar

Wenzel, M., Friedrich, L., Göttfert, M., Zehner, S. (2010). The type III-secreted protein NopE1 affects symbiosis and exhibits a calcium-dependent autocleavage activity. Mol. Plant-Microbe Interactions: MPMI 23, 124–129. doi: 10.1094/MPMI-23-1-0124

PubMed Abstract | Crossref Full Text | Google Scholar

Werner, G. D. A., Cornwell, W. K., Sprent, J. I., Kattge, J., Kiers, E. T. (2014). A single evolutionary innovation drives the deep evolution of symbiotic N2-fixation in angiosperms. Nat. Commun. 5, 4087. doi: 10.1038/ncomms5087

PubMed Abstract | Crossref Full Text | Google Scholar

Xiang, Q.-W., Bai, J., Cai, J., Huang, Q.-Y., Wang, Y., Liang, Y., et al. (2020). NopD of Bradyrhizobium sp. XS1150 possesses SUMO protease activity. Front. Microbiol. 11. doi: 10.3389/fmicb.2020.00386

PubMed Abstract | Crossref Full Text | Google Scholar

Xie, Z., Tang, H. (2017). ISEScan: Automated identification of insertion sequence elements in prokaryotic genomes. Bioinformatics 33, 3340–3347. doi: 10.1093/bioinformatics/btx433

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, Y., Ku, Y.-S., Cheung, T.-Y., Cheng, S.-S., Xin, D., Gombeau, K., et al. (2024). Challenges to rhizobial adaptability in a changing climate: Genetic engineering solutions for stress tolerance. Microbiological Res. 288, 127886. doi: 10.1016/j.micres.2024.127886

PubMed Abstract | Crossref Full Text | Google Scholar

Zhong, C., Hu, G., Hu, C., Xu, C., Zhang, Z., Ning, K. (2024). Comparative genomics analysis reveals genetic characteristics and nitrogen fixation profile of Bradyrhizobium. iScience 27, 108948. doi: 10.1016/j.isci.2024.108948

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Bradyrhizobium, pangenome, biogeography, Nod factors, Fix/Nif, type III secretion system, nodulation, environmental adaptation

Citation: Terra LA, Klepa MS, Nogueira MA and Hungria M (2025) Pangenome analysis indicates evolutionary origins and genetic diversity: emphasis on the role of nodulation in symbiotic Bradyrhizobium. Front. Plant Sci. 16:1539151. doi: 10.3389/fpls.2025.1539151

Received: 03 December 2024; Accepted: 10 March 2025;
Published: 02 April 2025.

Edited by:

Krzysztof Szczyglowski, Agriculture and Agri-Food Canada (AAFC), Canada

Reviewed by:

Louis S. Tisa, University of New Hampshire, United States
Michael Fruci, Agriculture and Agri-Food Canada (AAFC), Canada
Eden Bromfield, Agriculture and Agri-Food Canada (AAFC), Canada

Copyright © 2025 Terra, Klepa, Nogueira and Hungria. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mariangela Hungria, bWFyaWFuZ2VsYS5odW5ncmlhQGVtYnJhcGEuYnI=; YmlvdGVjbm9sb2dpYS5zb2xvQGhvdG1haWwuY29t

ORCID: Leonardo Araujo Terra, orcid.org/0000-0003-4100-8079
Milena Serenato Klepa, orcid.org/0000-0003-4456-4345
Marco Antonio Nogueira, orcid.org/0000-0001-7747-9839
Mariangela Hungria, orcid.org/0000-0002-5132-8685

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more