ORIGINAL RESEARCH article

Front. Microbiol., 14 November 2024
Sec. Food Microbiology
This article is part of the Research Topic Mechanisms of Fermented Foods and Interactions with the Gut Microbiome

Unveiling the whole genomic features and potential probiotic characteristics of novel Lactiplantibacillus plantarum HMX2

Tariq Aziz1,2,3,4, Muhammad Naveed5, Muhammad Aqib Shabbir6, Abid Sarwar3, Jasra Naseeb3, Liqing Zhao1*, Zhennai Yang3*, Haiying Cui4, Lin Lin4*, Thamer H. Albekairi7
  • 1Department of Food Science and Technology, College of Chemistry and Environmental Engineering, Shenzhen University, Shenzhen, Guangdong, China
  • 2School of Biomedical Engineering, Shenzhen University, Shenzhen, Guangdong, China
  • 3Key Laboratory of Geriatric Nutrition and Health of Ministry of Education, Beijing Advanced Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, Beijing Technology and Business University, Beijing, China
  • 4School of Food and Biological Engineering, Jiangsu University, Zhenjiang, China
  • 5Department of Biotechnology, Faculty of Science and Technology, University of Central Punjab, Lahore, Pakistan
  • 6Department of Biotechnology, Faculty of Biological Sciences, Lahore University of Biological and Applied Sciences, Lahore, Pakistan
  • 7Department of Pharmacology and Toxicology, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia

This study investigates, for the first time, the genomic features and probiotic potential of Lactiplantibacillus plantarum HMX2, isolated from Chinese sauerkraut, using whole-genome sequencing (WGS) and bioinformatics. It also aims to characterize genetic diversity, antibiotic resistance genes, and functional capabilities to clarify the strain's food safety applications and potential as a probiotic. L. plantarum HMX2 was cultured, and DNA was extracted for WGS. Genomic analysis comprised average nucleotide identity (ANI) prediction, genome annotation, pangenome analysis, and synteny analysis. Bioinformatics techniques were used to identify coding sequences (CDSs), transfer RNA (tRNA) and ribosomal RNA (rRNA) genes, and antibiotic resistance genes, and to conduct phylogenetic analysis to establish genetic diversity and evolution. The study found high genetic similarity (99.17% ANI) between L. plantarum HMX2 and the reference strain. Genome annotation revealed 3,242 coding sequences, 65 tRNA genes, and 16 rRNA genes. Significant genetic variety was found, including 25 antibiotic resistance genes. Phylogenetic analysis placed L. plantarum HMX2 among closely related bacteria, emphasizing its potential for probiotic and food safety applications. The genomic investigation identified key genes, including plnJK and plnEF, which contribute to antibacterial action against foodborne pathogens. Furthermore, genes such as MurA, Alr, and MprF improve food safety and probiotic potential by promoting bacterial survival under stress conditions in food and the gastrointestinal tract. This study describes new genomic features of L. plantarum HMX2 and its possible uses in food safety and food technology. The identification of specific genes involved in antimicrobial activity opens possibilities for exploiting this strain in probiotic preparations and food preservation methods.
Future research should focus on the experimental validation of antibiotic resistance genes, comparative genomics to investigate functional diversity, and the development of novel antimicrobial therapies that take advantage of L. plantarum's capabilities.

1 Introduction

The development of molecular techniques such as whole-genome sequencing (WGS) has transformed bacterial strain typing, significantly impacting epidemiological monitoring and outbreak analysis. Genomics and WGS can significantly improve our understanding of infectious diseases and clinical microbiology (Avershina et al., 2023). The development of benchtop WGS analyzers has made genomics more accessible to clinical and public health experts in microbiology. Despite restrictions of resources and infrastructure, WGS is especially beneficial in public health laboratories, reference labs, and hospital infection control labs (Aziz et al., 2023a; Beltrán-Velasco et al., 2024). Lactiplantibacillus plantarum (previously known as Lactobacillus plantarum) is a Gram-positive bacterium that lives in a variety of environments, including fermented dairy products, sourdough, fruits, vegetables, cereals, meat, fish, and the mammalian gastrointestinal tract. It is commonly employed as a starter culture in a variety of fermented foods, improving their flavor, texture, and sensory qualities (Beltrán-Velasco et al., 2024). The most frequently investigated strain is L. plantarum 299v, which has appeared in over 170 scholarly papers, including more than 60 human clinical investigations. L. plantarum's genome is 3.3 Mb, larger than the typical 2–2.7-Mb genome of other lactic acid bacteria (LAB) species, indicating high genetic diversity. Studies on six strains of L. plantarum revealed significant differences in prophages, transposases, IS elements, and plantaricin biosynthesis genes, along with notable variations in capsular and extracellular polysaccharide biosynthesis genes (Umanets et al., 2023).

This study aims to investigate the L. plantarum genome through bacterial strain identification, sequence similarity comparison, gene annotation, and, finally, comparative analysis. The tools employed to analyze L. plantarum strains include the ANI Calculator, MetaGeneMark, the Rapid Annotations using Subsystems Technology (RAST) toolkit, the integrated prokaryotic genome analysis (IPGA) platform, the OrthoVenn3 server, the Pathosystems Resource Integration Center (PATRIC), and the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database. The study compares nucleotide sequences with the orthologous genes of other strains and also identifies specialty genes and protein–protein interactions (PPIs), providing a comprehensive understanding of the genomic complexity and evolutionary relationships within this bacterial species. The information from this research should prove useful in further studies of microbial genetics, biotechnology, and the functioning and diversification of pathogens (Syaputri et al., 2023). Additionally, the project seeks to investigate genes associated with defense and survival to offer a thorough understanding of the strain's genetic composition and adaptive strategies. This research will enhance our knowledge of L. plantarum's functional capabilities and potential applications in various industries. Investigating its genetic diversity may reveal insights that could drive novel biotechnological innovations (Rajput et al., 2023).

The study enhances understanding of L. plantarum's genetic diversity and potential applications (Liu et al., 2023). It was designed to identify and characterize L. plantarum HMX2, emphasizing its high genomic similarity to the reference strain, significant genetic diversity, and the presence of critical defense and survival genes. The comprehensive genomic research broadens our understanding of L. plantarum's functional capabilities and prospective uses in food safety. Future studies will focus on the strain's ability to adapt across multiple environments, its potential in probiotic formulations, and its suitability for generating novel antimicrobial drugs.

2 Materials and methods

2.1 Bacterial strain identification and culturing

L. plantarum is of considerable commercial importance owing to its possible uses in the food and pharmaceutical fields. It has reported positive effects on gut health and the immune system and is used in biopreservation because of its capacity to naturally preserve fermented products and to create healthy functional foods (Hu et al., 2023). The target strain, L. plantarum HMX2, was inoculated into de Man, Rogosa and Sharpe (MRS) medium and grown for 20 h. Bacterial DNA was extracted according to the instructions of the Bacterial Genomic DNA Extraction Kit (Beijing Tiangen). In a microcentrifuge tube, 0.5–4.0 ml of cells (maximum 2 × 10^9 cells) were harvested by centrifugation for 1 min at maximum speed, and as much of the supernatant as possible was discarded. The pellet was resuspended in 100 μl of erythrocyte lysis (EL) Buffer and mixed thoroughly using a pipette tip. The mixture was then incubated at 37°C for 40 min; certain bacteria require an additional hour or more (Sadanov et al., 2023).

2.2 Whole-genome sequencing

The genomic DNA of L. plantarum HMX2 was sequenced on the Illumina HiSeq system at 12.0× coverage. The reads were assembled using ABySS v.12.1, yielding a circular genome of 3,322,298 base pairs. Genes and non-coding RNAs were identified using the National Center for Biotechnology Information Prokaryotic Genome Annotation Pipeline (NCBI PGAP); the resulting annotation included 3,172 genes, comprising coding sequences (CDSs), ribosomal RNAs (rRNAs), transfer RNAs (tRNAs), and ncRNAs. This assembly and annotation allow the genomic organization of the strain to be described comprehensively. The genome sequence was then submitted to NCBI under Accession No. GCF_025144505.1.

2.3 Average nucleotide identity prediction

To measure nucleotide-level genomic similarity, average nucleotide identity (ANI) was calculated with the ANI Calculator available on the EzBioCloud server (https://www.ezbiocloud.net/tools/ani) (Yoon et al., 2017). The genome of L. plantarum Z.6-1 was used as the reference for the ANI calculation; it was retrieved from the National Center for Biotechnology Information (NCBI) under accession number GCA_023973045.1. Both genomes were uploaded to the ANI Calculator in FASTA format, and the analysis was performed. The results were analyzed for further interpretation (de Albuquerque and Haag, 2023).

2.4 Genome annotation and pan-genome analysis

The genome was annotated with MetaGeneMark (https://genemark.bme.gatech.edu/meta_gmhmmp.cgi) (Jahanshahi et al., 2023). For the integrated genome analysis, the genomic information of L. plantarum HMX2 was retrieved from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) database (Olson et al., 2023). The genome of L. plantarum HMX2 was first identified, and the FASTA files were then downloaded. Quality control checks were carried out with the tools available in the BV-BRC (https://www.bv-brc.org) to confirm that all the sequences were accurate and complete. Genome annotation was performed with the integrated RAST toolkit, which provides comprehensive information on gene functions and metabolic pathways. Several tools available at BV-BRC were used to compare multiple strains and detect conserved and strain-specific genomic features. To reach valid conclusions about the genome type and the phylogenetic relatedness of the observed organisms, the results were integrated and interpreted in light of prior biological knowledge (Horsfield et al., 2023).

Pangenome and gene synteny were compared with the IPGA platform v1 (Liu et al., 2022). The quality of the genome assemblies was first assessed before gene clustering to determine the orthologous group. IPGA v1.09 integrated prokaryotes genome and pan-genome analysis service revealed differences in the gene content and its functional categorization at the scale of pangenome. To investigate the degree of gene order conservation, which sheds light on the evolutionary relationship, synteny analysis was done. The visualization of the genomic and synteny features was performed by IPGA visualization tools (Wang et al., 2023).

2.5 Orthologous analysis

The five L. plantarum strains (BDGP2, DF, HMX2, SRCM100442, and UNQLp11) were compared at the level of orthologous protein groups using the OrthoVenn3 server (https://orthovenn3.bioinfotoolkits.net) (Sun et al., 2023). First, the gene calls for each strain were extracted from the respective genome assembly to obtain protein sequences. These sequences were then uploaded to the OrthoVenn3 server. The analysis parameters were kept at their default settings, which for ortholog identification are an E-value of 1e-2 and an inflation value of 1.5 for clustering. The parameter for k-means clustering was set to 5. In OrthoVenn3, all protein sequences are compared with one another using DIAMOND, and homologous sequences are then grouped into clusters (Wang et al., 2015).

A Venn diagram was generated to illustrate the degree of conservation and difference in the orthologous clusters among the five strains. The figure shows the number of clusters, the count of proteins belonging to each strain, and the overlap between clusters. This summary of the genomes highlights the core elements of the species along with the strain-specific features. The clusters were then subdivided to infer the functions of the conserved genes and of those unique to particular L. plantarum strains, giving a clearer picture of functional conservation and diversification in L. plantarum. A comprehensive comparative genomics analysis of L. plantarum strains was conducted using OrthoVenn3, incorporating both a broad and a narrow selection of strains.

2.6 Prediction of specialty genes

The sequencing data analysis for identifying specialty genes in L. plantarum HMX2 was performed on the Pathosystems Resource Integration Center (PATRIC) (Snyder et al., 2007). First, the whole-genome sequence of L. plantarum HMX2 was uploaded to PATRIC, and the sequences were automatically annotated and analyzed with the Comprehensive Genome Analysis service. The bioinformatics tools and databases available on the platform were used to identify genes belonging to the categories of interest, namely antibiotic resistance, transporters, and functional capabilities (Vidulin et al., 2016).

2.7 Protein interaction network

To map the selected proteins from L. plantarum HMX2 onto a PPI network, the STRING database version 11.5 (https://string-db.org) was used to determine possible interactions (von Mering et al., 2003). The proteins selected for the PPI study were glpF6, fusA2, glpF1, rho, orf2, yidC1, and tuf. Each protein was submitted to STRING, which combines manually curated and computationally predicted PPIs from various databases, drawing on experimental evidence, gene organization, gene fusion, phylogenetic co-occurrence, text references, gene expression profiles, and sequence similarity (Szklarczyk et al., 2015). The protein identifiers were entered into the STRING interface with the minimum interaction score set to the medium confidence level of 0.400 to capture both high- and moderate-confidence PPIs. The resulting network was then expanded to show the connectivity among the query proteins and their first shell of interactors. Nodes represent individual proteins, and edges represent predicted associations between them. The interactions were classified by evidence type, such as gene expression, experimental data, curated databases, annotations, and integrated computational prediction. The visualization distinguished proteins with known or predicted structures (filled nodes) from those with undetermined structures (empty nodes) (Al-Aamri et al., 2019; Aziz et al., 2023b).
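The effect of the 0.400 confidence cutoff can be illustrated with a small Python sketch. The edge list and its scores below are hypothetical placeholders chosen for illustration, not STRING output, and the helper name filter_network is an assumption; only the protein names and the threshold come from the text.

```python
from collections import defaultdict

MEDIUM_CONFIDENCE = 0.400  # threshold stated in the text

query_proteins = ["glpF6", "fusA2", "glpF1", "rho", "orf2", "yidC1", "tuf"]

# Hypothetical combined scores for illustration only; not STRING output.
toy_edges = [
    ("fusA2", "tuf", 0.95),
    ("rho", "fusA2", 0.62),
    ("yidC1", "tuf", 0.41),
    ("glpF1", "glpF6", 0.55),
    ("orf2", "rho", 0.15),
]


def filter_network(edges, threshold):
    """Keep edges at or above the threshold and build an adjacency map."""
    adjacency = defaultdict(set)
    for a, b, score in edges:
        if score >= threshold:
            adjacency[a].add(b)
            adjacency[b].add(a)
    return adjacency


network = filter_network(toy_edges, MEDIUM_CONFIDENCE)
# Query proteins left without any retained edge appear as isolated nodes.
unconnected = [p for p in query_proteins if not network.get(p)]
print("Proteins without retained edges:", unconnected)
```

Raising the threshold removes weaker edges and can isolate more nodes, which is why the chosen cutoff should be reported alongside any network figure.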

3 Results

3.1 Average nucleotide identity prediction

ANI was calculated using the ANI Calculator available on the EzBioCloud server (https://www.ezbiocloud.net/tools/ani) to measure the nucleotide-level genomic similarity between the two genomes. The genome of L. plantarum Z.6-1, retrieved from NCBI under accession number GCA_023973045.1, was used as the reference. Both genomes were uploaded in FASTA format for analysis. Genome sequence A (L. plantarum HMX2) had a total length of 3,322,298 bp with a GC content of 44.51%, while Genome sequence B (L. plantarum Z.6-1) had a total length of 3,333,079 bp with a GC content of 44.42%. The analysis returned an OrthoANIu value of 99.17%, with Genome A and B lengths of 3,322,140 and 3,332,340 bp, respectively. The average aligned length was 2,204,278 bp, corresponding to a coverage of 66.35% for Genome A and 66.15% for Genome B. These results indicate a high level of genomic similarity between the analyzed genomes. The detailed results of the ANI calculation are given in Table 1.
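The coverage figures follow from the average aligned length and the genome lengths used in the OrthoANIu comparison. A minimal Python sketch of that arithmetic, using only the values reported in this section (the function name alignment_coverage is an illustrative assumption and is not part of the ANI Calculator):

```python
def alignment_coverage(aligned_bp: int, genome_bp: int) -> float:
    """Return the aligned fraction of a genome as a percentage."""
    if genome_bp <= 0:
        raise ValueError("genome length must be positive")
    return 100.0 * aligned_bp / genome_bp


# Values reported in this section for the OrthoANIu comparison.
AVERAGE_ALIGNED_BP = 2_204_278
GENOME_A_BP = 3_322_140  # L. plantarum HMX2
GENOME_B_BP = 3_332_340  # L. plantarum Z.6-1

coverage_a = alignment_coverage(AVERAGE_ALIGNED_BP, GENOME_A_BP)
coverage_b = alignment_coverage(AVERAGE_ALIGNED_BP, GENOME_B_BP)
print(f"Genome A coverage: {coverage_a:.2f}%")
print(f"Genome B coverage: {coverage_b:.2f}%")
```

Because roughly two thirds of each genome is aligned, the 99.17% identity describes the shared regions only, not the unaligned remainder.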

Table 1. The average nucleotide identity (ANI) of L. plantarum HMX2 genome.

The complete genome analysis performed with the BV-BRC tool provided overall information on the general features and alignment statistics of the genomes listed above. The analysis comprised 11 genomes; the number of genes per genome varied between 1,126 and 2,659, and the number of single-copy genes varied between 532 and 683, with five filtered single-copy genes present in all the genomes.

3.2 Genome annotation and genome assembly

PATRIC was used to analyze the complete genome of L. plantarum HMX2. The submitted assembly consisted of a single contig with a total length of 3,322,298 bp and a G + C content of ~44.51%. No plasmids or chromosomes were detected. The genome was annotated using the RAST toolkit (RASTtk). Table 2 presents the assembly details for the genome of L. plantarum HMX2.
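Assembly statistics such as contig count, total length, and G + C content can be recomputed directly from a FASTA file. The sketch below is a minimal illustration in Python; the function names and the toy contig are assumptions made for the example and do not represent HMX2 sequence data or the PATRIC implementation.

```python
from typing import Dict, List


def read_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA-formatted text into {record_id: sequence}."""
    records: Dict[str, List[str]] = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token as the record id.
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line.upper())
    return {name: "".join(parts) for name, parts in records.items()}


def gc_percent(sequence: str) -> float:
    """GC content as a percentage of unambiguous (A/C/G/T) bases."""
    counted = sum(sequence.count(base) for base in "ACGT")
    if counted == 0:
        return 0.0
    gc = sequence.count("G") + sequence.count("C")
    return 100.0 * gc / counted


# Toy contig for illustration only; not HMX2 sequence data.
toy_fasta = ">toy_contig example\nATGCGCGTTA\nGGCCAATTNN\n"
contigs = read_fasta(toy_fasta)
total_bp = sum(len(seq) for seq in contigs.values())
toy_gc = gc_percent(contigs["toy_contig"])
print(len(contigs), total_bp, round(toy_gc, 2))
```

Ambiguous bases (N) count toward total length here but are excluded from the GC denominator; tools differ on this convention, so small discrepancies between reported GC values are possible.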

Table 2. The assembly details of the Lactiplantibacillus plantarum HMX2 genome obtained from RASTtk.

3.3 Genome annotation

Genome annotation identified 3,242 CDSs, 65 tRNA genes, and 16 rRNA genes. Of the 3,242 protein-coding genes, 1,424 were hypothetical and 1,818 had a functional annotation; 683 had Enzyme Commission (EC) numbers, 577 had Gene Ontology (GO) terms, and 484 had Kyoto Encyclopedia of Genes and Genomes (KEGG) Orthology mappings. No PLFams were found, but 3,143 of the proteins fell into cross-genus protein families (PGFams). Table 3 presents the annotated genome features of the L. plantarum HMX2 genome, and Table 4 presents the protein features of the genome.
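As a quick consistency check, the hypothetical and functionally annotated genes should sum to the total CDS count. The short Python sketch below performs that check and derives percentages; it assumes the total of 3,242 CDSs given in the first sentence of the paragraph, and the variable names are illustrative.

```python
# Counts reported in the annotation paragraph.
TOTAL_CDS = 3_242
HYPOTHETICAL = 1_424
FUNCTIONAL = 1_818
PGFAM_ASSIGNED = 3_143

# The two annotation classes should partition the full CDS set.
assert HYPOTHETICAL + FUNCTIONAL == TOTAL_CDS


def percent(part: int, whole: int) -> float:
    """Express part as a percentage of whole."""
    return 100.0 * part / whole


summary = {
    "hypothetical": percent(HYPOTHETICAL, TOTAL_CDS),
    "functional": percent(FUNCTIONAL, TOTAL_CDS),
    "pgfam": percent(PGFAM_ASSIGNED, TOTAL_CDS),
}
for label, value in summary.items():
    print(f"{label}: {value:.1f}%")
```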

Table 3. Annotated genome features of Lactiplantibacillus plantarum HMX2 genome.

Table 4. Protein features of Lactiplantibacillus plantarum HMX2 genome.

Figure 1 is a circular map of the genome annotations: CDSs on the forward and reverse strands are shown in red, RNA genes in violet, antibiotic resistance genes (ARGs) in green, and virulence factors (VFs) in yellow. The guanine-cytosine (GC) content is shown in blue and the GC skew in magenta. By subsystem, metabolism comprised 63 genes; protein processing 37; stress response, defense, and virulence 22; DNA processing 18; energy 15; RNA processing 12; cellular processes 9; membrane transport 6; cell envelope 4; and regulation and cell signaling 3; the remaining gene fell under the miscellaneous category.

Figure 1. Circular genome annotation of Lactiplantibacillus plantarum HMX2 genome.

3.4 Subsystem analysis

In this systematic analysis of specialty genes, 25 genes related to antibiotic resistance, one drug target, and 14 transporters were detected (Table 5; Figure 2). The antimicrobial resistance (AMR) gene categories included target modification genes, genes that encode efflux pumps, and carbapenemase-encoding genes. In addition, genes associated with crucial processes, such as metabolism, membrane transport, and protein processing, were detected.

Table 5. The presence of specialty genes in Lactiplantibacillus plantarum HMX2 genome.

Figure 2. An overview of the subsystems in Lactiplantibacillus plantarum HMX2 genome.

3.5 Identification of antibiotic resistance genes

AMR genes were identified in the PATRIC Genome Annotation Service using a k-mer-based approach, and functional descriptions and classifications were assigned. In this technique, AMR gene variants in PATRIC's library are classified by mechanism of resistance, drug class, and associated antibiotics (Supplementary Table S1). For instance, the genes Alr, Ddl, and folA encode antibiotic targets in susceptible species, and the genes gyrA and gyrB are associated with alteration of antibiotic targets. Moreover, charge-modifying factors, such as the GdpD and MprF genes, contribute to antibiotic resistance by changing the charge of the cell wall. It should be noted, however, that the presence of full-length AMR-related genes does not always imply an AMR phenotype; correlating specific AMR mechanisms with phenotype, and assessing the impact of SNP mutations, therefore remains a prerequisite.
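The general idea behind k-mer-based gene detection can be sketched in a few lines of Python. This is an illustration of the principle only, not PATRIC's implementation: the k-mer length of 8, the function names, and the toy amino acid strings are all assumptions made for the example.

```python
def kmers(sequence: str, k: int) -> set:
    """Return the set of all length-k substrings of a sequence."""
    sequence = sequence.upper()
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def shared_kmer_fraction(query: str, reference: str, k: int = 8) -> float:
    """Fraction of the reference's k-mers that also occur in the query."""
    ref = kmers(reference, k)
    if not ref:
        return 0.0
    return len(ref & kmers(query, k)) / len(ref)


# Toy amino acid strings for illustration only; not real AMR gene sequences.
reference_library = {
    "toy_gene_1": "MKTAYIAKQRQISFVKSHFSRQ",
    "toy_gene_2": "MSLNDWQERTYPLKHGGACVNM",
}
query_protein = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"

hits = {
    name: shared_kmer_fraction(query_protein, ref_seq)
    for name, ref_seq in reference_library.items()
}
best_hit = max(hits, key=hits.get)
print(best_hit, hits)
```

A high shared fraction indicates that a gene resembling the reference is present, but, as noted above, it says nothing about whether the gene confers a resistance phenotype.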

3.6 Phylogenetic analysis

The phylogenetic analysis was designed to advance knowledge of the evolutionary history and genetic diversity of L. plantarum strains with regard to their ecological niches and linkage to AMR. This study provides a foundation for future directions in microbial genomics and its potential uses. Mash/MinHash and RAxML were used to infer the evolutionary history and phylogenetic relationships of the genome under study. To place the strain in the phylogenetic tree, the closest reference and representative genomes were identified, and their protein families were predicted as PGFams. The protein sequences derived from those families were aligned with MUltiple Sequence Comparison by Log-Expectation (MUSCLE), and the nucleotide sequence alignments were incorporated into a combined data matrix. Rate of Divergence (ROD) was then used for stable tree construction. Comprehensive phylogenetic analyses of this kind expand our knowledge of genetic variation and evolutionary processes within microbial taxa, serving as a basis for further studies of AMR and microbial ecology.
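The MinHash step used to find the closest reference genomes can be illustrated with a small Python sketch. It builds bottom-s sketches from k-mer hashes, estimates the Jaccard similarity, and converts it to a Mash-style distance D = (1/k) ln((1 + j) / (2j)). The k-mer length, sketch size, hash function, and toy sequences are assumptions chosen for the example; they are not the settings used in this study.

```python
import hashlib
import math


def kmer_hashes(sequence: str, k: int) -> set:
    """Hash every k-mer of a sequence to a 64-bit integer."""
    hashes = set()
    for i in range(len(sequence) - k + 1):
        digest = hashlib.sha1(sequence[i:i + k].encode()).digest()
        hashes.add(int.from_bytes(digest[:8], "big"))
    return hashes


def minhash_sketch(sequence: str, k: int, size: int) -> list:
    """Bottom-s sketch: the `size` smallest k-mer hashes."""
    return sorted(kmer_hashes(sequence, k))[:size]


def jaccard_estimate(sketch_a: list, sketch_b: list, size: int) -> float:
    """Estimate Jaccard similarity from two bottom-s sketches."""
    merged = sorted(set(sketch_a) | set(sketch_b))[:size]
    if not merged:
        return 0.0
    shared = set(sketch_a) & set(sketch_b)
    return sum(1 for h in merged if h in shared) / len(merged)


def mash_distance(jaccard: float, k: int) -> float:
    """Mash-style distance; defined as 1.0 when no k-mers are shared."""
    if jaccard <= 0.0:
        return 1.0
    return math.log((1.0 + jaccard) / (2.0 * jaccard)) / k


# Toy sequences for illustration only; seq_b differs by one substitution.
seq_a = "ATGGCGTACGTTAGCCTAGGCTTAACGGATCCGATTACA"
seq_b = "ATGGCGTACGTTAGCCTAGGATTAACGGATCCGATTACA"
K, SIZE = 11, 16
sketch_a = minhash_sketch(seq_a, K, SIZE)
sketch_b = minhash_sketch(seq_b, K, SIZE)
j_self = jaccard_estimate(sketch_a, sketch_a, SIZE)
j_ab = jaccard_estimate(sketch_a, sketch_b, SIZE)
print("self:", j_self, mash_distance(j_self, K))
print("a vs b:", j_ab, mash_distance(j_ab, K))
```

Sketch comparison is fast enough to screen large genome collections, which is why it is typically used to shortlist neighbors before a slower alignment-based tree is built.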

3.7 Pangenome and genome synteny analysis

Several pangenomic analyses were performed on the IPGA v1 platform, yielding detailed information about the pangenomic content of the selected genomes. First, assembly quality was evaluated to exclude low-quality assemblies and obtain accurate data for further analyses. Gene clustering was then used to identify ortholog groups, and this analysis showed differences in gene content and its functional classification within the pangenome. The IPGA visualization aids made these genomic differences easier to interpret. Figure 3 represents the Clusters of Orthologous Groups (COGs) results obtained by the pangenome analysis. The cluster core analysis results are shown in Figure 4.

Figure 3. Phylogenetic tree indicating the ancestral relationships between L. plantarum HMX2 and related strains.

Figure 4. The cluster core analysis results obtained by PanGenome analysis.

The quality of the assembled genomes was first analyzed to assess the reliability of the data used in the analysis. Through gene clustering, orthologous groups were defined to clarify the gene content, which differed significantly in functional classification between the core and peripheral pangenome. It was also observed that the cluster share analysis identified core genes present in all genomes, while accessory genes are found only in some genomes. These differences in gene content among the compared genomes were highlighted using tools available in IPGA, revealing the direction of genome divergence and potential adaptive strategies within the species under consideration. The results of the cluster-sharing analysis are presented in Figures 5, 6.

Figure 5. Clusters of Orthologous Groups (COG) results obtained by the pangenome analysis.

Figure 6. Cluster sharing between five genomes of Lactiplantibacillus plantarum.

The UpSet plot across the five studied genomes provides an in-depth view of how gene clusters intersect and distribute. The left bar plot gives each genome's total gene cluster count, ordered from the fewest to the most, with the SRCM100442 and DF strains at the two ends of the range. The top bar plot shows the size of each intersection; the largest comprises 2,561 universal core gene clusters, representing highly conserved genetic content across all genomes. Unique and co-occurring clusters are illustrated in the matrix shown in Figure 7, with black dots indicating which genomes take part in each intersection. For example, 433 gene clusters are shared exclusively between the BDGP2 and DF strains. More localized connections appear as smaller intersections, while single dots indicate genome-specific clusters. The plot captures both the conserved gene content shared across the genomes and the clusters unique to individual genomes (Supplementary Figure S1).
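The counts behind an UpSet plot can be derived from a table that records which genomes contain each gene cluster. The Python sketch below shows one way to compute them; the cluster membership is a toy example for illustration (not the study's clusters), and the function names are assumptions.

```python
from collections import Counter

# Toy cluster membership for illustration only; not the study's clusters.
cluster_members = {
    "cluster_1": {"BDGP2", "DF", "HMX2", "SRCM100442", "UNQLp11"},
    "cluster_2": {"BDGP2", "DF", "HMX2", "SRCM100442", "UNQLp11"},
    "cluster_3": {"BDGP2", "DF"},
    "cluster_4": {"HMX2"},
    "cluster_5": {"SRCM100442", "UNQLp11"},
}


def exclusive_intersections(members: dict) -> Counter:
    """Count clusters per exact set of genomes (one UpSet column each)."""
    return Counter(frozenset(genomes) for genomes in members.values())


def clusters_per_genome(members: dict) -> Counter:
    """Total clusters containing each genome (the UpSet side bars)."""
    totals = Counter()
    for genomes in members.values():
        totals.update(genomes)
    return totals


columns = exclusive_intersections(cluster_members)
totals = clusters_per_genome(cluster_members)
all_genomes = frozenset(totals)
print("core clusters:", columns[all_genomes])
print("BDGP2+DF only:", columns[frozenset({"BDGP2", "DF"})])
print("per-genome totals:", dict(totals))
```

Each column of the plot counts clusters found in exactly that combination of genomes, so the column totals sum to the number of clusters, whereas the side bars count every cluster a genome participates in.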

Figure 7. Core pangenome rarefaction analysis for the studied genomes.

The core pangenome rarefaction results and the UpSet plot reveal a substantial core genome, with 2,561 gene clusters shared across all five studied genomes. This extensive core genome reflects a high degree of genetic conservation, suggesting that these shared genes are likely crucial for essential cellular functions and survival. Additionally, the plot identifies several unique gene clusters specific to individual genomes and smaller sets of gene clusters shared among subsets of genomes. This highlights genetic diversity and potential adaptive traits unique to certain genomes (Table 6). The combination of a strong core genome and diverse accessory genes indicates a balance between conserved essential functions and variable adaptive capabilities across the studied genomes.

Table 6. Key genes associated with food safety and probiotic potential.

The number of core genes and their distribution patterns change as more genomes are included in the analysis. As shown in the box plots in Figure 7, the number of core gene clusters (in red) decreases sharply at first and then approaches saturation as more genomes are considered. In contrast, the number of pan-gene clusters (in blue) increases as additional genomes contribute new unique gene clusters. These trends illustrate pangenomic diversification and evolution, while the core genome content becomes relatively stable across the genomes studied. The analysis thus contributes to understanding the extent of variation and similarity among the genomes, which is crucial for exploring the evolutionary aspects of their functions. Figure 7 presents the core pangenome rarefaction analysis results.
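The core and pan curves described here can be reproduced from a presence/absence table by adding genomes one at a time and tracking the intersection (core) and union (pan) of their gene clusters. The Python sketch below averages over every genome ordering; the three toy genomes and the function name are assumptions for illustration and do not reflect the study's data.

```python
from itertools import permutations
from statistics import mean

# Toy presence/absence data: genome -> set of gene clusters (illustrative only).
genome_clusters = {
    "g1": {"a", "b", "c", "d"},
    "g2": {"a", "b", "c", "e"},
    "g3": {"a", "b", "f"},
}


def rarefaction(genomes: dict):
    """Mean core and pan cluster counts after adding 1..N genomes,
    averaged over every ordering of the genomes."""
    names = list(genomes)
    core_counts = [[] for _ in names]
    pan_counts = [[] for _ in names]
    for order in permutations(names):
        core = set(genomes[order[0]])
        pan = set(genomes[order[0]])
        for step, name in enumerate(order):
            core &= genomes[name]  # clusters present in every genome so far
            pan |= genomes[name]   # clusters present in any genome so far
            core_counts[step].append(len(core))
            pan_counts[step].append(len(pan))
    return [mean(c) for c in core_counts], [mean(p) for p in pan_counts]


core_curve, pan_curve = rarefaction(genome_clusters)
print("core:", core_curve)
print("pan: ", pan_curve)
```

Enumerating all orderings is only practical for a handful of genomes; for larger collections, a random sample of orderings is the usual substitute.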

3.8 Orthologous analysis

The UpSet plot analysis of the L. plantarum HMX2 genome shows substantial intersection patterns among five strains: BDGP2, UNQLp11, DF, HMX2, and SRCM100442. The largest intersection comprises 2,559 gene clusters, showing that the tested strains are genetically related. Notably, strain BDGP2 contains the most unique components (3,086), followed by strains DF and HMX2, which have 2,987 and 2,948, respectively. Strains SRCM100442 and UNQLp11 share many elements, revealing their high genetic similarity. Supplementary Figure S2 shows the UpSet plot for the five L. plantarum strains.

The cluster Venn diagram of the L. plantarum genomes shows the distribution and intersection of protein clusters in five strains: BDGP2, DF, HMX2, SRCM100442, and UNQLp11. The largest overlap comprises 2,559 clusters shared by all five strains, totaling 12,884 proteins. This shows that a large percentage of the genome is conserved among the L. plantarum strains. Furthermore, strain-specific clusters were detected, with BDGP2 having distinct clusters that presumably contribute to its high protein count. All strains contribute nearly equally in terms of protein distribution, while BDGP2, HMX2, and UNQLp11 have a greater number of unique proteins. These findings highlight the shared and distinct genes across the L. plantarum strains, reflecting both the individual characteristics of each strain and the common genes essential for the organism's survival and functionality. Supplementary Figure S3 presents the cluster Venn diagram, showing the distribution and intersection of protein clusters across the five strains.

3.9 Prediction of specialty genes

The whole-genome comparison showed genetic variations not found by the 16S rRNA gene approach, resulting in the identification of 25 antibiotic-resistance genes, emphasizing the need for investigating these features for food safety. Among the antibiotic resistance genes discovered are MurA and Alr, which encode the enzymes UDP-N-acetylglucosamine 1-carboxyvinyltransferase and alanine racemase, respectively. Both are important antibiotic targets in sensitive bacterial strains. Other important genes include kasA, rpoB, rpoC, gyrB, and gyrA, which encode enzymes essential for bacterial life, such as RNA polymerase and DNA gyrase, resulting in a wide variety of antibiotic resistance.

In addition, other transporter-related genes were found, including putative proteins and transporters, such as glycerol uptake facilitator proteins, which belong to the major intrinsic protein (MIP) family and are involved in nutrition and ion transport. The inclusion of genes such as GdpD and MprF, which modulate cell wall charge to aid antibiotic resistance, adds to L. plantarum HMX2's usefulness.

Overall, the whole-genome study of L. plantarum HMX2 sheds light on the strain's genetic diversity and its possible roles in controlling foodborne pathogens and as a probiotic. These findings open the path for further research into the strain's antibiotic resistance genes and their functional importance. Supplementary Table S2 shows the main specialty genes found in the L. plantarum HMX2 genome.

3.10 Food safety and probiotic potential

The genomic investigation of L. plantarum HMX2 revealed numerous important genes involved in food safety and probiotic activity. Notably, the plnJK and plnEF genes, which encode plantaricins, play an important role in suppressing foodborne pathogens such as Listeria, adding to the bacterium's antibacterial capabilities in fermented foods. Genes such as MurA and Alr are required for cell wall formation, improving bacterial integrity and supporting survival in the harsh environments found in food matrices and the gastrointestinal tract. In addition, transcription-related genes such as rpoB, rpoC, and rho were found, allowing L. plantarum to respond to environmental stress, which is critical for both food preservation and probiotic resilience. The presence of the MprF, GdpD, and PgsA genes, which help to change the cell membrane and cell wall charge, increases resistance to environmental stressors, including antibiotics, allowing the bacteria to survive food processing conditions and gut colonization. These findings emphasize L. plantarum's dual role in protecting food safety through antimicrobial activity while increasing probiotic potential by surviving in the gastrointestinal environment.

3.11 Protein interaction network

The STRING-based PPI network analysis of the input proteins (glpF6, fusA2, glpF1, rho, orf2, yidC1, and tuf) revealed multiple statistically significant connections. Except for the hypothetical protein ORF2, all proteins are linked by edges representing specific and significant interactions. glpF6 and glpF1, which encode MIP/aquaporin family members, may serve as diffusive transporters for substrates such as lactic acid, urea, and H2O2.

Two more genes, fusA2 and tuf, are involved in translation elongation, and their interaction implies a shared function in protein synthesis. Rho, a transcription termination factor, co-transcribes with fusA2, suggesting a link between transcription termination and translation elongation. Additionally, yidC1, which is involved in membrane protein insertion, is linked to both fusA2 and tuf, implying their joint involvement in incorporating proteins essential for translation. By dissecting the protein-protein interactome, notably the Hcm1-Sac3 complex, this analysis shows how these proteins are functionally interrelated in critical cellular processes such as transcription termination, translation elongation, and membrane protein insertion. Figure 8 shows the PPI network produced from STRING analysis.

Figure 8. The Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) network indicating the protein interactions of proteins encoded by specialty genes in Lactiplantibacillus plantarum HMX2.
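The connectivity described for this network can be sketched as a small undirected graph. The Python snippet below is only an illustration: it encodes the links named in the text (fusA2-tuf, rho-fusA2, yidC1-fusA2, yidC1-tuf) and adds one assumed edge between glpF6 and glpF1, which the text groups together without stating their exact connections. It does not query STRING and carries no confidence scores; the function and variable names are hypothetical.

```python
# Proteins submitted to STRING, as listed in the text.
proteins = ["glpF6", "fusA2", "glpF1", "rho", "orf2", "yidC1", "tuf"]

# Undirected edges. The first four are named in the text; the last one is an
# assumption made for illustration only.
edges = [
    ("fusA2", "tuf"),
    ("rho", "fusA2"),
    ("yidC1", "fusA2"),
    ("yidC1", "tuf"),
    ("glpF6", "glpF1"),  # assumed edge, not stated explicitly in the text
]


def build_adjacency(nodes, edge_list):
    """Return a dict mapping each node to the set of its neighbours."""
    adjacency = {node: set() for node in nodes}
    for a, b in edge_list:
        adjacency[a].add(b)
        adjacency[b].add(a)
    return adjacency


adjacency = build_adjacency(proteins, edges)

# Degree = number of interaction partners of each protein.
degree = {node: len(neighbours) for node, neighbours in adjacency.items()}

# Proteins without any partner (expected here: the hypothetical protein orf2).
isolated = sorted(node for node, d in degree.items() if d == 0)

# Proteins with the highest number of partners.
max_degree = max(degree.values())
hubs = sorted(node for node, d in degree.items() if d == max_degree)

print("isolated:", isolated)
print("hubs:", hubs)
```

Under these assumptions, orf2 is the only unconnected node and fusA2 has the most partners, which matches the description of ORF2 as the sole protein without edges.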

3.12 Gene co-occurring analysis

The major clusters reflect several larger "functional groups" that recur across genomes and typically contain genes that co-occur. GlpF6, a putative transporter protein, is closely associated with genes involved in metabolic processes and transport. Similarly, considerable co-occurrence was found for fusA2 and tuf, which are involved in translation elongation. Rho is largely associated with genes that regulate transcription and translation, showing its importance in gene expression. YidC1 is associated with genes involved in membrane protein insertion, indicating that it functions in protein organization and integration within the membrane. In contrast, the putative protein orf2 has few connections, indicating that it may have a more specialized or context-dependent role. Figure 9 depicts the co-occurrence analysis of the specialty genes present in the genome of L. plantarum HMX2.

Figure 9. Gene co-occurrence analysis of the specialty genes present in the genome of Lactiplantibacillus plantarum HMX2.
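Gene co-occurrence is commonly assessed by comparing presence/absence profiles of genes across genomes. The Python sketch below illustrates that idea with a Jaccard index. The profiles are invented toy data, not results from this study, and the scoring is a simplification rather than the method STRING applies.

```python
from itertools import combinations

# Hypothetical presence/absence profiles: gene -> set of genome labels in
# which a homolog is detected. These values are illustrative only.
profiles = {
    "fusA2": {"G1", "G2", "G3", "G4"},
    "tuf": {"G1", "G2", "G3", "G4"},
    "rho": {"G1", "G2", "G3"},
    "orf2": {"G1"},
}


def jaccard(a, b):
    """Jaccard index of two sets: shared genomes divided by all genomes."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Score every unordered gene pair (keys are alphabetically ordered pairs).
scores = {
    (g1, g2): jaccard(profiles[g1], profiles[g2])
    for g1, g2 in combinations(sorted(profiles), 2)
}

# List pairs from the strongest to the weakest co-occurrence.
for pair, score in sorted(scores.items(), key=lambda item: -item[1]):
    print(pair, round(score, 2))
```

In this toy data, fusA2 and tuf share an identical profile and score highest, while orf2, present in a single genome, scores low against every other gene.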

4 Discussion

The genomic features of L. plantarum HMX2 were comprehensively analyzed using WGS and bioinformatics. The findings provide important information about its genetic, functional, and evolutionary relationships, laying the foundation for future research into its various uses. They also give a better understanding of the genetics of L. plantarum and its implications for food research, probiotics, and antibiotic research. Subsequent studies will focus on applying these findings to biotechnological developments and evaluating their contribution to food safety and health (Aziz et al., 2023b). The high genomic similarity (99.17% ANI) between L. plantarum HMX2 and the reference strain Z.61 confirmed that it is a member of the L. plantarum species. This relationship is consistent with other studies that have reported differences in the genetic diversity of L. plantarum strains from different locations. Similar genetic markers have been found in other Lactobacillus bacteria. Although this similarity supports the taxonomic status of the strain, further research is needed to explore the functional properties of L. plantarum HMX2 (Elagamey et al., 2023). Additionally, the 25 potential antibiotic resistance genes in L. plantarum HMX2 deserve detailed study. Although these genes are present in the genome, it is important to understand their expression patterns and their impact on food safety practices.
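As a rough illustration of what an ANI value summarizes, the Python sketch below averages per-fragment percent identities and compares the result with a species cutoff. It is not the OrthoANIu algorithm: the fragment identities are invented, the 70% fragment filter is an assumption, and the 95% cutoff is the commonly used species boundary rather than a value taken from this study. The function names are hypothetical.

```python
def average_nucleotide_identity(fragment_identities, min_identity=70.0):
    """Mean percent identity over fragments at or above min_identity.

    fragment_identities: percent identity of each aligned genome fragment.
    min_identity: assumed filter that discards poorly matching fragments.
    """
    kept = [value for value in fragment_identities if value >= min_identity]
    if not kept:
        raise ValueError("no fragment passed the identity cutoff")
    return sum(kept) / len(kept)


def same_species(ani, threshold=95.0):
    """Apply a conventional ANI species boundary (assumed to be 95%)."""
    return ani >= threshold


# Toy fragment identities; the last fragment falls below the filter.
toy_fragments = [99.5, 99.0, 98.8, 99.4, 45.0]
ani = average_nucleotide_identity(toy_fragments)

print(f"ANI = {ani:.2f}%")
print("same species:", same_species(ani))
```

With a cutoff of this kind, the reported 99.17% ANI lies well above the boundary, which is consistent with assigning HMX2 to L. plantarum.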

Other strains of L. plantarum have been shown to have similar genetic characteristics, including the ability to withstand stress, adapt to the environment, and compete with pathogens. In addition, the 25 putative antibiotic resistance genes in L. plantarum HMX2 need to be examined in detail. These observations are consistent with previous studies on the diversity of L. plantarum and support the concept of the bacterial pangenome. Phylogenetic analysis elucidated the evolutionary relationships of L. plantarum HMX2 and placed it in a group of closely related strains. This underscores the need for further studies to demonstrate the beneficial effects of similar Lactobacillus species (Lu et al., 2024).

The comparative genomic analysis of the L. plantarum strains thus revealed significant genetic relatedness as well as differences that shed light on their evolutionary history and roles (Aziz et al., 2023c). The OrthoANIu value was as high as 99.17%, which implies that the compared genomes share a high degree of relatedness and a high proportion of nucleotide identity within the species. The draft genome and annotation of L. plantarum HMX2 laid out the genetic blueprint of this economically beneficial strain in detail, with 3,242 CDSs, 65 tRNA genes, and 16 rRNA genes; many of the CDSs encode hypothetical proteins, indicating possibilities for future research. This detailed genomic information gives a better view of the strain's functional potential and its evolutionary history. The pangenome analysis revealed a large core genome of 2,561 gene clusters, reflecting the large number of orthologous genes and the degree of genetic conservation of basic cellular processes. The finding of specialty genes, such as antibiotic resistance, transporter, and other resistance genes, sheds light on the adaptation of the strain and its significance to food safety and probiotics. Together, these genomic differences and phylogenetic relationships constitute a strong foundation for understanding the genetic and evolutionary systems within L. plantarum, opening up opportunities for future work on its functional and adaptive characteristics.
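The distinction between core and accessory gene clusters can be sketched with simple set operations. The Python example below uses invented cluster identifiers and strain labels (only "HMX2" echoes the text) and is a generic illustration of the core/accessory/unique partition, not the pipeline used in this study.

```python
from collections import Counter

# Hypothetical cluster membership: genome -> set of orthologous cluster IDs.
genomes = {
    "HMX2": {"c1", "c2", "c3", "c4"},
    "strainA": {"c1", "c2", "c3", "c5"},
    "strainB": {"c1", "c2", "c6"},
}


def partition_pangenome(genome_clusters):
    """Split clusters into core (all genomes), accessory (some), unique (one)."""
    counts = Counter(
        cluster
        for clusters in genome_clusters.values()
        for cluster in clusters
    )
    n_genomes = len(genome_clusters)
    core = {c for c, k in counts.items() if k == n_genomes}
    unique = {c for c, k in counts.items() if k == 1}
    accessory = set(counts) - core - unique
    return core, accessory, unique


core, accessory, unique = partition_pangenome(genomes)

print("core:", sorted(core))
print("accessory:", sorted(accessory))
print("unique:", sorted(unique))
```

In this toy input, two clusters are shared by every genome and form the core, one is shared by two genomes, and three are strain-specific. The 2,561 core clusters reported above correspond to the first category.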

This study offers new insights into the genomics of L. plantarum HMX2, providing knowledge about its genomic features and their possible relationship with antibiotic resistance (Hu et al., 2023). Notably, the identification of 25 antibiotic resistance genes, together with their resistance mechanisms, points to the ability of the strain to resist several antimicrobial agents, which has important implications for food safety and public health. The phylogenetic and pangenomic analyses not only reveal how various L. plantarum strains evolved but also expose the core and accessory genes required for survival and versatility. These results highlight the need for further research into microbial genomes to uncover the functional complexity of different microbial species, especially in the domains of probiotics and foodborne pathogens. Such research is important for developing effective strategies to combat antibiotic resistance and enhance food security (Contente et al., 2024; Aljohani et al., 2024).

This study opens several avenues for further research. One important area is the experimental confirmation of the antibiotic resistance genes discovered in L. plantarum HMX2, which will be critical in assessing the strain's safety and suitability for food applications. Comparative genomics may also be used to explore functional diversity across several L. plantarum strains, providing further insight into their probiotic potential. Future studies might also examine how the strain interacts with the human gut microbiome to better understand its health benefits and potential uses in functional foods.

5 Conclusions

In conclusion, this work on L. plantarum HMX2 contributes to genome analysis, genetic diversity research, and food safety applications. The discovery of antibiotic resistance genes in L. plantarum HMX2 underscores the relevance of the strain to food safety. Future comparative genomic investigations are likely to give useful insights into the functional diversity and evolutionary dynamics of L. plantarum, increasing its potential as a probiotic and broadening its applicability in developing innovative antimicrobial therapies. This thorough understanding of the L. plantarum genome provides a solid framework for its use in various industrial and manufacturing processes, ultimately supporting its incorporation into food production and safety measures.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary material.

Author contributions

TA: Conceptualization, Methodology, Writing – original draft. MN: Formal analysis, Investigation, Writing – review & editing. MS: Investigation, Methodology, Writing – original draft. AS: Investigation, Software, Writing – review & editing. JN: Methodology, Software, Writing – review & editing. LZ: Project administration, Supervision, Writing – review & editing. ZY: Funding acquisition, Supervision, Writing – review & editing. HC: Formal analysis, Validation, Writing – review & editing. LL: Data curation, Supervision, Writing – review & editing. THA: Validation, Visualization, Writing – original draft.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research work was financially supported by the National Natural Science Foundation of China (Project No. 32272296), the National Key R&D Program of China (2021YFA0910800 and 2023YFF1103402), and the Natural Science Foundation of Guangdong Province (Grant No. 2022A1515012043).

Acknowledgments

The authors extend their appreciation to Research Supporting Project (RSPD2024R568) King Saud University, Riyadh, Saudi Arabia.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2024.1504625/full#supplementary-material

References

Al-Aamri, A., Taha, K., Al-Hammadi, Y., Maalouf, M., and Homouz, D. (2019). Analyzing a co-occurrence gene-interaction network to identify disease-gene association. BMC Bioinform. 20, 1–15. doi: 10.1186/s12859-019-2634-7

Aljohani, A., Rashwan, N., Vasani, S., Alkhawashki, A., Wu, T. T., Lu, X., et al. (2024). The health benefits of probiotic Lactiplantibacillus plantarum: a systematic review and meta-analysis. Probiot. Antimicrob. Prot. doi: 10.1007/s12602-024-10287-3. [Epub ahead of print].

Avershina, E., Khezri, A., and Ahmad, R. (2023). Clinical diagnostics of bacterial infections and their resistance to antibiotics—current state and whole genome sequencing implementation perspectives. Antibiotics 12:781. doi: 10.3390/antibiotics12040781

Aziz, T., Naveed, M., Jabeen, K., Shabbir, M. A., Sarwar, A., Zhennai, Y., et al. (2023b). Integrated genome based evaluation of safety and probiotic characteristics of Lactiplantibacillus plantarum YW11 isolated from Tibetan kefir. Front. Microbiol. 14:1157615. doi: 10.3389/fmicb.2023.1157615

Aziz, T., Naveed, M., Makhdoom, S. I., Ali, U., Mughal, M. S., Sarwar, A., et al. (2023a). Genome investigation and functional annotation of Lactiplantibacillus plantarum YW11 revealing streptin and ruminococcin-A as potent nutritive bacteriocins against gut symbiotic pathogens. Molecules 28:491. doi: 10.3390/molecules28020491

Aziz, T., Naveed, M., Shabbir, M. A., Sarwar, A., Ali Khan, A., Zhennai, Y., et al. (2023c). Comparative genomics of food-derived probiotic Lactiplantibacillus plantarum K25 reveals its hidden potential, compactness, and efficiency. Front. Microbiol. 14:1214478. doi: 10.3389/fmicb.2023.1214478

Beltrán-Velasco, A. I., Reiriz, M., Uceda, S., and Echeverry-Alzate, V. (2024). Lactiplantibacillus (Lactobacillus) plantarum as a complementary treatment to improve symptomatology in neurodegenerative disease: a systematic review of open access literature. Int. J. Mol. Sci. 25:3010. doi: 10.3390/ijms25053010

Contente, D., Díaz-Formoso, L., Feito, J., Gómez-Sala, B., Costas, D., Hernández, P. E., et al. (2024). Antimicrobial activity, genetic relatedness, and safety assessment of potential probiotic lactic acid bacteria isolated from a rearing tank of rotifers (Brachionus plicatilis) used as live feed in fish larviculture. Animals 14:1415. doi: 10.3390/ani14101415

de Albuquerque, N. R. M., and Haag, K. L. (2023). Using average nucleotide identity (ANI) to evaluate microsporidia species boundaries based on their genetic relatedness. J. Eukaryot. Microbiol. 70:e12944. doi: 10.1111/jeu.12944

Elagamey, E., Abdellatef, M. A. E., Haridy, M. S. A., and Abd El-aziz, E. A. E. (2023). Evaluation of natural products and chemical compounds to improve the control strategy against cucumber powdery mildew. Eur. J. Plant Pathol. 165, 385–400. doi: 10.1007/s10658-022-02612-9

Horsfield, S. T., Tonkin-Hill, G., Croucher, N. J., and Lees, J. A. (2023). Accurate and fast graph-based pangenome annotation and clustering with ggCaller. Genome Res. 33, 1622–1637. doi: 10.1101/gr.277733.123

Hu, G., Wang, Y., Xue, R., Liu, T., Zhou, Z., and Yang, Z. (2023). Effects of the exopolysaccharide from Lactiplantibacillus plantarum HMX2 on the growth performance, immune response, and intestinal microbiota of juvenile turbot, Scophthalmus maximus. Foods 12:2051. doi: 10.3390/foods12102051

Jahanshahi, D. A., Ariaeenejad, S., and Kavousi, K. (2023). A metagenomic catalog for exploring the plastizymes landscape covering taxa, genes, and proteins. Sci. Rep. 13:16029. doi: 10.1038/s41598-023-43042-9

Liu, D., Zhang, Y., Fan, G., Sun, D., Zhang, X., Yu, Z., et al. (2022). IPGA: a handy integrated prokaryotes genome and pan-genome analysis web service. iMeta 1:e55. doi: 10.1002/imt2.55

Liu, Y., Zhang, R., Wang, B., Huo, Z., and Zhang, F. (2023). Evaluation of antibiotic resistance in Lactobacillus plantarum and their probiotic characteristics during the laboratory evolution in ampicillin and amoxicillin environment. Int. J. Dairy Technol. 76, 909–919. doi: 10.1111/1471-0307.12978

Lu, Y.-h., Liang, W.-s., Wang, R., Liang, Q.-c., Zeng, X.-A., and Huang, Y.-y. (2024). Assessment of the safety and probiotic properties of Lactiplantibacillus plantarum HYY-DB9 based on comprehensive genomic and phenotypic analysis. LWT 203:116386. doi: 10.1016/j.lwt.2024.116386

Olson, R. D., Assaf, R., Brettin, T., Conrad, N., Cucinell, C., Davis, J. J., et al. (2023). Introducing the bacterial and viral bioinformatics resource center (BV-BRC): a resource combining PATRIC, IRD and ViPR. Nucl. Acids Res. 51, D678–D689. doi: 10.1093/nar/gkac1003

Rajput, A., Chauhan, S. M., Mohite, O. S., Hyun, J. C., Ardalani, O., Jahn, L. J., et al. (2023). Pangenome analysis reveals the genetic basis for taxonomic classification of the Lactobacillaceae family. Food Microbiol. 115:104334. doi: 10.1016/j.fm.2023.104334

Sadanov, A., Alimzhanova, M., Ismailova, E., Shemshura, O., Ashimuly, K., Molzhigitova, A., et al. (2023). Antagonistic and protective activity of Lactobacillus plantarum strain 17 M against E. amylovora. World J. Microbiol. Biotechnol. 39:314. doi: 10.1007/s11274-023-03765-3

Snyder, E., Kampanya, N., Lu, J., Nordberg, E. K., Karur, H. R., Shukla, M., et al. (2007). PATRIC: the VBI pathosystems resource integration center. Nucl. Acids Res. 35(Suppl_1), D401–D406. doi: 10.1093/nar/gkl858

Sun, J., Lu, F., Luo, Y., Bie, L., Xu, L., and Wang, Y. (2023). OrthoVenn3: an integrated platform for exploring and visualizing orthologous data across genomes. Nucl. Acids Res. 51, W397–W403. doi: 10.1093/nar/gkad313

Syaputri, Y., Lei, J., Hasegawa, T., Fauzia, S., Ratningsih, N., Erawan, T. S., et al. (2023). Characterization of plantaricin genes and lactic acid production by Lactiplantibacillus plantarum strains isolated from Ishizuchi-Kurocha. Appl. Food Biotechnol. 10, 21–31. doi: 10.22037/afb.v10i1.39166

Szklarczyk, D., Franceschini, A., Wyder, S., Forslund, K., Heller, D., Huerta-Cepas, J., et al. (2015). STRING v10: protein–protein interaction networks, integrated over the tree of life. Nucl. Acids Res. 43, D447–D452. doi: 10.1093/nar/gku1003

Umanets, A., Surono, I. S., and Venema, K. (2023). I am better than I look: genome based safety assessment of the probiotic Lactiplantibacillus plantarum IS-10506. BMC Genom. 24:518. doi: 10.1186/s12864-023-09495-y

Vidulin, V., Šmuc, T., and Supek, F. (2016). Extensive complementarity between gene function prediction methods. Bioinformatics 32, 3645–3653. doi: 10.1093/bioinformatics/btw532

von Mering, C., Huynen, M., Jaeggi, D., Schmidt, S., Bork, P., and Snel, B. (2003). STRING: a database of predicted functional associations between proteins. Nucl. Acids Res. 31, 258–261. doi: 10.1093/nar/gkg034

Wang, J., Yang, W., Zhang, S., Hu, H., Yuan, Y., Dong, J., et al. (2023). A pangenome analysis pipeline provides insights into functional gene identification in rice. Genome Biol. 24:19. doi: 10.1186/s13059-023-02861-9

Wang, Y., Coleman-Derr, D., Chen, G., and Gu, Y. Q. (2015). OrthoVenn: a web server for genome wide comparison and annotation of orthologous clusters across multiple species. Nucl. Acids Res. 43, W78–W84. doi: 10.1093/nar/gkv487

Yoon, S. H., Ha, S. M., Kwon, S., Lim, J., Kim, Y., Seo, H., et al. (2017). A large-scale evaluation of algorithms to calculate average nucleotide identity. Antonie Van Leeuwenhoek 110, 1281–1286. doi: 10.1007/s10482-017-0844-4

Keywords: comparative genomics, HMX2, coding sequences, food safety, L. plantarum

Citation: Aziz T, Naveed M, Shabbir MA, Sarwar A, Naseeb J, Zhao L, Yang Z, Cui H, Lin L and Albekairi TH (2024) Unveiling the whole genomic features and potential probiotic characteristics of novel Lactiplantibacillus plantarum HMX2. Front. Microbiol. 15:1504625. doi: 10.3389/fmicb.2024.1504625

Received: 01 October 2024; Accepted: 24 October 2024;
Published: 14 November 2024.

Edited by:

Elena Bartkiene, Lithuanian University of Health Sciences, Lithuania

Reviewed by:

Shilei Wang, Zhengzhou University, China
Shengqian Sun, Yantai Institute of Technology, China

Copyright © 2024 Aziz, Naveed, Shabbir, Sarwar, Naseeb, Zhao, Yang, Cui, Lin and Albekairi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Liqing Zhao, lqzhao@szu.edu.cn; Zhennai Yang, yangzhennai@163.com; Lin Lin, linl@ujs.edu.cn
