Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 28 March 2022
Sec. Extreme Microbiology
This article is part of the Research Topic Extremophiles: Microbial Genomics and Taxogenomics View all 26 articles

Polyphasic Identification and Genomic Insights of Leptothermofonsia sichuanensis gen. sp. nov., a Novel Thermophilic Cyanobacteria Within Leptolyngbyaceae

  • 1Antibiotics Research and Re-evaluation Key Laboratory of Sichuan Province, Sichuan Industrial Institute of Antibiotics, Chengdu University, Chengdu, China
  • 2School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen, China
  • 3Department of Pharmaceutical Microbiology, Faculty of Pharmacy Medical University of Gdańsk, Gdańsk, Poland
  • 4Laboratory of Plant Protection and Biotechnology, Intercollegiate Faculty of Biotechnology University of Gdańsk and Medical University of Gdańsk, University of Gdańsk, Gdańsk, Poland

Thermal environments are an important reservoir of thermophiles with significant ecological and biotechnological potentials. However, thermophilic isolates remain largely unrecovered from their habitats and are rarely systematically identified. In this study, we characterized using polyphasic approaches a thermophilic strain, PKUAC-SCTAE412 (E412 hereafter), recovered from Lotus Lake hot spring based in Ganzi prefecture, China. The results of 16S rRNA/16S-23S ITS phylogenies, secondary structure, and morphology comparison strongly supported that strain E412 represent a novel genus within Leptolyngbyaceae. This delineation was further confirmed by genome-based analyses [phylogenomic inference, average nucleotide/amino-acid identity, and the percentages of conserved proteins (POCP)]. Based on the botanical code, the isolate is herein delineated as Leptothermofonsia sichuanensis gen. sp. nov, a genus adjacent to recently delineated Kovacikia and Stenomitos. In addition, we successfully obtained the first complete genome of this new genus. Genomic analysis revealed its adaptations to the adverse hot spring environment and extensive molecular components related to mobile genetic elements, photosynthesis, and nitrogen metabolism. Moreover, the strain was capable of modifying the composition of its light-harvesting apparatus depending on the wavelength and photoperiod, showing chromatic adaptation capacity characteristic for T1 and T2 pigmentation types. Other physiological studies showed the strain’s ability to utilize sodium bicarbonate and various sulfur compounds. The strain was also shown to be diazotrophic. Interestingly, 24.6% of annotated protein-coding genes in the E412 genome were identified as putatively acquired, hypothesizing that a large number of genes acquired through HGT might contribute to the genome expansion and habitat adaptation of those thermophilic strains. Most the HGT candidates (69.4%) were categorized as metabolic functions as suggested by the KEGG analysis. Overall, the complete genome of strain E412 provides the first insight into the genomic feature of the genus Leptothermofonsia and lays the foundation for future global ecogenomic and geogenomic studies.

Introduction

Thermophilic cyanobacteria are photoautotrophic prokaryotes that inhabit inhospitable niches, such as hot springs and other thermal environments. These thermophiles are essentially primary producers of the geothermal ecosystems with substantial ecological importance (Esteves-Ferreira et al., 2018; Li et al., 2021). Besides, thermophilic cyanobacteria and their secondary metabolites have shown some potentials for biotechnological applications, including biosynthesis of thermostable metabolites and carbon valorisation vehicles (Patel et al., 2019). Unfortunately, thermophilic isolates remain largely unrecovered from their habitats to date and lack comprehensive knowledge about their physiology, ecology, systematics, and adaptations to the adverse thermal habitats.

Recently, next-generation sequencing (NGS) has been extensively applied to explore the cyanobacterial diversity in thermal environments (Tang et al., 2018b; Alcorta et al., 2020; Chen et al., 2021). However, NGS only generates ample abstract data and its assembly often results in unspecified operational taxonomic units (OTUs) and higher taxonomic groups. Isolation of thermophilic cyanobacteria from different ecosystems is fundamental for detailed characterization of their morphology, genetics, physiology, and biochemistry and for a better understanding of thermal ecology and their adaptation to thermal habitats (Cordeiro et al., 2020; Tang et al., 2022). However, the simple morphology of cultivated cyanobacteria makes these strains ambiguous and their taxonomic allocation challenging (Komárek and Anagnostidis, 2005). To address this problem, polyphasic taxonomic classification approaches have been widely demonstrated to be effective for cyanobacterial identification, especially for understudied or unresolved polyphyletic families/genera/species and identification of novel families and genera (Raabova et al., 2019; Shalygin et al., 2020). Consequently, establishing new genera as references will benefit the reallocation of numerous taxonomically ambiguous strains. For instance, Leptolyngbya-like strains have been scattered across phylogenies based on 16S rRNA or multi-locus sequence analysis within or even beyond family Leptolyngbyaceae (Sciuto and Moro, 2016; Yao et al., 2021). As the taxonomic delineation of isolates through systematic identification, numerous Leptolyngbya-like strains were assigned to new genus or species, such as Allonema, Enugrolinea, and Thermoleptolyngbya (Sciuto and Moro, 2016; Walter et al., 2017).

Genomic studies on thermophilic cyanobacteria are increasingly popular, and the better availability of complete genome sequences from thermal isolates and Metagenome-Assembled Genomes (MAGs) allows the studies of taxonomic relationships and genomic adaptations to different ecological niches among thermophilic strains and provides insight into survival strategies in extreme conditions not fully comprehended up to now (Chen et al., 2021). In addition, the acquisition of a complete genome may provide novel insights into the genomic features of new thermophilic microorganisms. More importantly, under the current scenario of global warming, it is important to understand the evolution of hot spring genomes as an example of selective pressure in warmer environments (Alcorta et al., 2020). Therefore, the acquisition of genomes from thermophilic cyanobacteria as much as possible is a prerequisite for such studies.

In the current study, we collected and analyzed morphological, physiological, and molecular data for the representative of an entirely novel genus of a filamentous thermophilic cyanobacterium isolated from a Lotus Lake hot spring based in Ganzi prefecture western Sichuan province of China. After thorough taxogenomic evaluation combined with 16S rRNA/16S-23S ITS phylogenies, secondary structure, and morphology comparison, a new name Leptothermofonsia sichuanensis gen. sp. nov. has been proposed based on the botanical code for the strain as the first representative of genus Leptothermofonsia. Furthermore, genomic features of this genus were studied, and some genes were correlated with the physiological properties.

Materials and Methods

Leptothermofonsia sichuanensis gen. sp. nov. E412: Origins, Cultivation, and Basic Physiological Assessment

The strain E412 presented in the current study was initially isolated from Lotus Lake hot springs in the Ganzi region of Sichuan Province, China. Information about the sampling site and preliminary taxonomic allocation of the strain was reported in our previous studies (Tang et al., 2018a,b). The unicyanobacterial culture of the strain was cryopreserved as 10% DMSO in BG11 stocks in −80°C. Final precultures for experiments were established as previously described (Tang et al., 2018b) and cultivated at 45°C in 150 mL BG-11 medium in 500 mL Erlenmeyer flasks agitated at 100 rpm under 12L:12D photoperiod at 45 μmol m–2 s–1 provided by fluorescent tubes unless stated otherwise. The strain initially denoted as PKUAC-SCTE412 has been recently deposited in the Freshwater Algae Culture Collection at the Institute of Hydrobiology (FACHB-collection) with an accession number FACHB-2490. Physiological assessment of the strain was performed using BG-11 medium free of an essential nutrient (nitrogen or sulfur) supplemented with 17 mM NaNO2, 85 mM NaNO3, 10 mM Na2SO4, and 10 mM NaHSO3. The nitrogen fixation capacity during 72 h was performed using the methodology described by Li et al. (2021).

The chromatic adaptation capacity of strain E412 was assessed by culturing the cells at constant LED illumination using either white light (6,500 K) or far-red light (730 nm) at the intensity of 250 and 25 μmol m–2 s–1, respectively. After 15 days, cyanobacterial cells were collected to measure chlorophyll a, chlorophyll b, carotenoids, allophycocyanin, phycocyanin, and phycoerythrin, according to previously published protocols (Bennett and Bogorad, 1973; Dere et al., 1998). Lipophilic and water-soluble pigments from 15 mg of lyophilized biomass were extracted with 100% methanol and phosphate-buffered saline (PBS), respectively. The absorbance of the supernatant at 470, 562, 615, 652, 653, and 666 nm were recorded against the relative blank using a UV-Vis spectrophotometer (Shimadzu UV-1,800, Japan). The concentration of pigments was calculated using respective formulae for chlorophylls and carotenoids (Dere et al., 1998) and phycobiliproteins (Bennett and Bogorad, 1973).

Genome Sequencing, Assembly, and Annotation of Strain E412

Genomic DNA was extracted and assessed following the previously described method (Tang et al., 2018a). The whole genome of E412 was sequenced using a combination of Oxford Nanopore Technologies (ONT) and Illumina short-read approaches. Illumina sequencing of E412 genome generated 6,810,074 filtered paired-end reads (clean data), providing approximately 185-fold coverage of the genome. The clean data were assembled into contigs using MaSuRCA v. 3.4.1 with default parameters (Zimin et al., 2013) and Flye for final assembly, generating a single contig after manual curation. The draft circular genome was error-corrected with Illumina NovaSeq reads using Burrows-Wheeler Aligner, BWA v0.7.17 (Li and Durbin, 2009), and then Pilon v1.23 (Walker et al., 2014). The complete genome has been deposited in GenBank with an accession number CP072600.

The annotation of the E412 genome was performed using a pipeline described by Tang et al. (2019). In short, initial gene prediction and annotation were completed using the automatic NCBI prokaryotic genome annotation pipeline (Tatiana et al., 2016). Subsequently, poor calls were corrected using the RAST annotation system. The insertion sequence (IS) was detected and annotated by ISsaga (Varani et al., 2011). Prophage regions and CRISPR/Cas loci were detected by PHASTER (David et al., 2016) and by CRISPRCasFinder server (Couvin et al., 2018), respectively.

For functional classification, the predicted protein sequences were searched against the NCBI non-redundant database using BLASTP with an E-value cutoff of 1e-5. The search outputs were imported into BLAST2GO V5.2.5 (Conesa et al., 2005) for GO term mapping. GO classification was subsequently conducted by WEGO (Ye et al., 2006) using the following ontologies: biological process, molecular function, and cellular component. Additionally, KEGG orthology (KO) identifiers were assigned to predicted protein sequences by KofamKOALA (Aramaki et al., 2019), and then BRITE mapping process was performed online.1

Phylogenetic Reconstruction of 16S rRNA and 16S-23S ITS

The 16S rRNA gene and 16S-23S ITS regions were extracted from the E412 for phylogenetic analysis. Cyanobacterial sequences were retrieved from GenBank through BLAST search as references to construct datasets for phylogenetic analyses of 16S rRNA gene and 16S-23S ITS, respectively. Multiple alignments of sequences were performed by Muscle implemented in Mega7 (Kumar et al., 2016) and manually edited where necessary. Maximum-Likelihood (ML) phylogenetic analyses were carried out using PhyML v3.0 (Guindon et al., 2010), and the substitution models were selected by the Model Selection function implemented in PhyML (Vincent et al., 2017) under Akaike information criterion (AIC). A non-parametric bootstrap test (1,000 replications) was performed to assess the robustness of tree topologies.

Prediction of Secondary Structures

The conserved domains of the 16S-23S ITS region: D1-D1′, D2, D3, boxA, and D4; and its variable regions (V2, boxB, and V3) were identified as previously described (Iteman et al., 2000). The tRNAs presented in the spacer were identified by tRNAscan-SE v1.3.1 (Lowe and Eddy, 1997). The secondary structures of the identified fragments were individually determined by RNAstructure web server using default settings (Mathews, 2014).

Microscopic Analysis

Strain E412 was investigated at 400 × magnification using light microscopy (LM, DP72, OLYMPUS, Japan), equipped with an image acquisition system (U-TV0.63XC, OLYMPUS, Japan). Microscopic observations were also performed using scanning electron microscopy (SEM) (SU8100, HITACHI, Japan), and using transmission electron microscopy (TEM) (HT7800, HITACHI, Japan), essentially as described before (Tang et al., 2022).

Genome-Based Analysis

To compare divergence in genomes of focal taxa (E412 and previously established genera within Leptolyngbyaceae), a high-quality dataset comprising genomes with near completeness (≥ 90%) and low contamination (<5%) was constructed to calculate whole-genome average nucleotide identity (ANI) and average amino acid identity (AAI) using the ANI/AAI calculator with default settings.2 In addition, the percentages of conserved proteins (POCP) between the E412 genome and focal taxa were pairwise calculated to estimate their evolutionary and phenotypic distance. Finally, the POCP was determined for prokaryotic genus delineation according to the method described previously (Qin et al., 2014).

The concatenated sequences from single-copy genes shared by all the genomes were used to elucidate the phylogenomic relationship between E412 and focal taxa. Single-copy genes shared by all genomes were refined from the homologous gene clusters identified by OrthoMCL (Li et al., 2003) and concatenated using custom Perl script. MAFFT v7.453 (Standley, 2013) was used to generate multisequence alignment. The supergene alignment was subjected to phylogenomic inference using IQ-TREE v2.1.3 (Minh et al., 2020). ModelFinder implemented in IQ-TREE was employed to select the optimal substitution model for phylogenomic analysis from 546 protein models. Bootstrap tests (1,000 replicates) were performed to assess tree topologies using UltraFast Bootstrap (Hoang et al., 2018). A strain from the family Oculatellaceae, Thermoleptolyngbya sichuanensis, was rooted as an outgroup.

Taxonomic Evaluation

For a valid description, the classification system applied was based on Komárek et al. (2014), and the taxon description follows the prescriptions of the Botanical Code, International Code of Nomenclature for Algae, Fungi, and Plants (Shenzhen code) (Turland et al., 2018).

Identification of Horizontal Gene Transfer Candidates

A BLASTP-based approach was employed to identify potential genes acquired through horizontal gene transfer (HGT). BLASTP searches were performed against the NCBI non-redundant protein database (last accessed January 20, 2018). The pairs that accounted for at least 90% of the query length and amino acid sequence similarity of at least 40% were used for the HGT detection. Taxonomic classification was assigned to each hit with the taxonomy files downloaded from the NCBI database. Protein sequences were identified as HGT candidates if all the top-five hits were not from the family Leptolyngbyaceae.

Results and Discussion

Phylogeny of 16S rRNA Gene

The ML phylogram (Figure 1) inferred from 16S rRNA gene sequences recognized 17 well-supported clades of previously established genera: Alkalinema (Vaz et al., 2015), Arthronema (Komárek and Lukavský, 1988), Chroakolemma (Becerra-Absaln et al., 2018), Kovacikia (Miscoe et al., 2016), Leptodesmis (Raabova et al., 2019), Leptolyngbya sensu stricto (Taton et al., 2010), Limnolyngbya (Li and Li, 2016), Myxacorys (Soares et al., 2019), Neosynechococcus (Dvorak et al., 2014), Onodrimia (Jahodářová et al., 2017), Pantanalinema (Vaz et al., 2015), Phormidesmis (Raabova et al., 2019), Pinocchia (Dvorak et al., 2015), Planktolyngbya (Thomazeau et al., 2010), Plectolyngbya (Taton et al., 2011), Scytolyngbya (Song et al., 2015), Stenomitos (Miscoe et al., 2016), and Gloeobacter as an outgroup of the phylogram. Strain E412, together with Leptolyngbya sp. Greenland 10, formed a well-defined clade that was phylogenetically novel to the other described taxa (Figure 1), suggesting a new genus within the family Leptolyngbyaceae.

FIGURE 1
www.frontiersin.org

Figure 1. ML phylogenetic tree of 16S rRNA gene sequences. Strain no. in bold represent the strains identified in this study. Only bootstrap values > 50% (1,000 non-parametric replications) are indicated at nodes. Scale bar = 2% substitutions per site.

The sequence identity (Supplementary Table 1) indicated that the nearest neighbor of E412 was Leptolyngbya sp. Greenland 10 (96.2% identity), a poorly described strain isolated from the hot spring (56–61°C), Rømerfjord, Greenland (Roeselers et al., 2007), followed by Kovacikia (94.9–95.0% identity) and Pantanalinema (94.0% identity). The other focal taxa exhibited 89.3–93.9% identities of 16S rRNA gene to strain E412 (Supplementary Table 1). According to the recommended threshold of 16S rRNA gene identity for bacterial species (98–99%) or genera (94.5–95%) demarcation (Rodriguez-R et al., 2018), strain E412 was proposed to be categorized into a new genus within the family Leptolyngbyaceae. And Leptolyngbya sp. Greenland 10 was also assigned to this new genus but a different species based on the 16S rRNA gene identity. This was also supported by the ML topology of the 16S rRNA gene that Greenland 10 formed a distinct sister branch alongside strain E412 (Figure 1). Interestingly, the two strains within the proposed genus were both isolated from thermal habitats, but the isolation sources were geographically divergent and exhibited distinct geochemical characteristics (Roeselers et al., 2007; Tang et al., 2018a). Notably, the newly established genus offers a solid basis for further investigations or comparisons on numerous aspects, such as whether the proposed genus has a cosmopolitan distribution and all the strains within this genus are specific to thermal habitat.

Phylogeny of 16S-23S ITS

Additional to the 16S rRNA gene, the 16S-23S ITS region has also been commonly recommended to establish cyanobacterial ecotypes or species (Brito et al., 2017). Several genera included in the 16S rRNA phylogram were excluded in the ITS phylogenetic analysis due to sequence unavailability. As shown in Figure 2, strain E412 was placed into a well-separated branch in the ITS phylogram. Previously described genera were also recognized in the 16S-23S ITS phylogram and supported by robust bootstrap values. However, the ITS phylogram (Figure 2) showed inconsistent topology to that of the 16S rRNA gene (Figure 1). Previous reports have manifested that the sequences of 16S-23S ITS were extraordinarily divergent (Johansen et al., 2011; Tang et al., 2021). Furthermore, regions/domains within ITS were occasionally absent in some cyanobacterial strains, e.g., in the present study, conserved tRNAs were absent in Pantanalinema rosaneae (Vaz et al., 2015). Taken together, phylogenetic inference of ITS could result in erroneous taxonomic classification. Therefore, secondary structure analyses of domains in 16S-23S ITS were also carried out as an essential complement to the final taxonomy determination.

FIGURE 2
www.frontiersin.org

Figure 2. ML phylogenetic tree of 16S-23S ITS sequences. Strain no. in bold represent the strains identified in this study. Only bootstrap values > 50% (1,000 non-parametric replications) are indicated at nodes. Scale bar = 5% substitutions per site.

16S-23S ITS Secondary Structures

Strain E412 and representative strains of 13 focal genera within the family Leptolyngbyaceae were used for 16S-23S ITS secondary structure analysis. After removing the two highly conserved tRNAs (Ile and Ala) from ITS sequences, the remaining ITS sequences varied in length from 230 to 388 bp (Table 1). Such variation was primarily ascribed to the length differences in D1-D1′ (51–121 bp), V2 (7–90 bp), boxB (33–63 bp), and V3 (19–98 bp (Table 1). Consequently, the nucleotide differences of these domains among strains resulted in divergences of the secondary structures.

TABLE 1
www.frontiersin.org

Table 1. The length (bp) summary of regions within16S-23S ITS of Leptolyngbyaceae strains studied.

The inferred D1-D1′ helix of strain E412 (Supplementary Figure 1) was distinct from the other inferred structures. The most similar structure to D1-D1′ helix of strain E412 was that of O. javanensis. Both D1-D1′ helix of the two strains were mainly composed of a stem fragmented by asymmetrical/symmetrical loop, right/left bulge, and terminated with hairpin loop. But the structures of the two strains varied in the residue size and/or the number of loops, bulges, and terminal hairpins. In addition, the fragmented stems also differed in number and length between the two strains.

The hypothetical V2 helix of strain E412 (Supplementary Figure 2) was divergent from the other analyzed strains. The tremendous length variations were responsible for the divergence in secondary structures. The V2 helix of strain E412 was straightforward and comprised 5 stems, 4 symmetrical loops, and an 8-residue hairpin loop. Although K. muscicola, Leptodesmis sp., and L. circumcreta possessed similar sequence lengths with respect to the strain E412, the helices differed in the stem, loop, bulge, and hairpin.

A basal stem structure (AGCA-UGCU) was shared by boxB helices of all strains except for A. pantanalense (Supplementary Figure 3). Strain E412 showed similar residue length to several strains, but no boxB helix structures were analogous to that of strain E412. The boxB helix of strain E412 was mainly composed of a stem orderly fragmented by single base right bulge, single base left bulge, 10-residue symmetrical loop, and terminated with a 5-residue hairpin loop.

Strain E412 exhibited the longest V3 helix (98 residues) (Table 1), comprising 3 asymmetrical loops, 3 symmetrical loops, a single base left bulge, an 8-residue hairpin loop, and fragmented stems (Supplementary Figure 4). The V3 helix of strain E412 was distinct from those of the other strains, whereas a basal stem structure (GUC-GAC) was shared by all the strains. Although the helix length of K. muscicola (95 residues), N. sphagnicola (95 residues), S. timoleontis (94 residues), and S. rutilans (92 residues) was similar to that of strain E412, the structures differed from each other in terms of bulge, loop, hairpin, and stem.

Conclusively, the result of 16S-23S ITS secondary structure analysis and the phylogenetic inferences of 16S rRNA and 16S-23S ITS verified strain E412 as a new genus within the family Leptolyngbyaceae. Our results (Supplementary Figures 14) also demonstrated that the secondary structure analysis of D1-D1′, V2, boxB, and V3 helix was an effective tool for genus-level identification within the family Leptolyngbyaceae.

Genome-Based Analyses

Based on the genomes available on a public database, comparison analyses at the genomic level were performed among genera within Leptolyngbyaceae. Considerable divergences in genomes were observed among different genera as revealed by the ANI and AAI values (Table 2). Particularly, the ANI and AAI values between strain E412 and the other eight focal taxa were less than 79 and 69%, respectively. The results of genome-wide ANI and AAI conformed to the suggested values for genus (ANI < 83%, AAI ≤ 70%) delimitation (Walter et al., 2017; Jain and Rodriguez, 2018), further confirming the taxonomy delineation of a novel genus within the family Leptolyngbyaceae. However, it was reported that the classification of the prokaryotic genus using ANI or AAI might cause misleading results in some cases (Konstantinidis and Tiedje, 2005; Pannekoek et al., 2016). Therefore, the POCP specific for genus delineation was calculated to further verify the genus demarcation of strain E412. The POCP values (Table 3) between the E412 genome and focal taxa ranged from 35.7 to 49.3%, all within the threshold (<50%) for the definition of a prokaryotic genus (Qin et al., 2014). Additionally, the ML genomic phylogram (Figure 3) generated from the concatenated alignment of 845 single-copy genes showed a consistent topology to that of 16S rRNA, and again verified the conclusion that strain E412 belonged to a novel genus within Leptolyngbyaceae. Unfortunately, only some of the genera related to E412 had their genomes sequenced, resulting in a partial snapshot of genomic divergences among these organisms.

TABLE 2
www.frontiersin.org

Table 2. Values of ANI (Average Nucleotide Identity) and AAI (Average Amino acid Identity) among Leptolyngbyaceae genomes studied.

TABLE 3
www.frontiersin.org

Table 3. POCP values between strain E412 and representative species from Leptolyngbyaceae genera.

FIGURE 3
www.frontiersin.org

Figure 3. ML phylogenomic tree of concatenated protein alignment of single-copy genes shared by all genomes. Strain no. in bold represent the strain identified in this study. Bootstrap values (1,000 replications) are indicated at nodes. Scale bar = 5% substitutions per site.

Morphological Characteristics of Strain E412

Analysis of strain E412 with light microscopy revealed that trichomes were brown and exhibited coiled and tangled morphology (Figure 4A). The SEM and TEM analysis indicated unbranched trichomes composed of elongated cylindrical shaped cells, 1.2–1.8 μm in length and 0.8–1.1 μm in width. Constrictions were detected at the cross-walls of cells (Figures 4B,C). Centripetal invaginations of the cell wall separated individual cells of the filaments, but the intracellular connections between individual vegetative cells were not present (Figure 4C). The analysis of TEM micrographs indicated that five to six thylakoids in the parietal arrangement were present at the inner periphery of cells. Additionally, typical components of filamentous cyanobacteria, i.e., sheath, septum, phycobilisomes, and carboxysomes, were identified. The strain was also identified as a cyanophycin producer, and granules of this biopolymer were observed in the cytoplasm. Polyphosphate bodies and small lipid droplets were also identified (Figures 4C,D). Morphological comparison of strain E412 against other focus taxa from the family Leptolyngbyaceae (Table 4) revealed the strain’s closest morphological resemblance to Stenomitos and Kovacikia, in agreement with the phylogenetic allocation. All the three strains had single unbranched filaments, often entangled. Individual cells were approximately 1 μm in width, cylindrical, and slightly elongated. The cells of Leptothermofonsia and Kovacikia were typically not longer than 1.8 μm, as opposed to longer cells of Stenomitos. All the three strains were brown in the natural environment. Strain E412 can adapt to the green phenotype with the modification of light intensity and photoperiod. No such information was available for other strains. Among other focus taxa, only Leptodesmis was distinctly green and shown to be incapable of chromatic adaptation. Available morphological characteristics of focus taxa was collected and summarized in Table 4. Although strain E412 was also morphologically similar to Pseudanabaena spp. (Komárek and Anagnostidis, 2005), they were phylogenetically divergent from each other (Supplementary Figure 5).

FIGURE 4
www.frontiersin.org

Figure 4. Micrographs of strain E412. (A) Light microscopy image. (B) SEM image. (C,D) TEM images. Cb, carboxysome; Cg, cyanophycin granule; Ld, lipid droplet; P, polyphosphate body; Sh, sheath; Sp, septum; T, thylakoid membrane. Magnifications were 1,000× (A), 5,000× (B), 8,000× (C), and 12,000× (D).

TABLE 4
www.frontiersin.org

Table 4. Comparison of morphological features of Leptolyngbyaceae strains.

Physiological Characteristics of Strain E412

Strain E412 was physiologically characterized using different variants of the standard growth medium, BG-11. The strain exhibited active growth when sodium bicarbonate at the concentration ranging from 0.1 to 0.5 M was applied, suggesting that this form of inorganic carbon can be effectively utilized by the strain. Analysis of the impact of sulfur compounds showed that the strain was capable of utilizing 10 mM sulfates and incapable of utilizing equivalent concentrations of sulphites. The strain exhibited flexibility in the utilization of nitrogen sources. Both nitrate (3 mM) and nitrite (6 mM) resulted in the active growth of the strain. Strain E412 was also diazotrophic. The acetylene reduction assay indicated a functional nitrogenase capable of a steady increase of ethylene production during 72 h of the assay (Supplementary Figure 6).

Analysis of the pigment composition of strain E412 under different illumination conditions suggested a significant difference in their composition at white light and far-red light illumination and across the different photoperiods (Figure 5). The strain cultivated at 12L:12D photoperiod in low light exhibited a brown phenotype and pigmentation typical of type T2 (Six et al., 2007) due to increased phycoerythrin accumulation (Figures 5A,D). Meanwhile, cultivation in high light and far-red light under constant illumination resulted in the recomposition of the photosynthetic apparatus and pigmentation type T1 (Figures 5B,D). Therefore, the strain can adjust its photosynthetic apparatus through chromatic adaptation and was capable of far-red light (730 nm) utilization like other thermophilic filamentous strains (Gan and Bryant, 2015). Meanwhile, the concentration of chlorophyll b increased in the cells grown in constant white light illumination (Figure 5C), whereas all the other pigments were relatively constant across the tested cultivation conditions (Figures 5C,D).

FIGURE 5
www.frontiersin.org

Figure 5. Chromatic adaptation of strain E412. (A) Light microscopy image (1,000×) of strain E412 grown in white fluorescent light, 12L: 12D photoperiod at 45 μmol m–2 s–1, (B) light microscopy image (1,000×) of strain E412 grown in far-red (730 nm) LED light, 24L: 0D photoperiod at 25 μmol m–2 s–1, (C) composition of photosynthetic lipophilic pigments of strain E412 grown at different illumination conditions, the figure represents a mean of three biological replicates, (D) composition of photosynthetic water-soluble pigments of strain E412 grown at different illumination conditions, the figure represents a mean of three biological replicates.

General Genomic Features of Strain E412

The combined utilization of ONT and Illumina sequencing systems resulted in the complete genome of strain E412. This genome consisted of a single circular chromosome with a size of 6,426,061 bp (GC content, 50.8%). Two ribosomal RNA (rrn) operons, 74 tRNA genes and 7,904 protein-coding sequences (CDS), were predicted in the E412 chromosome (Supplementary Table 2). Notably, 4,015 out of 7,904 (50.8%) protein-coding genes were annotated as hypothetical proteins. Identifying such a high percentage of hypothetical protein in the E412 genome was not surprising and common to the genomes of thermophilic cyanobacteria (Cheng et al., 2020; Tang et al., 2021, 2022).

The GO analysis indicated that the CDS identified in the E412 genome were assigned to a wide range of functional categories (Supplementary Figure 7). The majority of GO terms were assigned to the biological process. Membrane, catalytic activity, and metabolic process were the most abundant GO term for cellular component, molecular function, and biological process, respectively. The most abundant GO terms distribution pattern was also noticed in another thermophilic strain, T. sichuanensis A183 isolated from a hot spring of Ganzi prefecture, China (Tang et al., 2021). However, the GO terms of the A183 genome were mostly concentrated only in eight functional categories. Meanwhile, their distribution in E412 was across many categories. Further KEGG pathway analysis showed that most genes (66.2%) in the E412 genome were distributed in the sub-category of metabolism (Supplementary Figure 8), suggesting that these metabolism-related genes may be crucial for this strain to survive in the oligotrophic aquatic environments of its origin.

Horizontal Gene Transfer

Based on a stringent screening of BLASP results, 1,944 out of 7,904 (24.6%) annotated protein-coding genes were identified as putatively acquired (Supplementary Table 3). Many putatively acquired genes were also notable in other thermophilic cyanobacteria, e.g., Thermosynechococcus sp. CL-1 (19%) (Cheng et al., 2020). The putatively acquired genes through HGT may contribute to the genome expansion and acquisition of new functions and conceivably acclimation to variable environments of these thermophilic strains. For instance, 22 genes annotated as heat shock proteins were identified as putatively acquired (Supplementary Table 3), which might play a crucial role in adapting to the thermal environment. The putatively acquired gene, flavodoxin gene (fldA), may substitute the function of ferredoxin in the photosynthetic electron transport chain under iron-deficient conditions (Cao et al., 2020). Twelve out of 29 genes annotated as circadian input kinase A (cikA) were identified as acquired genes from diverse donors (Supplementary Table 3). Among the acquired cikA genes, three showed very low protein similarities (38.4–40.5%) to the kinase of Synechococcus elongatus PCC 7942, which was experimentally demonstrated as a critical factor for entraining the clock in the cyanobacterium (Ivleva et al., 2006). The distinct proteins of acquired cikA genes in the E412 genome implied that alternative circadian rhythms could be used to time metabolic and behavioral events with the external environment through entrainment. Concerning function, the HGT candidates in the E412 genome were assigned to a wide range of functional categories as revealed by both GO and KEGG analysis (Supplementary Figures 7, 8). The most represented GO term was catalytic activity, followed by metabolic process and cellular process. Interestingly, the majority of HGT candidates (69.4%) suggested by the KEGG analysis were categorized as being related to metabolic functions. This result was consistent with the complexity hypothesis that fundamental genes are less likely than peripheral and operational genes to be horizontally transferred (Thomas and Nielsen, 2005).

We noticed that the criteria for HGT detection were distinct among reported studies. For example, genus-level (Cheng et al., 2020) or cyanobacteria-level (Chen et al., 2021) were separately customized to identify acquired genes in light of research objectives. Although modified approaches have been employed in this study or other studies, taxon-sampling biases may exist in the high-throughput BLASP-based approach. In addition, data availability was also an important factor affecting the establishment of the analysis pipeline, since only one genome sequence was obtained for this genus. More genome sequences of this genus are required for further HGT analysis, such as identification of recently acquired genes.

Mobile Genetic Elements

A total of 431 ISs (insertion sequences) corresponding to 67 different ISs were identified in the E412 genome. The IS630 family (67.29%) was dominant among the observed IS families, followed by the IS1 family (9.74%) and the IS4 family (7.66%). Besides, genes encoding transposase (86) were also noticed in abundance (Supplementary Table 2), indicating that intragenomic rearrangements might contribute much to the genetic plasticity of the strain.

One incomplete prophage loci was predicted in the chromosome, phiE412 (9.6 kb; positions 1,993,638–2,003,299). Thirteen phage-related genes were identified in the phiE412 region (Supplementary Table 4). No genes corresponding to DNA synthesis were found in the prophage loci, suggesting that this region was replication-defective. Repeats that constituted the core regions of phage attachment were not found to flank phiE412. Thus, this partial prophage loci probably was not functional. Clustered regularly interspaced short palindromic repeats (CRISPRs) were reported to function in the interference pathway to maintain genome integrity (Gasiunas et al., 2014). In the E412 chromosome, we detected three CRISPRs with high evidence levels and three Cas clusters assigned to type IA and III-D (Supplementary Table 5). However, the E412 genome had only one CRISPR-Cas array (type III-D), which was consistent with the finding in thermophilic strain T. sichuanensis A183 (Tang et al., 2021). Besides, several other genes typically associated with this system, e.g., cmr2, were also found to flank the array. The results indicated that this CRISPR-Cas interference system might function to limit HGT (Marraffini and Sontheimer, 2008).

Thermotolerance

As a microorganism living in a hot spring (temperature: 67.2°C), strain E412 must possess survival strategies to adapt to the thermal environment. The heat shock proteins (Hsps) play a crucial role in managing protein concentration, conformation, and subcellular location, especially when numerous stresses such as high temperature are applied. Expectedly, the E412 genome had numerous homologs of genes coding for Hsps (Supplementary Table 2). The homologs of the clp family (clpB, -C, -P, -S, -X) belonged to the Hsp100 family and helped maintain protein homeostasis (Labreck et al., 2017). The homologs of htpG protein of the Hsp90 family were likely to be primarily involved in protecting the photosynthetic machinery from heat stress (Takeshi et al., 2010). The homologs belonging to the Hsp70 family, mainly including dnaK and dnaJ, were abundant in the E412 genome. However, dnaK and dnaJ proteins might have different functions, and only part of them was responsible for thermotolerance (Duppre et al., 2011). Besides, a homolog of grpE, as a cofactor of the Hsp70 family, may help to prevent the aggregation of heat-denatured proteins (Schneider, 2011). As for the Hsp60 family, two distinct homologs of groEL genes, also referred to as the groE chaperone machinery, were identified in the E412 genome, and only one of them formed groESL operon with groES. This composition was similar to that of T. sichuanensis A183, but distinguished from that of Gloeobacter PCC 7421, the genome of which contains two groESL operons. Thus, we speculated that one groESL operon of the two strains lost the groES during the evolutionary process.

Photosynthesis

The E412 genome comprised the homologs of complete sets of genes coding for both photosystem I (14 genes; some with additional homologs) and photosystem II (22 genes; some with additional homologs) (Supplementary Table 2). The genome also had homologs of genes coding for phycobilisome proteins, allophycocyanin, phycocyanin, and phycoerythrin. The genes related to phycoerythrin (blue-light absorbing) might favor the strain to acclimate to low light intensity and oligotrophic environment (Tang et al., 2019). These genomic features strongly support the described experimental evidence for the chromatic adaptation capacity of this strain.

Surprisingly, the E412 genome harbored no homologs of genes encoding photoprotective proteins, such as flavodiiron proteins and orange carotenoid proteins. This result suggested that strain E412 probably copes with photodamage by alternative acclimation mechanisms in light of high altitude and high-light exposure in the environment of its origin. Interestingly, flavodoxin gene (fldA) and iron stress-inducible proteins (isiA) were found in the E412 genome. The presence of the two genes indicated that under iron-deficient conditions, strain E412 might use flavodoxins to substitute ferredoxin in electron transfer and to increase the light-absorbing efficiency of PSI in the form of PSI-isiA-flavodoxin supercomplex (Cao et al., 2020).

Essential genes required for the Carbon-dioxide Concentrating Mechanism (CCM) were present in the E412 genome (Supplementary Table 2). The uptake of gaseous CO2 systems in cyanobacteria relied on NADPH dehydrogenase (NDH-1) complexes. The E412 genome included two NDH-1 complexes: a low-CO2 inducible high-affinity NDH-13 complex encoded by ndhD3, ndhF3, and cupA (chpY) genes, and a constitutive low-affinity NDH-14 complex encoded by ndhD4, ndhF4, and cupB (chpX) genes. In addition, our previous study indicated that strain E412 can utilize bicarbonate as a one of the sources of inorganic carbon (Tang et al., 2018a). The genome analysis verified the experimental results through the presence of homologs of low affinity, high flux, Na+-dependent bicarbonate transporters (bicA1 and bicA2) and several ABC-type bicarbonate transporters. Moreover, the genome had 21 Hat/HatR gene homologs encoding High-affinity carbon uptake protein. A similar abundance of Hat/HatR gene homologs was noticed in many strains, e.g., Acaryochloris marina and Lyngbya aestuarii BL J. On the contrary, strains like Synechococcus sp. WH8102 and Synechocystis sp. PCC 6,803 contain none. The prosperity of the Hat/HatR gene in E412 may be related to the mat habit of its origin, where diffusion becomes the primary transport mechanism for substrates and products of metabolism and can cause diffusion limitations to photosynthesis (Kothari et al., 2013).

Nitrogen Metabolism

Cyanobacteria can utilize various organic and inorganic nitrogen sources, a feature vital for survival in oligotrophic environments (Esteves-Ferreira et al., 2018). The E412 genome had the homologs of genes encoding nitrogenases, including nifB, nifE, nifH, nifN, nifW, and nifX (Supplementary Table S2). The genomic composition suggested that strain E412 was a nitrogen-fixing cyanobacterium, further verified by nitrogenase activity test (Supplementary Figure 6). Interestingly, strain E412 hosted a homolog of the gene hetR, the master regulatory gene involved in heterocyst formation, although it was not found to develop heterocysts under conditions tested.

The experimental result showed the utilization of both nitrate (3 mM) and nitrite (6 mM) by strain E412 for active growth. Tracing back to the genetic basis, the E412 genome possessed the homologs of genes encoding ABC-type nitrate transport system, which were clustered and oriented in the same direction with two essential nitrogen-related genes encoding ferredoxin-nitrite reductase (nir) and ferredoxin-nitrate reductase (nar), respectively (Supplementary Table 2). This result was in accordance with many freshwater cyanobacterial strains (Maeda et al., 2015). In addition, this strain had the homologs of an ammonium transporter and homologs of both glutamine synthetase and glutamine amidotransferase, which played the primary role of ammonium ion assimilation (Muro-Pastor et al., 2005). The homolog of the gene coding for ntcB was also present in this strain, which may function as a nitrogen control system to efficiently utilize intracellular resources in adaptation to changing nitrogen availability in the natural environment (Aichi and Omata, 1997).

Conclusion

In the present study, we have morphologically, physiologically, phylogenetically, and taxogenomically characterized a novel thermophilic cyanobacterium, strain E412 isolated from Lotus Lake hot spring situated in Ganzi prefecture, China. Results of polyphasic analysis suggested that strain E412 was a novel genus within the family Leptolyngbyaceae. Consequently, we have proposed a new genus Leptothermofonsia sichuanensis Daroch, Tang and Shah et al gen. sp. nov. as a best described to date representative of this taxon and proposed its delineation. Phylogenomic inference, average nucleotide/amino-acid identity, and the POCP between genomes supported the delineation of genus Leptothermofonsia. Furthermore, the obtained complete genome of Leptothermofonsia E412 facilitated the elucidation of genetic basis regarding genes related to thermotolerance, photosynthesis, and nitrogen metabolism. Furthermore, experiments regarding chromatic adaptation capacity and utilization of sodium bicarbonate, sulfur, and nitrogen compounds confirmed these physiological characteristics as indicated by the genome sequence. Additionally, approximately a quarter of the annotated protein-coding genes in the E412 genome could be acquired through HGT and may impact genome expansion and habitat adaptation. Overall, the complete genome of strain E412 provides the first insight into the genomic feature of the genus Leptothermofonsia and lays the foundations for future global ecogenomic and geogenomic studies.

Taxonomic Treatment and Description of Leptothermofonsia sichuanensis Daroch, Tang, and Shah et al. gen. nov

Phylum: Cyanobacteria

Order: Synechococcales

Family: Leptolyngbyaceae

Description: Cells brownish colored; filamentous; coiled or tangled filaments; sheath colorless, very thin, not lamellate; trichome slightly constricted and composed of cylindrical, elongated cells; narrow trichome; apical cells are round; cells longer than broad; no false branching (Figure 4A). Trichomes unbranched and composed of cylindrical, elongated cells, 1.2–1.8 μm in length and 0.8–1.1 μm in width. Cross-walls of the cells contained constrictions (Figures 4B,C). Centripetal invagination of the cell wall divided individual cells, and intracellular connections between vegetative cells were not present (Figure 4C). Five to six thylakoids in the parietal arrangement were located parallel at the cells’ inner periphery. Sheath, septum, phycobilisome, carboxysomes, cyanophycin granule, lipid droplets, and polyphosphate bodies were all present in the cytoplasm (Figures 4C,D).

Type strain: is E412 (= FACHB-2490).

Type species: Leptothermofonsia sichuanensis Daroch, Tang, and Shah et al. gen. nov. (see below).

Etymology: “Lepto” exhibiting morphology typical to the members of Leptolynbyaceae family, “thermo” similar to thermophilic (high temperature tolerant), “fonsia,” genus epithet derived from the Latin word fons meaning spring, since both representative of the genus to date, i.e., E412 and Greenland 10, are hot spring isolates; “sichuanensis” species epithet derives from the name of collection province.

Type locality: Thermal spring, Lotus Lake in Ganzi Prefecture of Sichuan Province, China.

Ecology of type locality: the sample occurred as a macroscopic brown-green submerged in the pond. Sample collection was done on 12.05.2016, with the humidity close to 71%. At the time of collection, the air temperature was 15°C, and the light intensity was around 1,000 lux. The pH of the spring was 6.32, and the concentration of total dissolved solids was 447 mmol L–1.

Habitat: Thermal springs in Ganzi Prefecture of Sichuan Province, China (30°05′14″ N, 101°56′55″ E). This species (strain E412) is capable of bicarbonate assimilation (Tang et al., 2018a). The genome has the homologs of genes encoding nitrogenases, including nifB, nifE, nifH, nifN, nifW, and nifX (Supplementary Table 2). The strain was experimentally confirmed as nitrogen-fixing. Interestingly, this strain hosted a homolog of the gene hetR, the master regulatory gene involved in the formation of heterocyst, although it does not develop heterocyst. Strain shows the capacity of chromatic adaptation by modifying the content of phycobilisomes depending on light conditions.

Holotype here designated: the culture of Leptothermofonsia sichuanensis Daroch, Tang and Shah et al. gen. sp. nov. was initially denoted and deposited in Peking University Algae Collection as PKUAC-SCTE412 has also been deposited in the Freshwater Algae Culture Collection at the Institute of Hydrobiology (FACHB-collection) with accession number FACHB-2490 as Leptolyngbya species after identification and authentication based on the full-length sequencing of the 16S rRNA gene along with folding of the secondary structures of the 16S-23S ITS region. After proper identification and authentication, the culture is maintained in the FACHB under the accession number FACHB-2490.

Data Availability Statement

The complete genome sequence reported in this study has been deposited in GenBank with an accession number CP072600.

Author Contributions

JT: conceptualization, methodology, validation, formal analysis, investigation, data curation, writing—original draft, writing—review and editing, visualization, supervision, project administration, and funding acquisition. MS: investigation, methodology, validation, and writing—original draft. DY: formal analysis, software, and data curation. YJ: formal analysis, software, data curation, and writing—review and editing. LD, MW, and MMW: methodology, software, and data analysis. KZ: software and data curation. LL: formal analysis, investigation, data curation, and writing—original draft. ML: formal analysis, investigation, and data curation. KW: conceptualization, methodology, writing—review and editing. MD: conceptualization, methodology, resources, data curation, writing—original draft, writing-review and editing, supervision, project administration, and funding acquisition. All authors contributed to the article and approved the submitted version.

Funding

This research was funded by the National Natural Science Foundation of China (31970092 and 32071480), the Antibiotics Research and Re-evaluation Key Laboratory of Sichuan Province (ARRLKF21-03), the Shenzhen Fundamental Research Program (GXWD20201231165807007-20200806170221001), and Tenure-Track Fund to MD. Funding bodies had no influence over the design and execution of this research.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.765105/full#supplementary-material

Footnotes

  1. ^ https://www.genome.jp/kegg/ko.html
  2. ^ http://enve-omics.ce.gatech.edu/g-matrix/

References

Aichi, M., and Omata, T. (1997). Involvement of NtcB, a LysR family transcription factor, in nitrite activation of the nitrate assimilation operon in the cyanobacterium synechococcus sp. strain PCC 7942. J. Bacteriol. 179, 4671–4675. doi: 10.1128/jb.179.15.4671-4675.1997

PubMed Abstract | CrossRef Full Text | Google Scholar

Alcorta, J., Alarcón-Schumacher, T., Salgado, O., and Díez, B. (2020). Taxonomic novelty and distinctive genomic features of hot spring cyanobacteria. Front. Genet. 11:568223. doi: 10.3389/fgene.2020.568223

PubMed Abstract | CrossRef Full Text | Google Scholar

Aramaki, T., Blanc-Mathieu, R., Endo, H., Ohkubo, K., Kanehisa, M., Goto, S., et al. (2019). KofamKOALA: KEGG ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 36, 2251–2252. doi: 10.1093/bioinformatics/btz859

PubMed Abstract | CrossRef Full Text | Google Scholar

Becerra-Absaln, I., Johansen, J. R., Muoz-Martn, M. A., and Montejano, G. (2018). Chroakolemma gen. nov. (Leptolyngbyaceae, cyanobacteria) from soil biocrusts in the semi-desert central region of Mexico. Phytotaxa 367, 201–218.

Google Scholar

Bennett, A., and Bogorad, L. (1973). Complementary chromatic adaptation in a filamentous blue-green alga. J. Cell Biol. 58, 419–435. doi: 10.1083/jcb.58.2.419

PubMed Abstract | CrossRef Full Text | Google Scholar

Brito, Â, Ramos, V., Mota, R., Lima, S., Santos, A., Vieira, J., et al. (2017). Description of new genera and species of marine cyanobacteria from the portuguese Atlantic coast. Mol. Phylogenet. Evolu. 111, 18–34. doi: 10.1016/j.ympev.2017.03.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, P., Cao, D., Si, L., Su, X., Tian, L., Chang, W., et al. (2020). Structural basis for energy and electron transfer of the photosystem I–IsiA–flavodoxin supercomplex. Nat. Plants 6, 167–176. doi: 10.1038/s41477-020-0593-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, M.-Y., Teng, W.-K., Zhao, L., Hu, C.-X., Zhou, Y.-K., Han, B.-P., et al. (2021). Comparative genomics reveals insights into cyanobacterial evolution and habitat adaptation. ISME J. 15, 211–227. doi: 10.1038/s41396-020-00775-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, Y. I., Lin, C., Chiu, Y. F., Hsueh, H. T., and Chu, H. A. (2020). Comparative genomic analysis of a novel strain of taiwan hot-spring cyanobacterium Thermosynechococcus sp. CL-1. Front. Microbiol. 11:82. doi: 10.3389/fmicb.2020.00082

PubMed Abstract | CrossRef Full Text | Google Scholar

Conesa, A., Götz, S., Garcíagómez, J. M., Terol, J., Talón, M., and Robles, M. (2005). Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21, 3674–3676. doi: 10.1093/bioinformatics/bti610

PubMed Abstract | CrossRef Full Text | Google Scholar

Cordeiro, R. I. P., Luz, R., Vasconcelos, V., Gonalves, V., and Fonseca, A. (2020). Cyanobacteria phylogenetic studies reveal evidence for polyphyletic genera from thermal and freshwater habitats. Diversity 12:298.

Google Scholar

Couvin, D., Bernheim, A., Toffano-Nioche, C., Touchon, M., Michalik, J., Néron, B., et al. (2018). CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res. 46, W246–W251. doi: 10.1093/nar/gky425

PubMed Abstract | CrossRef Full Text | Google Scholar

David, A., Grant, J. R., Ana, M., Tanvir, S., Allison, P., Yongjie, L., et al. (2016). PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 44, W16–W21. doi: 10.1093/nar/gkw387

PubMed Abstract | CrossRef Full Text | Google Scholar

Dere, S., güneş, T., and Sivaci, R. (1998). Spectrophotometric determination of chlorophyll-a, b and total carotenoid contents of some algae species using different solvents. Turkish J. Bot. 22, 13–17.

Google Scholar

Duppre, E., Rupprecht, E., and Schneider, D. (2011). Specific and promiscuous functions of multiple DnaJ proteins in Synechocystis sp. PCC 6803. Microbiology 157, 1269–1278. doi: 10.1099/mic.0.045542-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Dvorak, P., Hindak, F., Hasler, P., Hindakova, A., and Poulickova, A. (2014). Morphological and molecular studies of Neosynechococcus sphagnicola, gen. et sp. nov. (Cyanobacteria, Synechococcales). Phytotaxa 170, 24–34.

Google Scholar

Dvorak, P., Jahodarova, E., Hašler, P., Gusev, E., and Poulíčková, A. (2015). A new tropical cyanobacterium Pinocchia polymorpha gen. et sp. nov. derived from the genus Pseudanabaena. J. Czech Phycol. Soc. 15, 113–120.

Google Scholar

Esteves-Ferreira, A. A., Inaba, M., Fort, A., Araújo, W. L., and Sulpice, R. (2018). Nitrogen metabolism in cyanobacteria: metabolic and molecular control, growth consequences and biotechnological applications. Crit. Rev. Microbiol. 44, 541–560. doi: 10.1080/1040841X.2018.1446902

PubMed Abstract | CrossRef Full Text | Google Scholar

Gan, F., and Bryant, D. A. (2015). Adaptive and acclimative responses of cyanobacteria to far-red light. Environ. Microbiol. 17, 3450–3465. doi: 10.1111/1462-2920.12992

PubMed Abstract | CrossRef Full Text | Google Scholar

Gasiunas, G., Sinkunas, T., and Siksnys, V. (2014). Molecular mechanisms of CRISPR-mediated microbial immunity. Cell. Mol. Life Sci. 71, 449–465. doi: 10.1007/s00018-013-1438-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Guindon, S., Dufayard, J.-F., Lefort, V., Anisimova, M., Hordijk, W., and Gascuel, O. (2010). New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321. doi: 10.1093/sysbio/syq010

PubMed Abstract | CrossRef Full Text | Google Scholar

Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q., and Vinh, L. S. (2018). UFBoot2: improving the ultrafast bootstrap approximation. Mol. Biol. Evolu. 35, 518–522. doi: 10.1093/molbev/msx281

PubMed Abstract | CrossRef Full Text | Google Scholar

Iteman, I., Rippka, R., Tandeau, D. M. N., and Herdman, M. (2000). Comparison of conserved structural and regulatory domains within divergent 16S rRNA-23S rRNA spacer sequences of cyanobacteria. Microbiology 146, 1275–1286. doi: 10.1099/00221287-146-6-1275

PubMed Abstract | CrossRef Full Text | Google Scholar

Ivleva, N., Gao, T., Liwang, A., and Golden, S. (2006). Quinone sensing by the circadian input kinase of the cyanobacterial circadian clock. Proc. Natl. Acad. Sci. U.S.A. 103, 17468–17473. doi: 10.1073/pnas.0606639103

PubMed Abstract | CrossRef Full Text | Google Scholar

Jahodářová, E., Dvorak, P., Hasler, P., and PoulícKová, A. (2017). Revealing hidden diversity among tropical cyanobacteria : the new genus Onodrimia (Synechococcales, Cyanobacteria) described using the polyphasic approach. Phytotaxa 326, 28–40.

Google Scholar

Jain, C., and Rodriguez, R. L. (2018). High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat. Commun. 9:5114.

Google Scholar

Johansen, J. R., Kovacik, L., Casamatta, D. A., Iková, K. F., and Kaštovský, J. (2011). Utility of 16S-23S ITS sequence and secondary structure for recognition of intrageneric and intergeneric limits within cyanobacterial taxa: Leptolyngbya corticola sp. nov. (Pseudanabaenaceae. Cyanobacteria). Nova Hedwigia 92, 283–302.

Google Scholar

Komárek, J., and Anagnostidis, K. (2005). Cyanoprokaryota 2. Teil/2ndPart : Oscillatoriales. München: SpringerSpektrum.

Google Scholar

Komárek, J., Kaštovský, J., Mares, J., and Johansen, J. (2014). Taxonomic classification of cyanoprokaryotes (cyanobacterial genera) 2014, using a polyphasic approach. Preslia 86, 295–335.

Google Scholar

Komárek, J., and Lukavský, J. (1988). Arthronema, a new cyanophyte genus from Afro-Asian deserts. Arch. Fur Hydrobiol. 50, 249–267.

Google Scholar

Konstantinidis, K. T., and Tiedje, J. M. (2005). Towards a genome-based taxonomy for prokaryotes. J. Bacteriol. 187, 6258–6264. doi: 10.1128/JB.187.18.6258-6264.2005

PubMed Abstract | CrossRef Full Text | Google Scholar

Kothari, A., Vaughn, M., and Garcia-Pichel, F. (2013). Comparative genomic analyses of the cyanobacterium, lyngbya aestuarii BL J, a powerful hydrogen producer. Front. Microbiol. 4:363. doi: 10.3389/fmicb.2013.00363

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, S., Stecher, G., and Tamura, K. (2016). MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874. doi: 10.1093/molbev/msw054

PubMed Abstract | CrossRef Full Text | Google Scholar

Labreck, C. J., Shannon, M., Viola, M. G., Joseph, C., and Camberg, J. L. (2017). The protein chaperone ClpX targets native and non-native aggregated substrates for remodeling, disassembly, and degradation with ClpP. Front. Mol. Biosci. 4:26. doi: 10.3389/fmolb.2017.00026

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics 25, 1754–1760. doi: 10.1093/bioinformatics/btp324

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, L., Stoeckert, C. J. Jr., and Roos, D. S. (2003). OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 13, 2178–2189. doi: 10.1101/gr.1224503

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, M., Cheng, L., Tang, J., and Daroch, M. (2021). Molecular components of nitrogen fixation gene cluster and associated enzymatic activities of non-heterocystous thermophilic cyanobacterium Thermoleptolyngbya sp. Life 11:640. doi: 10.3390/life11070640

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, X., and Li, R. (2016). Limnolyngbya circumcreta gen. & comb. nov. (Synechococcales, Cyanobacteria) with three geographical (provincial) genotypes in China. Phycologia 55, 478–491.

Google Scholar

Lowe, T. M., and Eddy, S. R. (1997). tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964. doi: 10.1093/nar/25.5.955

PubMed Abstract | CrossRef Full Text | Google Scholar

Maeda, S.-i, Murakami, A., Ito, H., Tanaka, A., and Omata, T. (2015). Functional characterization of the FNT family nitrite transporter of marine picocyanobacteria. Life 5, 432–446. doi: 10.3390/life5010432

PubMed Abstract | CrossRef Full Text | Google Scholar

Marraffini, L. A., and Sontheimer, E. J. (2008). CRISPR interference limits horizontal gene transfer in staphylococci by targeting DNA. Science 322, 1843–1845. doi: 10.1126/science.1165771

PubMed Abstract | CrossRef Full Text | Google Scholar

Mathews, D. H. (2014). RNA secondary structure analysis using RNAstructure. Curr. Protocols Bioinform. 46, 1–25.

Google Scholar

Minh, B. Q., Schmidt, H. A., Chernomor, O., Schrempf, D., Woodhams, M. D., von Haeseler, A., et al. (2020). IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evolu. 37, 1530–1534.

Google Scholar

Miscoe, L. H., Johansen, J. R., Kociolek, J. P., Lowe, R. L., Vaccarino, M. A., Pietrasiak, N., et al. (2016). The diatom flora and cyanobacteria from caves on kauai, hawaii. II. novel cyanobacteria from caves on kauai, hawaii. Bibliotheca Phycologica 58, 3–4.

Google Scholar

Muro-Pastor, M. I., Reyes, J. C., and Florencio, F. J. (2005). Ammonium assimilation in cyanobacteria. Photosynthesis Res. 83, 135–150.

Google Scholar

Pannekoek, Y., Qi-Long, Q., Zhang, Y.-Z., and van der Ende, A. (2016). Genus delineation of chlamydiales by analysis of the percentage of conserved proteins justifies the reunifying of the genera chlamydia and chlamydophila into one single genus chlamydia. Pathogens Dis. 74:ftw071. doi: 10.1093/femspd/ftw071

PubMed Abstract | CrossRef Full Text | Google Scholar

Patel, A., Matsakas, L., Rova, U., and Christakopoulos, P. (2019). A perspective on biotechnological applications of thermophilic microalgae and cyanobacteria. Bio. Technol. 278, 424–434. doi: 10.1016/j.biortech.2019.01.063

PubMed Abstract | CrossRef Full Text | Google Scholar

Qin, Q., Xie, B., Zhang, X., Chen, X., Zhou, B., Zhou, J., et al. (2014). A proposed genus boundary for the prokaryotes based on genomic insights. J. Bacteriol. 196, 2210–2215.

Google Scholar

Raabova, L., Kovacik, L., Elster, J., and Strunecky, O. (2019). Review of the genus phormidesmis (cyanobacteria) based on environmental, morphological, and molecular data with description of a new genus leptodesmis. Phytotaxa 395, 1–16.

Google Scholar

Rodriguez-R, L. M., Santosh, G., Harvey, W. T., Ramon, R.-M., Tiedje, J. M., Cole, J. R., et al. (2018). The microbial genomes atlas (MiGA) webserver: taxonomic and gene diversity analysis of archaea and bacteria at the whole genome level. Nuclc Acids Res. 46, W282–W288. doi: 10.1093/nar/gky467

PubMed Abstract | CrossRef Full Text | Google Scholar

Roeselers, G., Norris, T. B., Castenholz, R. W., Rysgaard, S., Glud, R. N., Kühl, M., et al. (2007). Diversity of phototrophic bacteria in microbial mats from arctic hot springs (greenland). Environ. Microbiol. 9, 26–38. doi: 10.1111/j.1462-2920.2006.01103.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Schneider, D. (2011). Thermostability of two cyanobacterial GrpE thermosensors. Plant Cell Physiol. 52, 1776–1785. doi: 10.1093/pcp/pcr116

PubMed Abstract | CrossRef Full Text | Google Scholar

Sciuto, K., and Moro, I. (2016). Detection of the new cosmopolitan genus thermoleptolyngbya (Cyanobacteria, Leptolyngbyaceae) using the 16S rRNA gene and 16S–23S ITS region. Mol. Phylogenet. Evolu. 105, 15–35. doi: 10.1016/j.ympev.2016.08.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Shalygin, S., Shalygina, R., Redkina, V., Gargas, C., and Johansen, J. (2020). Description of stenomitos kolaenensis and S. hiloensis sp. nov. (Leptolyngbyaceae, Cyanobacteria) with an emendation of the genus. Phytotaxa 440, 108–128.

Google Scholar

Six, C., Thomas, J.-C., Garczarek, L., Ostrowski, M., Dufresne, A., Blot, N., et al. (2007). Diversity and evolution of phycobilisomes in marine synechococcus spp.: a comparative genomics study. Genome Biol. 8:R259. doi: 10.1186/gb-2007-8-12-r259

PubMed Abstract | CrossRef Full Text | Google Scholar

Soares, F., Tiago, I., Trovo, J., Coelho, C., and Portugal, A. (2019). Description of myxacorys almedinensis sp. nov. (Synechococcales, Cyanobacteria) isolated from the limestone walls of the old cathedral of coimbra, portugal (UNESCO world heritage site). Phytotaxa 419, 77–90.

Google Scholar

Song, G., Jiang, Y., and Li, R. (2015). Scytolyngbya timoleontis, gen. et sp. nov. (Leptolyngbyaceae, Cyanobacteria): a novel false branching cyanobacteria from China. Phytotaxa 224, 72–84.

Google Scholar

Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evolu. 30, 772–780. doi: 10.1093/molbev/mst010

PubMed Abstract | CrossRef Full Text | Google Scholar

Takeshi, S., Shun, M., and Erika, K. (2010). HtpG, the prokaryotic homologue of Hsp90, stabilizes a phycobilisome protein in the cyanobacterium Synechococcus elongatus PCC 7942. Mol. Microbiol. 76, 576–589. doi: 10.1111/j.1365-2958.2010.07139.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, J., Du, L., Li, M., Yao, D., Waleron, M., Waleron, K. F., et al. (2022). Characterization of a novel hot-spring cyanobacterium leptodesmis sichuanensis sp. nov. and genomic insights of molecular adaptations into its habitat. Front. Microbiol. 12:739625. doi: 10.3389/fmicb.2021.739625

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, J., Du, L.-M., Liang, Y.-M., and Daroch, M. (2019). Complete genome sequence and comparative analysis of Synechococcus sp. CS-601 (SynAce01), a cold-adapted cyanobacterium from an oligotrophic antarctic habitat. Int. J. Mol. Sci. 20:152. doi: 10.3390/ijms20010152

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, J., Liang, Y., Jiang, D., Li, L., Luo, Y., Shah, M. M. R., et al. (2018b). Temperature-controlled thermophilic bacterial communities in hot springs of western sichuan, china. BMC Microbiol. 18:134. doi: 10.1186/s12866-018-1271-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, J., Jiang, D., Luo, Y., Liang, Y., Li, L., Shah, M. M. R., et al. (2018a). Potential new genera of cyanobacterial strains isolated from thermal springs of western sichuan. China. Algal Res. 31, 14–20.

Google Scholar

Tang, J., Li, L., Li, M., Du, L., Shah, M. R., Waleron, M., et al. (2021). Description, taxonomy, and comparative genomics of a novel species, Thermoleptolyngbya sichuanensis sp. nov., isolated from hot springs of ganzi, sichuan, China. Front. Microbiol. 12:696102. doi: 10.3389/fmicb.2021.696102

PubMed Abstract | CrossRef Full Text | Google Scholar

Tatiana, T., Michael, D. C., Azat, B., Vyacheslav, C., Nawrocki, E. P., Leonid, Z., et al. (2016). NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 44, 6614–6624.

Google Scholar

Taton, A., Grubisic, S., Ertz, D., Hodgson, D. A., Piccardi, R., Biondi, N., et al. (2010). Polyphasic study of Antarctic cyanobacterial strains. J. Phycol. 42, 1257–1270.

Google Scholar

Taton, A., Wilmotte, A., Smarda, J., Elster, J., and Komarek, J. (2011). Plectolyngbya hodgsonii: a novel filamentous cyanobacterium from Antarctic lakes. Polar Biol. 34, 181–191.

Google Scholar

Thomas, C. M., and Nielsen, K. M. (2005). Mechanisms of, and barriers to, horizontal gene transfer between Bacteria. Nat. Rev. Microbiol. 3, 711–721. doi: 10.1038/nrmicro1234

PubMed Abstract | CrossRef Full Text | Google Scholar

Thomazeau, S., Houdan-Fourmont, A., Couté, A., Duval, C., Couloux, A., Rousseau, F., et al. (2010). The contribution of sub-saharan african strains to the phylogeny of cyanobacteria: focusing on the nostocaceae (nostocales, cyanobacteria). J. Phycol. 46, 564–579.

Google Scholar

Turland, N., Wiersema, J., Barrie, F. R., Greuter, W., and Smith, G. F. (2018). International Code of Nomenclature for Algae, Fungi, and Plants (Shenzhen Code) Adopted by the Nineteenth International Botanical Congress Shenzhen. Glashütten: Koeltz Botanical Books.

Google Scholar

Varani, A. M., Siguier, P., Gourbeyre, E., and Charneau, V. (2011). ISsaga is an ensemble of web-based methods for high throughput identification and semi-automatic annotation of insertion sequences in prokaryotic genomes. Genome Biol. 12:R30. doi: 10.1186/gb-2011-12-3-r30

PubMed Abstract | CrossRef Full Text | Google Scholar

Vaz, M. G. M. V., Genuário, D. B., Andreote, A. P. D., Malone, C. F. S., Sant’Anna, C. L., Barbiero, L., et al. (2015). Pantanalinema gen. nov. and alkalinema gen. nov.: novel pseudanabaenacean genera (cyanobacteria) isolated from saline–alkaline lakes. Int. J. Syst. Evolu. Microbiol. 65, 298–308. doi: 10.1099/ijs.0.070110-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Vincent, L., Jean-Emmanuel, L., and Olivier, G. (2017). SMS: smart model selection in PhyML. Mol. Biol. Evolu. 34, 2422–2424. doi: 10.1093/molbev/msx149

PubMed Abstract | CrossRef Full Text | Google Scholar

Walker, B. J., Abeel, T., Shea, T., Priest, M., and Earl, A. M. (2014). Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963

PubMed Abstract | CrossRef Full Text | Google Scholar

Walter, J. M., Coutinho, F. H., Dutilh, B. E., Swings, J., Thompson, F. L., and Thompson, C. C. (2017). Ecogenomics and taxonomy of cyanobacteria phylum. Front. Microbiol. 8:2132. doi: 10.3389/fmicb.2017.02132

PubMed Abstract | CrossRef Full Text | Google Scholar

Yao, D., Cheng, L., Du, L., Li, M., Daroch, M., and Tang, J. (2021). Genome-wide investigation and analysis of microsatellites and compound microsatellites in leptolyngbya-like species, cyanobacteria. Life 11:1258. doi: 10.3390/life11111258

PubMed Abstract | CrossRef Full Text | Google Scholar

Ye, J., Fang, L., Zheng, H., Zhang, Y., Chen, J., Zhang, Z., et al. (2006). WEGO: a web tool for plotting GO annotations. Nucleic Acids Res. 34, 293–297.

Google Scholar

Zimin, A. V., Marcais, G., Puiu, D., Roberts, M., Salzberg, S. L., and Yorke, J. A. (2013). The MaSuRCA genome assembler. Bioinformatics 29, 2669–2677. doi: 10.1093/bioinformatics/btt476

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: 16S rRNA, 16S-23S ITS, genomics, thermophilic cyanobacterium, Leptolyngbyaceae, Kovacikia, Stenomitos

Citation: Tang J, Shah MR, Yao D, Jiang Y, Du L, Zhao K, Li L, Li M, Waleron MM, Waleron M, Waleron K and Daroch M (2022) Polyphasic Identification and Genomic Insights of Leptothermofonsia sichuanensis gen. sp. nov., a Novel Thermophilic Cyanobacteria Within Leptolyngbyaceae. Front. Microbiol. 13:765105. doi: 10.3389/fmicb.2022.765105

Received: 26 August 2021; Accepted: 31 January 2022;
Published: 28 March 2022.

Edited by:

Brian P. Hedlund, University of Nevada, Las Vegas, United States

Reviewed by:

Vartul Sangal, Northumbria University, United Kingdom
Gongliang Yu, Institute of Hydrobiology (CAS), China
Janaina Rigonato, Genoscope, France

Copyright © 2022 Tang, Shah, Yao, Jiang, Du, Zhao, Li, Li, Waleron, Waleron, Waleron and Daroch. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Maurycy Daroch, bS5kYXJvY2hAcGt1c3ouZWR1LmNu

Present address: Mahfuzur R. Shah, Department of Cell Biology, Metabolism and Systems Biology, Noblegen Inc., Peterborough, ON, Canada

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.