- 1Julius Kühn Institute (JKI) - Federal Research Centre for Cultivated Plants, Institute for Biological Control, Dossenheim, Germany
- 2Biofa GmbH, Münsingen, Germany
Bacillus thuringiensis subsp. tenebrionis (Btt) produces a coleopteran-specific crystal protoxin protein (Cry3Aa δ-endotoxin). After its discovery in 1982, the strain NB125 (DSM 5526) was eventually registered in 1990 to control the Colorado potato beetle (Leptinotarsa decemlineata). Gamma-irradiation of NB125 resulted in strain NB176-1 (DSM 5480) that exhibited higher cry3Aa production and became the active ingredient of the plant protection product Novodor® FC. Here, we report a comparative genome analysis of the parental strain NB125, its derivative NB176-1 and the current commercial production strain NB176. The entire genome sequences of the parental and derivative strains were deciphered by a hybrid de novo approach using short (Illumina) and long (Nanopore) read sequencing techniques. Genome assembly revealed a chromosome of 5.4 to 5.6 Mbp and six plasmids with a size range from 14.9 to 250.5 kbp for each strain. The major differences among the original NB125 and the derivative strains NB176-1 and NB176 were an additional copy of the cry3Aa gene, which translocated to another plasmid as well as a chromosomal deletion (~ 178 kbp) in NB176. The assembled genome sequences were further analyzed in silico for the presence of virulence and antimicrobial resistance (AMR) genes.
1 Introduction
Bacillus thuringiensis (Bt) is a rod-shaped, spore-forming, gram-positive, and entomopathogenic bacterium (Ibrahim et al., 2010; Jouzani et al., 2017). During sporulation, Bt produces parasporal crystals, consisting of one or more insect-specific crystal (Cry) and/or cytolytic (Cyt) proteinaceous protoxins (δ-endotoxins) (Bravo et al., 2011; Crickmore et al., 2021). After oral ingestion by insect larvae, these proteins are proteolytically activated in the midgut and bind to specific receptors located in the midgut epithelial cell membrane, leading to intestinal perforation, feeding stop and finally insect death (Schnepf et al., 1998; Bravo et al., 2007). The δ-endotoxins are encoded by cry and/or cyt genes, typically located on large plasmids (Carlton, 1980; Kronstad et al., 1983; Carlson et al., 1996). With the discovery of Bt strains specific to Lepidoptera, Diptera or Coleoptera, many Bt strains have been successfully developed and used as highly specific biological control agents (Bravo et al., 2007).
The coleopteran specific Bacillus thuringiensis subsp. tenebrionis (Btt) (strain BI 256-82) was first isolated from an infected pupa of the yellow mealworm Tenebrio molitor L. (Coleoptera: Tenebrionidae) at the Institute for Biological Control in Darmstadt, Germany, in 1982 (Krieg et al., 1983). According to serological investigations, the strain was assigned to serovar “morrisoni” (H8a8b) (Krieg et al., 1987). The coleopteran activity of this strain resides in a flat, plate-like crystal, composed primarily of CrylllA (now Cry3Aa) (Höfte et al., 1987; Höfte and Whiteley, 1989; Crickmore et al., 1998; Crickmore et al., 2021). The original isolate of Btt, strain BI 256-82, was patented (US 4,766,203 and US 4,889,918) and deposited at the Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures GmbH (Braunschweig, Germany) under the number DSM 2803.
Soon after its discovery, several Btt based plant protection products were placed on the market in the US, Canada, and Europe (Gelernter, 2004). One of these commercial products was Novodor®, containing the production strain NB125 (DSM 5526), a subculture of BI 256-82, which was registered in 1990 to control the Colorado potato beetle (Leptinotarsa decemlineata) (Gelernter, 2004). By Gamma-irradiation of NB125, NB176-1 (DSM 5480) was produced, which differed from NB125 by (i) oligosporogeny, (ii) a more than twofold increase of Cry3Aa protein production and crystals up to more than five times bigger than the crystals produced by NB125, and hence an increased insecticidal activity (Adams et al., 1994; Gurtler and Petersen, 1994). Based on cloning of the cry3Aa gene and Southern hybridization it had been proposed that a transposition-mediated duplication of the cry3Aa gene caused the Cry3Aa overproduction (Adams et al., 1994). The radiation-derived NB176-1 (DSM 5480) became eventually the new basis of the commercial product Novodor® FC.
Recently, genomic data of NB176 (Novodor® FC) has been published (Biggel et al., 2021), though its genome and number of plasmids have not been fully resolved, probably due to collapsing repeat sequences during the assembly process (Ekblom and Wolf, 2014; Fadeev et al., 2016; Arredondo-Alonso et al., 2017; Smits, 2019).
Here, we report the full genomes of the three Btt strains NB125 (DSM 5526), NB176-1 (DSM 5480) and the current production strain NB176, formulated in the plant protection product Novodor® FC. The sequences were obtained by applying a hybrid de novo assembly approach using a combination of short (Illumina) and long (Nanopore) sequencing reads (Wick et al., 2017b). Comparisons of the three genomes allowed following and understanding the genomic rearrangements that had taken place from the former (parental strain before radiation) to the current production strain used as biocontrol agent (European Food Safety Authority, 2013).
The entomopathogenic Bt and their subspecies are members of the Bacillus cereus sensu lato (s.l.) group which also contains vertebrate pathogens, such as B. cereus sensu stricto (s.s.), B. anthracis, B. weihenstephanensis and others (Helgason et al., 2000; Liu et al., 2015; Ehling-Schulz et al., 2019).
Human pathogenicity of B. cereus is expressed as food poisoning of the emetic (vomiting) or diarrheal type (Turnbull et al., 1979; Schoeni and Wong, 2005; Rajkovic et al., 2008; Stenfors Arnesen et al., 2008; Messelhäußer and Ehling-Schulz, 2018). Whereas genes encoding the emetic toxin cereulide (CES) are absent from Bt strains (Ehling-Schulz et al., 2006; EFSA Panel on Biological Hazards, 2016; Bonis et al., 2021), diarrheal toxin genes encoding hemolysin BL (HBL), non-hemolytic enterotoxin (NHE), and cytotoxin K (CytK) have been demonstrated to be present on the chromosomes of several Bt subspecies (Prüß et al., 1999; Swiecicka et al., 2006; Ngamwongsatit et al., 2008; Bonis et al., 2021). However, there is little to no evidence that Btt or other commercial Bt strains produce such toxins in relevant enterotoxic levels (Damgaard, 1995; Raymond and Federici, 2017; Johler et al., 2018; Raymond and Federici, 2018). Since the use of Btt is for biocontrol purposes, a further focus of our study laid on the identification of such enterotoxin genes as well as on antimicrobial resistance (AMR) genes.
2 Materials and methods
2.1 Bacterial strains and cultivation
The parental NB125 (DSM 5526) and radiation-derived strain NB176-1 (DSM 5480) were obtained as freeze-dried cultures from the DSMZ (Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany), whereas the current production strain NB176 (formulation Novodor® FC) was provided as glycerol stock (vegetative cells in 1 ml glycerol) by Valent BioSciences (Libertyville, Illinois, United States) (Table 1).
After receiving the three different Btt strains, they were always handled under sterile conditions to avoid any contamination. Glassware and culture media were autoclaved prior to use and all steps were performed under a sterile workbench. The freeze-dried strains NB125 and NB176-1 were received in tightly sealed glass vials and were handled according to the protocol of the DSMZ. In brief, the glass vials were broken open and the dried strains were mixed with 0.5 ml LB medium (1% tryptone, 0.5% yeast extract, 0.5% NaCl, pH 7.0 ± 0.2; Carl Roth, Karlsruhe, Germany). After 30 min rehydration, 0.25 ml of the suspension were used to inoculate 5 ml liquid LB medium. To recover NB176, the frozen glycerol stock was stabbed using a sterile loop and 5 ml liquid LB medium were directly inoculated. All bacterial cultures were grown at 30 °C with shaking at 200 rpm overnight.
2.2 Total genomic DNA extraction and whole genome sequencing
Starting from an aliquot of 1 ml overnight culture, total genomic (chromosomal + plasmid) DNA (gDNA) was extracted using the Wizard® Genomic DNA Purification Kit (Promega, Madison, WI, USA), following the protocol for gram-positive bacteria. For cell lysis, the bacteria were treated with lytic enzyme in a final concentration of 2 mg/ml lysozyme (Sigma-Aldrich, St. Louis, Missouri, USA) for 60 min at 37 °C. After DNA precipitation, the gDNA was dissolved in 100 μl dH2O and stored at -80 °C until further use. The quality of gDNA was verified by measuring the absorbance ratios 260/280 and 260/230 using a Nanodrop 2000c spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA). Concentration of gDNA was determined with a Quantus Fluorometer (Promega, Madison, WI, USA) and integrity was confirmed by TAE gel electrophoresis (0.8% agarose).
The identical stocks of gDNA were used for both Illumina (short read) and Nanopore (long read) sequencing. Short-read genomic data was obtained from approximately 500 ng gDNA per sample using a 150 bp paired-end protocol on an Illumina NextSeq 2000 sequencer (StarSEQ GmbH, Mainz, Germany). The genome size was estimated with 6 Mbp and between 6 to 7 M reads were ordered to achieve an estimated 150 to 175-fold read depth. The quality of the raw sequencing data was validated with FastQC (v0.72) (Andrews, 2010) and adapter and quality trimming (Phred quality score ≥ 30; minimum read length after trimming = 20 nt) was performed using Trim Galore! (v0.6.3.) (Krueger, 2015).
For long-read sequencing, the MinION sequencer (Oxford Nanopore Technologies, Oxford, UK) was utilized. Extracted gDNA was concentrated to approximately 30 ng/µl by vacuum evaporation using an Eppendorf Concentrator plus (Eppendorf, Hamburg, Germany) with the following options: 30 °C, brake = off and V-AQ. Multiplexing and library construction were performed using the Native Barcoding Expansion Kit (EXP-NBD104) and Ligation Sequencing Kit (SQK-LSK109), following the manufactures instructions with minor modifications: To increase DNA recovery, freshly prepared 80% ethanol was used throughout the protocol. Incubation times for gDNA binding to and elution from magnetic beads were extended to 15 min. All elution steps were conducted at 37 °C. After adapter ligation, large DNA fragments longer than 3 kbp were enriched using the Long Fragment Buffer (LFB), provided by the kit. The prepared library, consisting of three barcoded DNA samples, was loaded and sequenced on a MinION R9.4.1 flow cell (Oxford Nanopore Technologies Ltd., Oxford, UK) for 68 hours to receive approximately 5 M reads (15.76 Gb). Basecalling, demultiplexing and adapter trimming of long nanopore reads were performed using Guppy (v4.3.4). Subsequently, reads were filtered on minimum read quality (Q = 10) and minimum read length (≥1000 nt) using NanoFilt (v2.3.0) (De Coster et al., 2018). For each sample, basic statistics of long read sequencing data before and after filtering were calculated with NanoStat (v1.1.2.) (De Coster et al., 2018).
2.3 Hybrid de novo genome assembly
For the assembly of the Btt genomes (chromosomes and plasmids) the Unicycler tool (v0.4.8) was used, a SPAdes algorithm based genome assembler with varying k-mer sizes (Wick et al., 2017b). Default parameters and normal bridging mode were selected as settings for the Unicycler, using the quality filtered and adapter trimmed short (Illumina) and long read (MinION) sequencing data. The assembled genomes were visualized with Bandage (v0.8.1.) (Wick et al., 2015) and were further evaluated using QUAST (v5.0.2.) (Gurevich et al., 2013). To resolve the entire genomic structure, manual multiplicity adjustments were required for both NB176-1 and NB176 due to large inter-plasmidic repeats. Briefly, the best SPAdes graphs obtained from the hybrid de novo assemblies were visualized in Bandage. Subsequently, regions in question (e.g. assembled linear plasmid contig) were BLAST searched in Bandage and multiplicity calls were inspected using Unicycler’s graph color scheme (green = single-copy, yellow = two-copy, orange = three-copy, red = multi-copy). In regions where mis-assemblies have occurred, the multiplicities were adjusted manually following the protocol of the Unicycler webpage (https://github.com/rrwick/Unicycler/wiki/Multiplicity-mistake). The modified best SPAdes graph was then used as input for a follow-up hybrid assembly using Unicycler. For NB176 further manual curation was needed, since two plasmids collapsed into a shared repeat sequence, although both plasmids sequences were available in Unicycler’s final assembly graph. To finish the assembly and separate the two plasmids based on read depth, contigs were extracted and merged according to their graph path. Each completed assembly was further polished with the integrated tool Pilon (v1.20) (Walker et al., 2014) using the paired, highly accurate Illumina short reads.
2.4 Evaluation of genome assemblies
The quality of the assembly was evaluated by mapping paired Illumina reads back to the de novo assembled genome using BWA-MEM (Li and Durbin, 2009; Li, 2013). Proportion of mapped and unmapped reads was calculated for each strain using samtools (v1.9.). To examine the reads, which did not map to their respective assembly, unmapped paired-end reads were extracted from BWA-MEM output and de novo assembled into contigs. Taxonomy was assigned to each contig by blastn search against the nt NCBI database.
Coverage and depth of mapping (mean ± standard deviation [SD]) were calculated for each replicon (chromosome and plasmids) within each strain. Coverage plots were visually inspected in Geneious Prime (v2021.2.2) (Biomatters Inc., San Diego, CA, USA) to detect ambiguous regions in the assembly indicating possible errors in the assembled sequences. Circularity of replicons was verified by selecting reads spanning from both contig ends.
To assess assembly completeness in terms of gene content, BUSCO (v5.2.2.; genome mode) (Simão et al., 2015) with lineage-specific ortholog set for Bacillales was used (bacillales_odb10, 2021-02-23). Missing and duplicated BUSCO genes were reported, as they may not be of technical but biological origin such as gene loss and gene duplication.
2.5 Multiple sequence alignment and verification of nucleotide deletions
Multiple sequence alignment (MSA) of NB125, NB176-1 and NB176 was performed using Mauve progressive algorithm (Darling et al., 2010) in Geneious Prime (v2021.2.2). Each replicon (bacterial chromosome, plasmids) was considered individually and compared between all strains carrying that replicon. After MSA, indels were left realigned manually to correct gaps introduced by the aligner. To compare replicon sequences, sequence identity was calculated. Here, sequence identity [%] was defined as the percentage of bases that are identical in an alignment. For bacterial chromosome comparison, alignment was performed twice: An additional sequence identity analysis was performed after excluding a chromosomal deletion from the alignment. To verify detected nucleotide deletions in genome assemblies, paired Illumina reads were mapped with BWA-MEM against the de novo assembled genome sequence of the parental NB125. Samtools depth (Galaxy version 1.9) was used to compute the read depth at each genomic position and the output file was loaded in R (v4.1.2.) using RStudio (v2021.09.2) to create a coverage plot for the corresponding replicon.
2.6 Genome annotation and homology analysis of predicted coding sequences
Automatic annotation of protein coding DNA sequences (CDS) and RNA genes, transfer RNA (tRNA), ribosomal RNA (rRNA) as well as transfer messenger RNA (tmRNA), was performed using Prokka software (v1.14.6, default settings) (Seemann, 2014). Ambiguities in the chromosome annotation between the three strains were identified by aligning the entire chromosome including annotations with Mauve progressive algorithm in Geneious. One CDS in NB176 was not annotated automatically although the complete sequence was available (100% sequence identity). To adjust the annotation, the CDS was added manually.
In the next step, annotated gene sets (GFF3) were used for homology analysis of CDS: Each replicon (bacterial chromosome, plasmids) was considered individually and the predicted CDS were compared between all strains carrying that replicon. To identify CDS homologies, the fully automated Roary software (v3.13, default settings, minimum percentage amino acid identity = 95%, split-paralogs, 100% core genes threshold) was used (Page et al., 2015). Occurring paralogous genes were analyzed individually (paralog splitting) to investigate not only sequence homologies but also their genomic position. The number of shared (= present in all three strains), variable (= present in two strains) and unique (= present in a single strain) CDS were determined. Following this computer based CDS comparison, detected homologies and CDS identified as variable or unique were verified manually. Gene presence/absence files created by Roary software were further analyzed in R (v4.1.2.) using RStudio (v2021.09.2).
Since all annotated features (CDS + RNA genes) were numbered consecutively by Prokka software, the location and annotation of RNA genes were also included in the final gene presence/absence files (Supplementary Tables 1–8). Homology of RNA genes was not investigated.
2.7 Analysis of CDS located on a chromosomal deletion
It was assessed if copies of the CDS located on a deleted stretch of chromosomal DNA can be found somewhere else in the corresponding genome. The set of deleted CDS was extracted and batch searched against the corresponding genome using MEGABLAST (default settings, i.e. max. E-value = 0.05, word size = 28) in Geneious Prime. MEGABLAST results were sorted into two bins (hit vs. no hit) and identified hits were batch searched again to obtain a hit table with nucleotide identity and coverage values. Only CDS that were found to cover at least 70% of the BLAST searched CDS were considered as present. The analysis was carried out at the nucleotide level, since there is no comparable MEGABLAST for protein searches available.
To classify CDS that were located on the deletion site into functional categories, corresponding Clusters of Orthologous Groups (COG) identifiers were used. COG numbers were extracted from Prokka annotations and subsequently searched in the NCBI COG database. The total number of CDS in each detected COG class was counted to investigate overrepresented COG categories. If a CDS was assigned to more than one COG category, a 1 was added to the count of each respective COG class.
2.8 In silico prediction of virulence and resistance genes and sequence typing
Assembled genomes were submitted to BTyper software (Galaxy version 2.0.3, default settings) (Carroll et al., 2017) to identify virulence genes. The software uses a search/comparison based approach (default cutoffs: minimum nucleotide identity = 50%, minimum query coverage = 70%) against the BTyper virulence database, containing 44 virulence genes specific for the B. cereus group.
Besides virulence gene prediction, BTyper was used for in silico multi-locus sequence typing (MLST), panC clade assignment, and rpoB allelic typing. To determine the genome position of all sequences detected by BTyper software, the corresponding reference amino acid (aa) or nucleotide (nt) sequences were downloaded and tblastn (aa) or blastn (nt) searched in Geneious Prime. Location of BLAST hits and corresponding annotation IDs of strain NB125 were reported. Homologous genes (and their annotation IDs) for NB176-1 and NB176 can be found in the corresponding annotation tables (Supplementary Tables 1, 8).
Antimicrobial resistance genes were predicted using the software ABRicate (Galaxy Version 1.0.1; (Seemann, 2018)), including the ARG-ANNOT (2022-Jun-13, n = 1749; (Gupta et al., 2014)), ResFinder (2022-Jun-13; n = 3144; (Zankari et al., 2012)), CARD (2022-Jun-13; n = 2220; (Jia et al., 2016)), and NCBI AMRFinderPlus (2022-Jun-13; n = 6146; (Feldgarden et al., 2019)) databases. ABRicate was used with an 80% nucleotide identity and 70% coverage threshold. In addition, the tool AMRFinderPlus (Galaxy Version 3.1.1b+galaxy1; (Feldgarden et al., 2021), database version 2022-05-26.1) was run in combined mode (nucleotide + protein) allowing the use of Hidden Markow Models (HMM) to enhance sensitivity for novel AMR genes. The default settings (“minimum identity for a blast-based hit =-1.0, means use a curated threshold if it exists and 0.9 otherwise, minimum coverage of the reference protein = 0.5”) were used to allow the use of manually curated BLAST cutoffs.
2.9 Phylogenetic analysis
The complete genome sequences of NB125, NB176-1 and NB176 were uploaded to the Type (Strain) Genome Server (TYGS) for a whole genome-based taxonomic analysis (Meier-Kolthoff and Göker, 2019). Closest type strain genomes were determined automatically (Supplementary Table 11). All pairwise comparisons between the three uploaded strains as well as between the three strains and closest related type strains were performed using the Genome Blast Distance Phylogeny (GBDP) approach. Accurate intergenomic distances were inferred under the algorithm “trimming” and distance formula d5 (Meier-Kolthoff et al., 2013). One hundred distance replicates were calculated each. Digital DNA-DNA-Hybridization (dDDH) values and confidence intervals were calculated using the recommended settings of the GGDC 3.0 (Meier-Kolthoff et al., 2013; Meier-Kolthoff et al., 2022). Based on the intergenomic distances, a balance minimum evolution tree with branch support was inferred via FASTME 2.1.6.1., including Subtree-Pruning-Regrafting (SPR) postprocessing (Lefort et al., 2015). The tree was rooted at the midpoint (Farris, 1972) and visualized with PhyD3 (Kreft et al., 2017). Branch support was inferred from 100 pseudo-bootstrap replicates each.
The automatically determined closest type strain genomes were downloaded from NCBI and subsequently uploaded to the JSpeciesWS webserver (Richter et al., 2016) for ANIb (Average nucleotide identity algorithm using BLAST) calculation (Goris et al., 2007). Pairwise ANIb values between the selected type strain genomes and the three genomes under assessment were calculated. The average of the two reciprocal ANIb values was calculated to provide a single ANIb value for each genome pair.
3 Results
3.1 Sequencing statistics
The output from the Illumina platform was 6.4 M (NB125), 6.9 M (NB176-1), and 7.2 M (NB176) reads with an average Phred quality score of Q = 33, corresponding to a sequencing accuracy of >99.9% (Table 2). After adapter removal, quality trimming and read pairing the number of reads was reduced by approximately 2% (Table 2). Long read sequencing (MinION) yielded in 1.2 M (NB125), 2 M (NB176-1) and 896 K (NB176) reads with a length ranging between N50 = 4.4 kbp (NB176-1) to N50 = 6.4 kbp (NB125) (Table 2). The number of long reads was reduced by 35-40% per sample after filtering by quality (Q ≥ 10) and length (≥ 1000 nt) (Table 2).
3.2 Hybrid assemblies and evaluation
To resolve the entire genome of NB125, NB176-1 and NB176, a hybrid de novo assembly based on short and long read sequencing data was performed. This approach resulted in sufficient high-quality reads to allow the separation of chromosomal and plasmid sequences and their complete reconstruction (Tables 3, 4). The genome length was found to vary between 6 to 6.3 Mbp with an identical GC content of 35.1% (Table 3). The chromosomes were 5.4 to 5.6 Mbp long and in each strain, the chromosome was associated with six putative plasmids that were between 14.9 to 250.5 kbp in length (Tables 3, 4). The plasmid sequences were called putative plasmids (ppl) since their identification method was sequence based. In the process of whole genome reconstruction, the Unicycler tool resolved the genome of NB125 entirely, without any manual adjustment. The assembled contigs of NB125 chromosome (5.6 Mbp) as well as its six plasmids (250-ppl, 185-ppl, 137-ppl, 68-ppl, 43-ppl and 14-ppl) were circular and covalently closed (Tables 3, 4). For NB176-1 and NB176, the reconstruction was hampered by inter-plasmidic repeats and required manual adjustment by fixing ambiguous repeat regions with the help of multiplicities. Here, the chromosomes were found to be 5.6 and 5.4 Mbp long for NB176-1 and NB176, respectively (Table 3). The plasmid set up for NB176-1 and NB176 was slightly different from NB125: 250-ppl, 185-ppl, 99-ppl, 68-ppl, 43-ppl and 14-ppl (Table 4).
Table 3 Assembly statistics of Btt strains NB125, NB176-1, and NB176. Total length [bp] and GC content [%] of the whole genome (chromosome + plasmids) and the single bacterial chromosome as well as the total number of plasmids are given.
Table 4 Sequence based detection of seven putative plasmids (ppl) found in NB125, NB176-1 and NB176.
To assess the quality of the genome assemblies, paired Illumina reads were mapped against their corresponding whole genome assembly. Here, 99.71%, 99.73% and 99.86% of all Illumina reads of NB125, NB176-1 and NB176 mapped, respectively. To investigate the origin of unaligned reads, unmapped, paired-end reads were extracted, de novo assembled into contigs and taxonomy was assignend to each contig by BLAST search against the NCBI database. For each NB125 and NB176-1, two contigs were de novo assembled from unmapped paired-end reads. These contigs were assigned to cattle chromosomes (Bos gaurus, Bos taurus), most likely due to culture media: Prior to freeze-drying, the two strains deposited at DSMZ were cultured on media containing beef-extract. For NB176 it was not possible to de novo assemble the unmapped, paired-end reads into contiguous sequences. Therefore, 0.14% unassigned reads were found for NB176.
Another evaluation step comprised the identification of 450 bacillales core genes using BUSCO software leading to a completeness score of 99.8% (98.7% single copy, 1.1% duplicated and 0.2% missing) for NB125, NB176-1 and NB176. In all three strains, the tryptophan repressor (BUSCO ID 148693at1385) was missing and five BUSCO genes were duplicated (Table 5). Both copies of four duplicated BUCSO genes were located on the chromosome (Table 5), whereas recA was located on the chromosome and on 43-ppl.
Table 5 Duplicated BUSCO genes in NB125, NB176-1, and NB176 according to analysis of 450 core genes listed in the bacillales_odb10, (2021-02-23) dataset.
3.3 Sequence comparison
The chromosome of NB176-1 was 1003 bp longer than that of NB125, but they shared an identical GC content (35.3%, Table 3). The chromosome of NB176 was shorter (~178.5 kbp) than those of NB125 and NB176-1 and had a slightly higher GC content of 35.4% (Table 3). Multiple sequence alignment revealed a chromosomal deletion of 178,499 bp in NB176 (Figure 1A). To verify the detected deletion, short reads of NB176 were mapped against the de novo assembled genome sequence of the parental NB125 where a drop to zero in read depth between 4,203,170 and 4,381,670 confirmed the detected deletion (Figure 1B). MSA was also used to compare the three chromosomes at the nucleotide level: The nucleotide sequence identity of NB125, NB176-1, and NB176 was above 96% and increased to >99.95% after excluding the deletion site from the alignment (Supplementary Table 13).
Figure 1 (A) Schematic linear presentation of the circular chromosomes of NB125, NB176-1, and NB176 (grey boxes). Chromosomal positions are given for each strain below. NB176 contains a 178.5 kbp deletion between positions 4,204,263 and 4,204,264. (B) Chromosome coverage plot of NB176. Short reads of NB176 were mapped against the genome reference sequence of parental NB125. The drop in the sequencing depth between position 4,203,170 and 4,381,670 verified the detected chromosomal deletion of ~178.5 kbp (highlighted in grey) in NB176. Read depth was averaged every 5000 bp along the reference genome.
Six circular plasmids (14-ppl, 43-ppl, 68-ppl, 137-ppl, 185-ppl, 250-ppl) were reconstructed for strain NB125, ranging in length and GC content from 14.9 kbp to 250.5 kbp as well as 31% (14-ppl) to 35.4% (43-ppl) (Table 4). For both NB176-1 and NB176, six circular plasmids (14-ppl, 43-ppl, 68-ppl, 99-ppl, 185-ppl, 250-ppl) were assembled. These plasmids were identical in their length and GC content (Table 4). The difference between the two sets of plasmids was due to the presence of 137-ppl in NB125 and 99-ppl in NB176-1 and NB176 (Table 4). Multiple sequence alignment revealed that 99-ppl is a recombinant plasmid consisting of two parts: One larger part (72,351 bp) was homologous to 137-ppl of NB125 (100% nt sequence identity) and a shorter part (27,082 nt) that was also found in 185-ppl (100% nt sequence identity), located on an inter-plasmidic repeat region with a total length of 28,743 nt (Figure 2). This duplicated region is carrying the cry3Aa gene, encoding the coleopteran-specific protein Cry3Aa (Höfte et al., 1987). Therefore, cry3Aa is present in two copies, on 99-ppl (ID99-ppl = 109) and 185-ppl (ID185-ppl = 168) for the derivative strains NB176-1 and NB176, and only in one copy for NB125 (185-ppl). Moreover, a deletion of a single stretch of plasmid DNA (65,061 nt) compared to 137-ppl of the parental strain NB125 was detected in NB176-1 and NB176.
Figure 2 Schematic presentation of the six plasmids of NB125, NB176-1, and NB176. Each circular plasmid is linearized and represented in scale by a rectangle. Same colors represent homologous regions. Plasmid 137-ppl has a unique region (red) and a region of 72,351 nt (pink) that is shared with 99-ppl (100% nt sequence identity). In addition to that shared region, a shorter part of 99-ppl (27,082 nt) is also found in 185-ppl (blue, 100% nt sequence identity), located on an inter-plasmidic repeat region with a total length of 28,743 nt. This duplicated region is carrying the cry3Aa gene, which thereby is present in two copies, on 99-ppl and 185-ppl, for NB176-1 and NB176, and only in one copy for NB125. Besides this recombination, a deletion of a single stretch of plasmid DNA (65,061 nt) compared to 137-ppl of the parental strain NB125 was detected in NB176-1 and NB176.
In addition to the cry3Aa gene, two other pesticidal genes, encoding for Mpp23Aa1 (ID185-ppl = 8; 100% aa identity/100% coverage to GenBank accession no. AAF76375) and Xpp37Aa1 (ID185-ppl = 7; 100% aa identity/100% coverage to GenBank accession no. AAF76376) were found on the original Cry3Aa encoding plasmid 185-ppl.
3.4 Genome annotation
For the chromosome of NB125, NB176-1 and NB176, a total number of 5760, 5761 and 5579 annotations were predicted, respectively. The majority of 5610 (NB125), 5611 (NB176-1) and 5429 (NB176) annotations were identified as CDS, whereas the remaining 150 annotations were assigned to RNA genes (Table 6). A total number of 30, 56, 78, 109, and 168 CDS were predicted for 14-ppl, 43-ppl, 68-ppl, 99-ppl, and 137-ppl, respectively (Table 4 and Supplementary Tables 2–6). For 185-ppl a total number of 168 CDS was predicted for each strain, although nucleotide sequences were slightly different (Table 4 and Supplementary Tables 7, 12). Homology analysis of CDS (homology aa cutoff ≥95%) revealed that there were no differences in gene content. In addition, order and orientation of predicted CDS was identical for 185-ppl. A total number of 237 (NB125, NB176) and 236 (NB176-1) CDS as well as one tRNA gene was found for 250-ppl (Table 4 and Supplementary Table 8). According to the computer-based comparison of CDS, 236 CDS were found to be homologous in all strains. In 250-ppl of NB176-1, a nucleotide difference caused the stop codon to be skipped in CDS ID250-ppl/NB176-1 = 66. Thus, CDS no. 66 of NB176-1 spans the length of the two CDS no. 66 and no. 67 of NB125 and NB176 (ID250-ppl/NB125/NB176 = 66 + 67, Supplementary Table 8).
Table 6 Annotation statistic for each final chromosome assembly of Btt strain NB125, NB176-1, and NB176.
Coding sequences of the three annotated chromosomes were compared using an automated approach based on Roary software with a homology criterion of ≥95% amino acid sequence identity. This automated method was verified manually and adjusted when required. Based on this approach, a set of 5428 chromosomal CDS was found to be homologous for all three strains NB125, NB176-1, and NB176 (=shared genes) (Figure 3). The main differences in chromosomal CDS were the absence of 182 CDS (IDNB125 = 4312-4493) in NB176 due to the chromosomal deletion of 178.5 kbp, and one additional IS110 family transposase ISBth166 gene (IDNB176-1 = 1492 and IDNB176 = 1492) that was present in NB176-1 and NB176, but not in the parental NB125 (Figure 3). Initially, two CDS were reported to fail the homology criterion due to short nucleotide insertions/deletions (indels): The homologous CDS IDNB125 = 1544, IDNB176-1 = 1545 and IDNB176 = 1545 were located at the same relative position of the chromosomes. It was found that IDNB176-1 = 1545 differed from IDNB125 = 1544 and IDNB176 = 1545 by a 441 bp deletion resulting in the loss of 147 amino acids (aa) (aa identity = 78.1%) (Table 7). Similarly, the homologous CDS IDNB125 = 3195, IDNB176-1 = 3196 and IDNB176 = 3196 differed from each other that IDNB176-1 = 3196 and IDNB176 = 3196 had a 294 bp deletion resulting in the loss of 98 aa in the middle of the aa sequence, which reduced the overall aa identity to 92.3%. The remaining predicted aa of the CDS were identical between all strains (Table 7).
Figure 3 Shared, variable and unique chromosomal CDS detected for parental strain NB125 and its derived strains NB176-1 and NB176. In total, 5611 CDS were detected, of which 5428 were shared by all three strains. The number of chromosomal CDS predicted for each strain are given in parenthesis below the strain names. Numbers inside the Venn diagram represent unique, variable and shared CDS.
Table 7 Two sets of chromosomal CDS failing the homology criterion (≥95% amino acid sequence identity) of the fully automated detection of homologous CDS using Roary software.
It was further assessed if the additional transposase gene (IDNB176-1 = 1492 and IDNB176 = 1492) occurs once, or whether it is an extra gene copy that occurs frequently in the bacterial chromosome. The results of a blastn search revealed that this gene occurs 13 times on the chromosome of the parental NB125 (8 copies with 100% nt identity and 100% coverage, and 5 copies with 97.6% nt identity/100% coverage), whereas it occurs 14 times on the chromosome of NB176-1 and NB176 (9 copies with 100% nt identity/100% coverage, and 5 copies with 97.6% nt identity/100% coverage). Therefore, both NB176-1 and NB176 carry one additional copy of this transposase gene.
In the next step, the automatic annotation of all 182 CDS located on the deletion site, was examined. From these, a total of 93 CDS (~51.1%) were annotated as hypothetical proteins (Supplementary Table 9). Eighty-nine CDS (~48.9%) were annotated with the name of an encoded protein and 34 (~38.2%) of these annotations included Enzyme Commission (EC) numbers (Supplementary Table 9). Sixty (~67.4%) out of 89 CDS were assigned to Cluster of Orthologous Groups (COG) identifiers (Supplementary Table 9). To classify these genes into functional categories, the corresponding COG identifiers were searched in the NCBI COG database. Genes located on the deletion were assigned to 16 functional groups (+ R = General function prediction only, + S = Function unknown) (Figure 4). After this functional assignment, the total number of CDS in each detected COG class was counted. Most genes were assigned to the COG category Amino acid transport and biosynthesis. As an example, the ABC transporter complex TcyABC involved in L-cystine import is located within the deletion site. The second largest COG category represented proteins with function in Defense mechanisms. Several genes associated with bacitracin resistance, e.g. a bacitracin efflux BceAB type ATP transporter were detected. In addition, some annotations without a COG identifier can be assigned to the function Defense mechanisms e.g. tetracycline resistance protein, class B; guanidinium exporter, and bacitracin transport ATP- binding protein BcrA (Supplementary Table 9). Further examples for CDS located at the deletion site were two putative ABC transporters YknXYZ (Cell wall/membrane/envelope biogenesis; Defense mechanisms), a Na(+)/H(+)-K(+) antiporter GerN associated with endospore germination (Inorganic ion transport and metabolism) and a licABC operon (Carbohydrate transport and metabolism), involved in the uptake and metabolism of lichenin degradation products (Supplementary Table 9).
Figure 4 Cluster of Orthologous Groups (COG) analysis of CDS located within the deletion site of NB176. The number of CDS per COG category is given. A total of 182 predicted CDS were located on the chromosomal deletion in NB176. Eighty-nine CDS were annotated with the name of the encoded protein and sixty of these annotations included COG identifiers. These COG numbers were searched in the NCBI COG database to classify the 60 genes into 18 groups (16 functional groups + R = General function prediction only, + S = Function unknown). Due to assignments of CDS into multiple groups, 74 assignments were made. To investigate overrepresented COG categories, the total number of CDS in each detected COG class was counted. If a CDS was assigned to more than one COG category, a 1 was added to the count of each respective COG class.
To analyze, whether the 182 located within the deletion site, can be found somewhere else in the genome of NB176 the set of deleted CDS was extracted from the parental genome NB125 and batch searched against the NB176 genome using MEGABLAST algorithm. From the 182 CDS located on the chromosomal deletion, twelve CDS were found with hits elsewhere in the genome (Table 8). Four CDS were found with one homologous copy: IDNB125 = 4334, 4336, 4412, and 4417. Except for IDNB125 = 4336, which is located on 99-ppl, the CDS were located on the chromosome. For these four CDS, the nucleotide identity and CDS coverage ranged between 73.8% to 92.3% and 76.8% to 100%, respectively (Table 8).
Table 8 Coding sequences, located on the deleted region that were found elsewhere in the genome of NB176.
The remaining eight CDS had more than one homologous CDS copy on the genome. Copies were found on the chromosome, 99-ppl, 185-ppl and 250-ppl. Except for IDNB125 = 4426 and 4475, CDS copies were found in duplicates on the chromosome and another two copies on 185-ppl and 250-ppl (Table 8). The nucleotide identity of the multi copy CDS varied between 93.9% to 100% and the coverage of the CDS was above >99.8% (Table 8). A block of six consecutive CDS (IDNB125 = 4420 to 4425) was found in the same order and orientation at four other locations in the genome of NB176. This block of genes occurred twice on the chromosome and once on 185-ppl and 250-ppl (Table 8). This repetitively occurring block of CDS carried BcrA and BcrC genes, both associated with bacitracin resistance.
In total, three out twelve reoccurring CDS were annotated as hypothetical proteins; four CDS were annotated as transposase genes, and three were associated with bacitracin resistance, In addition, one virulence gene associated with secretion and one gene annotated as a transcriptional regulatory protein was found to be located elsewhere in the genome (Table 8).
3.5 In silico prediction of virulence and antimicrobial resistance genes
The presence of 44 virulence genes specific for the B. cereus group was examined in silico by BTyper software (nucleotide identity ≥50%, coverage ≥70%). For NB125, NB176-1 and NB176 the same set of 23 virulence genes was predicted (Table 9) and the corresponding nucleotide sequences were identical. Some enterotoxins associated with diarrheal symptoms are expressed by operons: Both nhe (nheABC) and hbl (hblABCD) operon genes were identified in their entirety in the genomes of three analyzed strains (Table 9). However, the nhe promoter was disrupted by a transposase (IDNB125 = 3455, IDNB176-1 = 3456, IDNB176 = 3456) insertion in all three strains. Further, three putative enterotoxin genes encoding for the one-component proteins FM (entFM), T (BceT) and A (entA); two metalloprotease genes (inhA1, inhA2); and two genes (clo, hlyII) encoding for pore forming hemolysins (cerolysin O, hemolysin II) were detected. Three different phospholipase C genes were found; plcA encoding phosphatidylinositol-specific phospholipase C (PI-PLC), cerA encoding phosphatidylcholine-specific phospholipase C (PC-PLC), and cerB encoding sphingomyelinase (SM-PLC). PC-PLC and SM-PLC are encoded in an operon (cerAB) and form the biological complex cereolysin AB. Two genes coding for regulatory proteins, hlyIIR, regulating hlyII, and the pleiotropic regulator plcR, controlling the expression of multiple virulence genes, were found. Capsule genes (hasA, cap) and genes associated with the production of anthrax toxin (pagA, lef, atxA, cya) were not detected. In addition, all strains were negative for ces as well as cytK.
Table 9 Virulence genes specific for the B. cereus group predicted by BTyper software (minimum percent amino acid (aa) identity to reference sequence: 50%, minimum coverage of reference sequence = 70%).
Three out of nine bps genes belonging to the exo-polysaccharide operon bpsX-H were predicted. However, the sequence identities were relatively low (<70%) for two of these positive hits (bpsF, bpsH) and the corresponding CDS were located on different chromosomal loci. In general, predicted virulence genes were mostly located on the bacterial chromosome, except for plcA. This gene was not only predicted for the chromosome, but also for the 250-ppl with a lower nt sequence identity (Table 9). To verify this predicted plasmid-encoded phospholipase C gene despite the relatively low nt sequence identity, the corresponding aa sequence of IDNB125/250-ppl = 147 was blastp searched in the NCBI database, revealing that the aa sequence is identical to a multispecies phosphatidylinositol-specific phospholipase C domain-containing protein (WP_086405655.1).
By using ABRicate and AMRFinderPlus software, the presence of antimicrobial resistance (AMR) genes was assessed in silico. For each of the three strains, the same AMR genes were predicted (Table 10) and the corresponding nucleotide sequences were identical. In total, 13 AMR genes were predicted for the bacterial chromosome (Table 10). These genes are associated with resistance to beta-lactam (bla, bla2), glycopeptide (vanZ-F), macrolide (mhpL, abc-f), lincosamide-streptogramin (lsa), streptogramin (vat), streptothricin (sat A), phenicol (catA) and fosfomycin (fosB) drug families. Three resistance genes, assigned to aminoglycoside class of antibiotics, were predicted for plasmid 250-ppl (Table 10). No resistance genes were predicted for plasmids 185-ppl, 137-ppl, 99-ppl, 68-ppl, 43-ppl, and 14-ppl.
3.6 In silico MLST, panC clade assignment and rpoB allelic typing
In silico analysis performed by BTyper software confirmed that the three strains NB125, NB176-1 and NB176 belonged to the B. cereus group. The different typing methods (panC, MLST, rpoB) revealed identical results for each Btt strain (Table 11). All three strains were assigned to panC clade 4 and the same closest-matching B. cereus group genome was reported. Analysis of seven MLST genes showed, that all three strains shared an identical allelic profile and were therefore assigned to the same sequence type (ST=23). Based on the rpoB sequence of each strain, the same allele type (AT0296) was identified, which showed a high similarity (100%) to B. cereus s.l.
Table 11 In silico panC clade typing, multi-locus sequence typing (MLST), and rpoB allelic typing of the three sequenced Btt strains.
3.7 Phylogenetic analysis
Based on digital DNA-DNA hybridization (dDDH), twelve most closely related type strain genomes, all belonging to the Bacillus cereus group, were determined automatically using the TYGS server (Supplementary Table 14 and Supplementary Figure 1). The most closely related type strain genome of NB125, NB176-1, and NB176 was Bacillus thuringiensis ATCC 10792 with a dDDH value of 68.3%, 68.1%, and 68.1%, respectively, followed by Bacillus cereus ATCC 14579 with a dDDH value of 65.6%, 65.6% and 65.7%, respectively (Supplementary Table 14). All pairwise dDDH values between the three genomes under assessment and the type strain genomes were below the species delineation threshold of 70%, but the upper limit of the 95% confidence interval regarding Bacillus thuringiensis ATCC 10792 was above 70%. According to dDDH, NB125, NB176-1, and NB176 are highly similar, with pairwise dDDH values ranging from 99.9 to 100% (Supplementary Table 14), resulting in the same species (70% dDDH species threshold) as well as subspecies (79% dDDH subspecies threshold) clustering (Supplementary Table 14, Supplementary Figure 1). The overall high sequence similarity of NB125, NB176-1 and NB176 was further confirmed by pairwise ANIb values ranging from 99.89 to 99.99% (Supplementary Table 15). Pairwise ANIb analysis of the three genome sequences and the closest type strains revealed that NB125, NB176-1 and NB176 showed ANIb values slightly above 95% with both B. thuringiensis and B. cereus type strain genomes (Supplementary Table 15).
4 Discussion
With the first and complete de novo reconstruction of the genome of NB125, the molecular changes that led to the creation of NB176-1 and NB176, the current production strain formulated in the plant protection product Novodor® FC, could be traced. To decipher their complete genomes, a hybrid approach of short (Illumina) and long read (Nanopore) sequencing (Wick et al., 2017b) was applied, which allowed the detection of all genomic features underlining the strength of this technique. Long sequencing reads spanning repetitive regions helped to resolve these critical genomic features although manual intervention had been required to finalize the assembly (Wick et al., 2017a). The reliability of the genomic assemblies of NB125, NB176-1 and NB176 were supported by the recent complete genome assembly of Bacillus thuringiensis serovar morrisoni (strain 4AA1) (Genbank assembly GCA_022810725.1). Its chromosomal length, number and length of plasmids was similar to NB125, NB176-1 and NB176, although the applied sequencing technique was different (Illumina sequencing, PacBio (single molecule sequencing) and Sanger sequencing techniques). In general, resolving repetitive sequences and multi-copy genes compromising the correct assembly of high throughput sequencing data is key to a complete bacterial genome reconstruction. The challenge of unresolved plasmid sequences might be the reason why in a previous sequencing approach of NB176 (Biggel et al., 2021) the identification of the plasmid number and length was not successful.
In the literature, NB176-1 and NB176 were described as high-yielding derivatives of NB125 (Adams et al., 1994; Gurtler and Petersen, 1994; European Food Safety Authority, 2013). However, only little information was known on the differences between the parental strain NB125 and its radiation-derived strains. Although a duplication of the cry3Aa gene in NB176-1 was already predicted by Adams et al. (1994) by Southern hybridization and partial gene sequencing, a conclusive proof is finally provided by this study, identifying cry3Aa on plasmids 99-ppl and 185-ppl of NB176-1. Furthermore, the genomic analysis revealed that 99-ppl was a recombination product of the original 137-ppl plasmid and a region of 185-ppl carrying the cry3Aa gene and therefore resulting in its duplication and the Cry3Aa overproducing phenotype of NB176-1. By this finding, the origin of the cry3Aa gene could be unequivocally defined on the 185-ppl (~120 kDa) of NB125, which is close to the plasmid size of 90 kDa that had been proposed as location of cry3Aa (Sekar et al., 1987).
Besides the noted re-arrangement, we further propose that the shortening of 137-ppl, which led to the creation of 99-ppl, was mediated by Tn3 family transposases (TnBth1, TnBth2) and an IS4 family transposase IS231C. It was described previously that insertion sequences, such as IS231 and IS232 and transposons are frequently found in the vicinity of cry genes enabling transposition and gene duplication (Menou et al., 1990; Mahillon et al., 1994; Murawska et al., 2014; Fiedoruk et al., 2017). In addition to the cry3Aa gene, two other insecticidal genes, encoding for Mpp23Aa1 (old name: Cry23Aa) and Xpp37Aa1 (old name: Cry37Aa) were located on the original Cry3Aa encoding plasmid 185-ppl. These findings are in line with Caballero et al. (2020) who suggested that the commercial strain NB176 produces the three insecticidal proteins Cry3Aa, Mpp23Aa1 (Cry23Aa), and Xpp37Aa1 (Cry37Aa) (Caballero et al., 2020). These conclusions are difficult to draw from the previously published incomplete assembly of NB176 (Biggel et al., 2021) since collapsing repeat regions did not allow assumptions about changes and recombination in plasmid sequences. Genome assemblies are further challenged by the presence of repetitive mobile regions that could flank δ-endotoxin and antimicrobial resistance genes (Kronstad et al., 1983; Fiedoruk et al., 2017; Orlek et al., 2017) hiding their genomic position (He et al., 2015; Sheppard et al., 2016; Arredondo-Alonso et al., 2017). The exact location of antimicrobial resistance genes is crucial, because they might be spread when present on mobile genetic elements such as plasmids (Tschäpe, 1994; Thomas and Nielsen, 2005; Berbers et al., 2020).
Although NB176-1 and NB176 are almost identical in their plasmid sequences, a single chromosomal deletion of ~175.8 kbp was identified in NB176, when compared to NB125 and NB176-1. The deletion comprised a set of 182 CDS, of which 170 CDS had no additional copies on the chromosomal or any other plasmid sequences. Although sixty CDS located on the deletion site were used for functional analyses, it remains unclear how this deletion influence phenotypic characteristics. Some of the deleted genes might enable the parental strain NB125 to survive in its natural environment but may be unnecessary under production conditions. Since NB176-1 was deposited at the DSMZ in 1989 and NB176 is representing the current strain under commercial production, it can be hypothesized that the genome of NB176 in its current state is highly stable and did not undergo any further changes. Since the genomes of NB176-1 and NB176 are almost identical, the 175.8 kbp deletion could be regarded as a singular event in the history of NB176.
The screening for antimicrobial resistance genes and “Bacillus cereus s.l.” (Bc) virulence genes did not reveal differences between the parental and derivative strains. Regarding enterotoxin genes, all three Btt strains harbored genes encoding for the non-hemolytic enterotoxin (nhe) and hemolysin BL (hbl), but were negative for cytotoxin K (cyt K), consistent with previous studies concerning Btt (Kyei-Poku et al., 2007; Schwenk et al., 2020; Biggel et al., 2021; Bonis et al., 2021). Moreover, this study confirmed that the nhe promoter was disrupted by a transposase insertion in all three strains and thus that Nhe expression might be prevented in the commercial NB176 (Johler et al., 2018; Biggel et al., 2021). Genes encoding for one-component putative enterotoxins (entFM, bceT, entA) pore forming haemolysins (clo, hlyII), metalloproteases (inhA1, inhA2), phospholipases (plcA, cerA, cerB), and regulatory proteins (hlyIIR, plcR) were detected. It can be assumed that the three detected exo-polysaccharide genes (bpsE/F/H) are false positive hits, since only three out of nine bpsX-H genes were detected and located on different chromosomal loci. Previous studies have demonstrated that some members of the B. cereus group possess genes sharing a high degree of homology with Bps-encoding genes (Carroll et al., 2019; Carroll et al., 2020). Therefore, it was concluded that 19 chromosomally and one plasmid-encoded (putative phosphatidylinositol-specific phospholipase C located on plasmid 250-ppl) virulence genes were found.
Thirteen chromosomally encoded AMR genes were predicted. Four beta-lactamase genes were detected, in agreement with previous studies that found beta-lactamase genes as well as phenotypic resistance to be very common in the B. cereus group, including B. thuringiensis (Luna et al., 2007; Park et al., 2009; Kim et al., 2015; Yim et al., 2015; Fiedler et al., 2019; Bianco et al., 2021; Mills et al., 2022). Even though a single vancomycin resistance gene was found, a vancomycin resistance seems unlikely, as individual van genes were previously reported in the B. cereus group but were insufficient to confer phenotypic resistance (Arthur et al., 1992; Patel et al., 1997; Bianco et al., 2021; Mills et al., 2022). Genes associated with resistance to macrolide (mhpL, abc-f), lincosamide-streptogramin (lsa), streptogramin (vat), streptothricin (sat A), phenicol (catA) and fosfomycin (fosB) drug families were also detected in the three analyzed strains. In addition to the chromosomally encoded AMR genes, three nucleotidyltranferase genes associated with aminoglycoside resistance were predicted for plasmid 250-ppl. Some isolates of the B. cereus group are known for aminoglycoside resistance (Parulekar and Sonawane, 2018; Fiedler et al., 2019) whereas others carried nucleotidyltransferase genes but were not phenotypically resistant (Fiedler et al., 2019). Since only few studies associate the presence of AMR (Mills et al., 2022) or virulence genes to phenotypic traits, no direct conclusion can be drawn from in silico studies about the actual functionality of any of these genes or the conditions under which these genes might be expressed.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/, PRJNA886432.
Author contributions
LS, RK, JJ, and JW designed the study. FV helped providing the Btt strain. LS conducted the experiments. LS performed the analysis of the sequences. LS wrote the original draft and RK, JJ and JW reviewed and edited the manuscript. All authors have read and agreed to the published version of the manuscript. All authors contributed to the article and approved the submitted version.
Funding
This study received funding from JKI Project BioSam and Biofa GmbH. Biofa GmbH was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.
Acknowledgments
The authors would like to thank Jens Keilwagen und Thomas Berner for their bioinformatic assistance during the genome analysis.
Conflict of interest
Author FV is employed by Biofa GmbH.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2023.1129177/full#supplementary-material
References
Adams, L. F., Mathewes, S., O'Hara, P., Petersen, A., Gürtler, H. (1994). Elucidation of the mechanism of CryIIIA overproduction in a mutagenized strain of bacillus thuringiensis var. tenebrionis. Mol. Microbiol. 14 (2), 381–389. doi: 10.1111/j.1365-2958.1994.tb01298.x
Andrews, S. (2010). “FastQC: A quality control tool for high throughput sequence data,” in Babraham Bioinformatics (United Kingdom: Babraham Institute, Cambridge).
Arredondo-Alonso, S., Willems, R. J., van Schaik, W., Schürch, A. C. (2017). On the (im) possibility to reconstruct plasmids from whole genome short-read sequencing data. Microbial Genomics 3 (10), e000128. doi: 10.1101/086744
Arthur, M., Molinas, C., Courvalin, P. (1992). The VanS-VanR two-component regulatory system controls synthesis of depsipeptide peptidoglycan precursors in Enterococcus faecium BM4147. J. Bacteriology 174 (8), 2582–2591. doi: 10.1128/jb.174.8.2582-2591.1992
Berbers, B., Saltykova, A., Garcia-Graells, C., Philipp, P., Arella, F., Marchal, K., et al. (2020). Combining short and long read sequencing to characterize antimicrobial resistance genes on plasmids applied to an unauthorized genetically modified Bacillus. Sci. Rep. 10 (1), 1–13. doi: 10.1038/s41598-020-61158-0
Bianco, A., Capozzi, L., Monno, M. R., Del Sambro, L., Manzulli, V., Pesole, G., et al. (2021). Characterization of bacillus cereus group isolates from human bacteremia by whole-genome sequencing. Front. Microbiol. 11, 599524. doi: 10.3389/fmicb.2020.599524
Biggel, M., Etter, D., Corti, S., Brodmann, P., Stephan, R., Ehling-Schulz, M., et al. (2021). Whole genome sequencing reveals biopesticidal origin of Bacillus thuringiensis in foods. Front. Microbiol. 4149. doi: 10.3389/fmicb.2021.775669
Bonis, M., Felten, A., Pairaud, S., Dijoux, A., Maladen, V., Mallet, L., et al. (2021). Comparative phenotypic, genotypic and genomic analyses of bacillus thuringiensis associated with foodborne outbreaks in France. PloS One 16 (2), e0246885. doi: 10.1371/journal.pone.0246885
Bravo, A., Gill, S. S., Soberon, M. (2007). Mode of action of Bacillus thuringiensis cry and cyt toxins and their potential for insect control. Toxicon 49 (4), 423–435. doi: 10.1016/j.toxicon.2006.11.022
Bravo, A., Likitvivatanavong, S., Gill, S. S., Soberón, M. (2011). Bacillus thuringiensis: A story of a successful bioinsecticide. Insect Biochem. Mol. Biol. 41 (7), 423–431. doi: 10.1016/j.ibmb.2011.02.006
Caballero, J., Jiménez-Moreno, N., Orera, I., Williams, T., Fernández, A. B., Villanueva, M., et al. (2020). Unraveling the composition of insecticidal crystal proteins in Bacillus thuringiensis: A proteomics approach. Appl. Environ. Microbiol. 86 (12), e00476–e00420. doi: 10.1128/AEM.00476-20
Carlson, C. R., Johansen, T., Lecadet, M.-M., Kolsto, A.-B. (1996). Genomic organization of the entomopathogenic bacterium Bacillus thuringiensis subsp. berliner 1715 Microbiology 142 (7), 1625–1634. doi: 10.1099/13500872-142-7-1625
Carlton, B. C. (1980). Patterns of plasmid DNA in crystalliferous and acrystalliferous strains of Bacillus thuringiensis. Plasmid 3 (1), 92–98. doi: 10.1016/S0147-619X(80)90038-4
Carroll, L. M., Cheng, R. A., Kovac, J. (2020). No assembly required: Using BTyper3 to assess the congruency of a proposed taxonomic framework for the Bacillus cereus group with historical typing methods. Front. Microbiol. 11, 580691. doi: 10.3389/fmicb.2020.580691
Carroll, L. M., Kovac, J., Miller, R. A., Wiedmann, M. (2017). Rapid, high-throughput identification of anthrax-causing and emetic Bacillus cereus group genome assemblies via BTyper, a computational tool for virulence-based classification of Bacillus cereus group isolates by using nucleotide sequencing data. Appl. Environ. Microbiol. 83 (17), e01096–e01017. doi: 10.1128/AEM.01096-17
Carroll, L. M., Wiedmann, M., Mukherjee, M., Nicholas, D. C., Mingle, L. A., Dumas, N. B., et al. (2019). Characterization of emetic and diarrheal bacillus cereus strains from a 2016 foodborne outbreak using whole-genome sequencing: Addressing the microbiological, epidemiological, and bioinformatic challenges. Front. Microbiol. 10, 144. doi: 10.3389/fmicb.2019.00144
Crickmore, N., Berry, C., Panneerselvam, S., Mishra, R., Connor, T. R., Bonning, B. C. (2021). A structure-based nomenclature for Bacillus thuringiensis and other bacteria-derived pesticidal proteins. J. Invertebrate Pathol. 186, 107438. doi: 10.1016/j.jip.2020.107438
Crickmore, N., Zeigler, D., Feitelson, J., Schnepf, E., Van Rie, J., Lereclus, D., et al. (1998). Revision of the nomenclature for the Bacillus thuringiensis pesticidal crystal proteins. Microbiol. Mol. Biol. Rev. 62 (3), 807–813. doi: 10.1128/MMBR.62.3.807-813.1998
Damgaard, P. H. (1995). Diarrhoeal enterotoxin production by strains of Bacillus thuringiensis isolated from commercial Bacillus thuringiensis-based insecticides. FEMS Immunol. Med. Microbiol. 12 (3-4), 245–249. doi: 10.1111/j.1574-695X.1995.tb00199.x
Darling, A. E., Mau, B., Perna, N. T. (2010). progressiveMauve: Multiple genome alignment with gene gain, loss and rearrangement. PloS One 5 (6), e11147. doi: 10.1371/journal.pone.0011147
De Coster, W., D’hert, S., Schultz, D. T., Cruts, M., Van Broeckhoven, C. (2018). NanoPack: Visualizing and processing long-read sequencing data. Bioinformatics 34 (15), 2666–2669. doi: 10.1093/bioinformatics/bty149
EFSA Panel on Biological Hazards (2016). Risks for public health related to the presence of Bacillus cereus and other Bacillus spp. including Bacillus thuringiensis in foodstuffs. EFSA J. 14 (7), e04524. doi: 10.2903/j.efsa.2016.4524
Ehling-Schulz, M., Fricker, M., Grallert, H., Rieck, P., Wagner, M., Scherer, S. (2006). Cereulide synthetase gene cluster from emetic Bacillus cereus: Structure and location on a mega virulence plasmid related to Bacillus anthracis toxin plasmid pXO1. BMC Microbiol. 6 (1), 1–11. doi: 10.1186/1471-2180-6-20
Ehling-Schulz, M., Lereclus, D., Koehler, T. M. (2019). The Bacillus cereus group: Bacillus species with pathogenic potential. Microbiol. Spectr. 7 (3):7–3. doi: 10.1128/microbiolspec.GPP3-0032-2018
Ekblom, R., Wolf, J. B. (2014). A field guide to whole-genome sequencing, assembly and annotation. Evolutionary Appl. 7 (9), 1026–1042. doi: 10.1111/eva.12178
European Food Safety Authority (2013). Conclusion on the peer review of the pesticide risk assessment of the active substance bacillus thuringiensis ssp. tenebrionis strain NB-176. EFSA J. 11 (1), 3024. doi: 10.2903/j.efsa.2013.3024
Fadeev, E., De Pascale, F., Vezzi, A., Hübner, S., Aharonovich, D., Sher, D. (2016). Why close a bacterial genome? The plasmid of alteromonas macleodii HOT1A3 is a vector for inter-specific transfer of a flexible genomic island. Front. Microbiol. 7, 248. doi: 10.3389/fmicb.2016.00248
Farris, J. S. (1972). Estimating phylogenetic trees from distance matrices. Am. Nat. 106 (951), 645–668. doi: 10.1086/282802
Feldgarden, M., Brover, V., Gonzalez-Escalona, N., Frye, J. G., Haendiges, J., Haft, D. H., et al. (2021). AMRFinderPlus and the reference gene catalog facilitate examination of the genomic links among antimicrobial resistance, stress response, and virulence. Sci. Rep. 11 (1), 1–9. doi: 10.1038/s41598-021-91456-0
Feldgarden, M., Brover, V., Haft, D. H., Prasad, A. B., Slotta, D. J., Tolstoy, I., et al. (2019). Validating the AMRFinder tool and resistance gene database by using antimicrobial resistance genotype-phenotype correlations in a collection of isolates. Antimicrobial Agents Chemotherapy 63 (11), e00483–e00419. doi: 10.1128/AAC.00483-19
Fiedler, G., Schneider, C., Igbinosa, E. O., Kabisch, J., Brinks, E., Becker, B., et al. (2019). Antibiotics resistance and toxin profiles of Bacillus cereus-group isolates from fresh vegetables from German retail markets. BMC Microbiol. 19 (1), 1–13. doi: 10.1186/s12866-019-1632-2
Fiedoruk, K., Daniluk, T., Mahillon, J., Leszczynska, K., Swiecicka, I. (2017). Genetic environment of cry1 genes indicates their common origin. Genome Biol. Evol. 9(9), 2265–2275. doi: 10.1093/gbe/evx165
Gelernter, W. (2004). The rise and fall of Bacillus thuringiensis tenebrionis. Phytoparasitica 32 (4), 321–324. doi: 10.1007/BF02979840
Goris, J., Konstantinidis, K. T., Klappenbach, J. A., Coenye, T., Vandamme, P., Tiedje, J. M. (2007). DNA–DNA Hybridization values and their relationship to whole-genome sequence similarities. Int. J. Systematic Evolutionary Microbiol. 57 (1), 81–91. doi: 10.1099/ijs.0.64483-0
Gupta, S. K., Padmanabhan, B. R., Diene, S. M., Lopez-Rojas, R., Kempf, M., Landraud, L., et al. (2014). ARG-ANNOT, a new bioinformatic tool to discover antibiotic resistance genes in bacterial genomes. Antimicrobial Agents Chemotherapy 58 (1), 212–220. doi: 10.1128/AAC.01310-13
Gurevich, A., Saveliev, V., Vyahhi, N., Tesler, G. (2013). QUAST: quality assessment tool for genome assemblies. Bioinformatics 29 (8), 1072–1075. doi: 10.1093/bioinformatics/btt086
Gurtler, H., Petersen, A. S. (1994). Mutants or variants of bacillus thuringiensis producing high yields of delta endotoxin. (U.S. Patent No. 5,279,962)
He, S., Hickman, A. B., Varani, A. M., Siguier, P., Chandler, M., Dekker, J. P., et al. (2015). Insertion sequence IS26 reorganizes plasmids in clinically isolated multidrug-resistant bacteria by replicative transposition. MBio 6 (3), e00762–e00715. doi: 10.1128/mBio.00762-15
Helgason, E., Økstad, O. A., Caugant, D. A., Johansen, H. A., Fouet, A., Mock, M., et al. (2000). Bacillus anthracis, Bacillus cereus, and Bacillus thuringiensis–one species on the basis of genetic evidence. Appl. Environ. Microbiol. 66 (6), 2627–2630. doi: 10.1128/AEM.66.6.2627-2630.2000
Höfte, H., Seurinck, J., Van Houtven, A., Vaeck, M. (1987). Nucleotide sequence of a gene encoding an insecticidal protein of Bacillus thuringiensis var. tenebrionis toxic against coleoptera. Nucleic Acids Res. 15 (17), 7183. doi: 10.1093/nar/15.17.7183
Höfte, H., Whiteley, H. R. (1989). Insecticidal crystal proteins of Bacillus thuringiensis. Microbiological Rev. 53 (2), 242–255. doi: 10.1128/mr.53.2.242-255.1989
Ibrahim, M. A., Griko, N., Junker, M., Bulla, L. A. (2010). Bacillus thuringiensis: A genomics and proteomics perspective. Bioengineered Bugs 1 (1), 31–50. doi: 10.4161/bbug.1.1.10519
Jia, B., Raphenya, A. R., Alcock, B., Waglechner, N., Guo, P., Tsang, K. K., et al. (2016). CARD 2017: Expansion and model-centric curation of the comprehensive antibiotic resistance database. Nucleic Acids Res. 45 (D1), D566–D573. doi: 10.1093/nar/gkw1004
Johler, S., Kalbhenn, E. M., Heini, N., Brodmann, P., Gautsch, S., Bağcioğlu, M., et al. (2018). Enterotoxin production of Bacillus thuringiensis isolates from biopesticides, foods, and outbreaks. Front. Microbiol. 9. doi: 10.3389/fmicb.2018.01915
Jouzani, G. S., Valijanian, E., Sharafi, R. (2017). Bacillus thuringiensis: A successful insecticide with new environmental features and tidings. Appl. Microbiol. Biotechnol. 101 (7), 2691–2711. doi: 10.1007/s00253-017-8175-y
Kim, C. W., Cho, S. H., Kang, S. H., Park, Y. B., Yoon, M. H., Lee, J. B., et al. (2015). Prevalence, genetic diversity, and antibiotic resistance of Bacillus cereus isolated from Korean fermented soybean products. J. Food Sci. 80 (1), M123–M128. doi: 10.1111/1750-3841.12720
Kreft, Ł., Botzki, A., Coppens, F., Vandepoele, K., Van Bel, M. (2017). PhyD3: A phylogenetic tree viewer with extended phyloXML support for functional genomics data visualization. Bioinformatics 33 (18), 2946–2947. doi: 10.1093/bioinformatics/btx324
Krieg, A.v., Huger, A., Langenbruch, G., Schnetter, W. (1983). Bacillus thuringiensis var. tenebrionis: ein neuer, gegenüber larven von coleopteren wirksamer pathotyp. Z. Für Angewandte Entomologie 96 (1-5), 500–508. doi: 10.1111/j.1439-0418.1983.tb03704.x
Krieg, A., Schnetter, W., Huger, A., Langenbruch, G. (1987). Bacillus thuringiensis subsp. tenebrionis, strain BI 256–82: A third pathotype within the h-serotype 8a8b. Systematic Appl. Microbiol. 9 (1-2), 138–141. doi: 10.1016/S0723-2020(87)80068-1
Kronstad, J., Schnepf, H., Whiteley, H. (1983). Diversity of locations for Bacillus thuringiensis crystal protein genes. J. Bacteriology 154 (1), 419–428. doi: 10.1128/jb.154.1.419-428.1983
Krueger, F. (2015). Trim galore. a wrapper tool around cutadapt and FastQC to consistently apply quality and adapter trimming to FastQ files. Babraham Bioinformatics. 516, 517. Available at: https://github.com/FelixKrueger/TrimGalore.
Kyei-Poku, G., Gauthier, D., Pang, A., Van Frankenhuyzen, K. (2007). Detection of Bacillus cereus virulence factors in commercial products of Bacillus thuringiensis and expression of diarrheal enterotoxins in a target insect. Can. J. Microbiol. 53 (12), 1283–1290. doi: 10.1139/W07-106
Lefort, V., Desper, R., Gascuel, O. (2015). FastME 2.0: A comprehensive, accurate, and fast distance-based phylogeny inference program. Mol. Biol. Evol. 32 (10), 2798–2800. doi: 10.1093/molbev/msv150
Li, H. (2013). Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv preprint arXiv 1303, 3997. doi: 10.48550/arXiv.1303.3997
Li, H., Durbin, R. (2009). Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics 25 (14), 1754–1760. doi: 10.1093/bioinformatics/btp324
Liu, Y., Lai, Q., Göker, M., Meier-Kolthoff, J. P., Wang, M., Sun, Y., et al. (2015). Genomic insights into the taxonomic status of the Bacillus cereus group. Sci. Rep. 5 (1), 1–11. doi: 10.1038/srep14082
Luna, V. A., King, D. S., Gulledge, J., Cannons, A. C., Amuso, P. T., Cattani, J. (2007). Susceptibility of Bacillus anthracis, Bacillus cereus, Bacillus mycoides, Bacillus pseudomycoides and Bacillus thuringiensis to 24 antimicrobials using sensititre® automated microbroth dilution and etest® agar gradient diffusion methods. J. Antimicrobial Chemotherapy 60 (3), 555–567. doi: 10.1093/jac/dkm213
Mahillon, J., Rezsöhazy, R., Hallet, B., Delcour, J. (1994). IS231 and other Bacillus thuringiensis transposable elements: A review. Genetica 93 (1), 13–26. doi: 10.1007/BF01435236
Meier-Kolthoff, J. P., Auch, A. F., Klenk, H.-P., Göker, M. (2013). Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinf. 14 (1), 1–14. doi: 10.1186/1471-2105-14-60
Meier-Kolthoff, J. P., Carbasse, J. S., Peinado-Olarte, R. L., Göker, M. (2022). TYGS and LPSN: A database tandem for fast and reliable genome-based classification and nomenclature of prokaryotes. Nucleic Acids Res. 50 (D1), D801–D807. doi: 10.1093/nar/gkab902
Meier-Kolthoff, J. P., Göker, M. (2019). TYGS is an automated high-throughput platform for state-of-the-art genome-based taxonomy. Nat. Commun. 10 (1), 1–10. doi: 10.1038/s41467-019-10210-3
Menou, G., Mahillon, J., Lecadet, M., Lereclus, D. (1990). Structural and genetic organization of IS232, a new insertion sequence of Bacillus thuringiensis. J. Bacteriology 172 (12), 6689–6696. doi: 10.1128/jb.172.12.6689-6696.1990
Messelhäußer, U., Ehling-Schulz, M. (2018). Bacillus cereus–a multifaceted opportunistic pathogen. Curr. Clin. Microbiol. Rep. 5 (2), 120–125. doi: 10.1007/s40588-018-0095-9
Mills, E., Sullivan, E., Kovac, J. (2022). Comparative analysis of Bacillus cereus group isolates' resistance using disk diffusion and broth microdilution and the correlation between antimicrobial resistance phenotypes and genotypes. Appl. Environ. Microbiol. 88 (6), e02302–e02321. doi: 10.1128/aem.02302-21
Murawska, E., Fiedoruk, K., Swiecicka, I. (2014). Modular genetic architecture of the toxigenic plasmid pIS56-63 harboring cry1Ab21 in Bacillus thuringiensis subsp. thuringiensis strain IS5056. Polish J. Microbiol. 63 (2), 147. Available at: https://pdfs.semanticscholar.org/5ae6/f1f8188e4ef113ab287f3fac55ad6f1709fb.pdf.
Ngamwongsatit, P., Buasri, W., Pianariyanon, P., Pulsrikarn, C., Ohba, M., Assavanig, A., et al. (2008). Broad distribution of enterotoxin genes (hblCDA, nheABC, cytK, and entFM) among Bacillus thuringiensis and Bacillus cereus as shown by novel primers. Int. J. Food Microbiol. 121 (3), 352–356. doi: 10.1016/j.ijfoodmicro.2007.11.013
Orlek, A., Stoesser, N., Anjum, M. F., Doumith, M., Ellington, M. J., Peto, T., et al. (2017). Plasmid classification in an era of whole-genome sequencing: Application in studies of antibiotic resistance epidemiology. Front. Microbiol. 8, 182. doi: 10.3389/fmicb.2017.00182
Page, A. J., Cummins, C. A., Hunt, M., Wong, V. K., Reuter, S., Holden, M. T., et al. (2015). Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics 31 (22), 3691–3693. doi: 10.1093/bioinformatics/btv421
Park, Y.-B., Kim, J.-B., Shin, S.-W., Kim, J.-C., Cho, S.-H., Lee, B.-K., et al. (2009). Prevalence, genetic diversity, and antibiotic susceptibility of Bacillus cereus strains isolated from rice and cereals collected in Korea. J. Food Prot. 72 (3), 612–617. doi: 10.4315/0362-028X-72.3.612
Parulekar, R. S., Sonawane, K. D. (2018). Insights into the antibiotic resistance and inhibition mechanism of aminoglycoside phosphotransferase from bacillus cereus: In silico and in vitro perspective. J. Cell. Biochem. 119 (11), 9444–9461. doi: 10.1002/jcb.27261
Patel, R., Uhl, J. R., Kohner, P., Hopkins, M. K., Cockerill, 3. F. (1997). Multiplex PCR detection of vanA, vanB, vanC-1, and vanC-2/3 genes in enterococci. J. Clin. Microbiol. 35 (3), 703–707. doi: 10.1128/jcm.35.3.703-707.1997
Prüß, B. M., Dietrich, R., Nibler, B., Maörtlbauer, E., Scherer, S. (1999). The hemolytic enterotoxin HBL is broadly distributed among species of the bacillus cereus group. Appl. Environ. Microbiol. 65 (12), 5436–5442. doi: 10.1128/AEM.65.12.5436-5442.1999
Rajkovic, A., Uyttendaele, M., Vermeulen, A., Andjelkovic, M., Fitz-James, I., In ‘t Veld, P., et al. (2008). Heat resistance of Bacillus cereus emetic toxin, cereulide. Lett. Appl. Microbiol. 46 (5), 536–541. doi: 10.1111/j.1472-765X.2008.02350.x
Raymond, B., Federici, B. A. (2017). In defence of Bacillus thuringiensis, the safest and most successful microbial insecticide available to humanity–a response to EFSA. FEMS Microbiol. Ecol. 93 (7), fix084. doi: 10.1093/femsec/fix084
Raymond, B., Federici, B. A. (2018). An appeal for a more evidence based approach to biopesticide safety in the EU. FEMS Microbiol. Ecol. 94 (1), fix169. doi: 10.1093/femsec/fix169
Richter, M., Rosselló-Móra, R., Oliver Glöckner, F., Peplies, J. (2016). JSpeciesWS: A web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 32 (6), 929–931. doi: 10.1093/bioinformatics/btv681
Schnepf, E., Crickmore, N., Van Rie, J., Lereclus, D., Baum, J., Feitelson, J., et al. (1998). Bacillus thuringiensis and its pesticidal crystal proteins. Microbiol. Mol. Biol. Rev. 62 (3), 775–806. doi: 10.1128/MMBR.62.3.775-806.1998
Schoeni, J., Wong, A. (2005). Bacillus cereus food poisoning and its toxins. J. Food Prot. 68 (3), 636–648. doi: 10.4315/0362-028X-68.3.636
Schwenk, V., Riegg, J., Lacroix, M., Märtlbauer, E., Jessberger, N. (2020). Enteropathogenic potential of Bacillus thuringiensis isolates from soil, animals, food and biopesticides. Foods 9 (10), 1484. doi: 10.3390/foods9101484
Seemann, T. (2014). Prokka: Rapid prokaryotic genome annotation. Bioinformatics 30 (14), 2068–2069. doi: 10.1093/bioinformatics/btu153
Seemann, T. (2018). ABRicate: mass screening of contigs for antimicrobial and virulence genes. Department Microbiol. Immunol. The University of Melbourne, Melbourne, Australia. Available online: https://github.com/tseemann/abricate (accessed on 13 June 2022).
Sekar, V., Thompson, D. V., Maroney, M. J., Bookland, R. G., Adang, M. J. (1987). Molecular cloning and characterization of the insecticidal crystal protein gene of Bacillus thuringiensis var. tenebrionis. Proc. Natl. Acad. Sci. 84 (20), 7036–7040. doi: 10.1073/pnas.84.20.7036
Sheppard, A. E., Stoesser, N., Wilson, D. J., Sebra, R., Kasarskis, A., Anson, L. W., et al. (2016). Nested Russian doll-like genetic mobility drives rapid dissemination of the carbapenem resistance gene bla KPC. Antimicrobial Agents Chemotherapy 60 (6), 3767–3778. doi: 10.1128/AAC.00464-16
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V., Zdobnov, E. M. (2015). BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31 (19), 3210–3212. doi: 10.1093/bioinformatics/btv351
Smits, T. H. (2019). The importance of genome sequence quality to microbial comparative genomics. BMC Genomics 20 (1), 1–4. doi: 10.1186/s12864-019-6014-5
Stenfors Arnesen, L. P., Fagerlund, A., Granum, P. E. (2008). From soil to gut: Bacillus cereus and its food poisoning toxins. FEMS Microbiol. Rev. 32 (4), 579–606. doi: 10.1111/j.1574-6976.2008.00112.x
Swiecicka, I., van der Auwera, G. A., Mahillon, J. (2006). Hemolytic and nonhemolytic enterotoxin genes are broadly distributed among Bacillus thuringiensis isolated from wild mammals. Microbial Ecol. 52 (3), 544–551. doi: 10.1007/s00248-006-9122-0
Thomas, C. M., Nielsen, K. M. (2005). Mechanisms of, and barriers to, horizontal gene transfer between bacteria. Nat. Rev. Microbiol. 3 (9), 711–721. doi: 10.1038/nrmicro1234
Tschäpe, H. (1994). The spread of plasmids as a function of bacterial adaptability. FEMS Microbiol. Ecol. 15 (1-2), 23–31. doi: 10.1111/j.1574-6941.1994.tb00226.x
Turnbull, P., Kramer, J., Jørgensen, K., Gilbert, R., Melling, J. (1979). Properties and production characteristics of vomiting, diarrheal, and necrotizing toxins of Bacillus cereus. Am. J. Clin. Nutr. 32 (1), 219–228. doi: 10.1093/ajcn/32.1.219
Walker, B. J., Abeel, T., Shea, T., Priest, M., Abouelliel, A., Sakthikumar, S., et al. (2014). Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PloS One 9 (11), e112963. doi: 10.1371/journal.pone.0112963
Wick, R. R., Judd, L. M., Gorrie, C. L., Holt, K. E. (2017a). Completing bacterial genome assemblies with multiplex MinION sequencing. Microbial Genomics 3 (10), e000132. doi: 10.1099/mgen.0.000132
Wick, R. R., Judd, L. M., Gorrie, C. L., Holt, K. E. (2017b). Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PloS Comput. Biol. 13 (6), e1005595. doi: 10.1371/journal.pcbi.1005595
Wick, R. R., Schultz, M. B., Zobel, J., Holt, K. E. (2015). Bandage: Interactive visualization of de novo genome assemblies. Bioinformatics 31 (20), 3350–3352. doi: 10.1093/bioinformatics/btv383
Yim, J. H., Kim, K. Y., Chon, J. W., Kim, D. H., Kim, H. S., Choi, D. S., et al. (2015). Incidence, antibiotic susceptibility, and toxin profiles of Bacillus cereus sensu lato isolated from Korean fermented soybean products. J. Food Sci. 80 (6), M1266–M1270. doi: 10.1111/1750-3841.12872
Keywords: Bacillus thuringiensis subsp. tenebrionis, Bacillus cereus group, whole genome sequencing, nanopore, enterotoxin, cry3Aa
Citation: Schäfer L, Volk F, Kleespies RG, Jehle JA and Wennmann JT (2023) Elucidating the genomic history of commercially used Bacillus thuringiensis subsp. tenebrionis strain NB176. Front. Cell. Infect. Microbiol. 13:1129177. doi: 10.3389/fcimb.2023.1129177
Received: 21 December 2022; Accepted: 17 February 2023;
Published: 20 March 2023.
Edited by:
Marcelo Berretta, Instituto Nacional de Tecnología Agropecuaria, ArgentinaReviewed by:
Ruchir Mishra, University of Florida, United StatesSathesh K. Sivasankaran, University of Missouri, United States
Copyright © 2023 Schäfer, Volk, Kleespies, Jehle and Wennmann. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jörg T. Wennmann, joerg.wennmann@julius-kuehn.de