ORIGINAL RESEARCH article

Front. Microbiol., 04 January 2023

Sec. Physiology and Metabolism of Microorganisms

Volume 13 - 2022 | https://doi.org/10.3389/fmicb.2022.1031023

Phenotypic analysis and genome sequence of Rhizopus oryzae strain Y5, the causal agent of tobacco pole rot

  • ZL

    Zhen Li 1,2

  • CS

    Cai-hua Shi 1,3*

  • YH

    Yang Huang 4

  • HW

    Han-cheng Wang 2*

  • WL

    Wen-hong Li 5

  • LC

    Liu-ti Cai 2

  • 1. MARA Key Laboratory of Sustainable Crop Production in the Middle Reaches of the Yangtze River (Co-construction by Ministry and Province), College of Agriculture, Yangtze University, Jingzhou, China

  • 2. Guizhou Provincial Academician Workstation of Microbiology and Health, Guizhou Academy of Tobacco Science, Guiyang, Guizhou, China

  • 3. School of Food Science and Technology & School of Chemical Engineering, Hubei University of Arts and Science, Xiangyang, China

  • 4. China Tobacco Sichuan Industrial Corporation Technical Centre, Chengdu, China

  • 5. Guizhou Institute of Plant Protection, Guizhou Academy of Agricultural Sciences, Guiyang, Guizhou, China

Abstract

Rhizopus oryzae is a destructive pathogen that frequently causes tobacco pole rot in curing chambers. Phenotypic characterization of the pathogen was conducted to provide basic biological and pathological information using Biolog Phenotype MicroArray (PM). In addition, the Y5 strain of R. oryzae was sequenced using Illumina HiSeq and Pacific Biosciences (PacBio) technologies. Using PM plates 1–8, 758 growth conditions were tested. Results indicated that R. oryzae could metabolize 54.21% of tested carbon sources, 86.84% of nitrogen sources, 100% of sulfur sources, and 98.31% of phosphorus sources. About 37 carbon compounds, including D-xylose, N-acetyl-D-glucosamine, D-sorbitol, β-methyl-D-glucoside, D-galactose, L-arabinose, and D-cellobiose, significantly supported the growth of the pathogen. PM 3 indicated the active nitrogen sources, including Gly-Asn, Ala-Asp., Ala-Gln, and uric acid. PM 6–8 showed 285 different nitrogen pathways, indicating that different combinations of different amino acids support the growth of the pathogen. Genome sequencing results showed that the R. oryzae Y5 strain had raw data assembled into 2,271 Mbp with an N50 value of 10,563 bp. A genome sequence of 50.3 Mb was polished and assembled into 53 contigs with an N50 length of 1,785,794 bp, maximum contig length of 3,223,184 bp, and a sum of contig lengths of 51,182,778 bp. A total of 12,680 protein-coding genes were predicted using the Nonredundant, Gene Ontology, Clusters of Orthologous Groups, Kyoto Encyclopedia of Genes and Genomes, and SWISS-PROT databases. The genome sequence and annotation resources of R. oryzae provided a reference for studying its biological characteristics, trait-specific genes, pathogen-host interaction, pathogen evolution, and population genetic diversity. The phenomics and genome of R. oryzae will provide insights into microfungal biology, pathogen evolution, and the genetic diversity of epidemics.

Introduction

Tobacco (Nicotiana tabacum L.) is a leafy, annual, solanaceous plant grown commercially for its leaves (Chen et al., 2020). It is one of the most widely grown commercial non-food crops in the world (Liu et al., 2017). Tobacco pole rot has been described as a postharvest pathogen (Chen et al., 2020) as it frequently infects tobacco leaves when that temperature was above 35°C, which occurs in the curing chamber, and had not been shown to infect tobacco leaves in the field. Yearly losses due to tobacco pole rot are immense. In the last 5 years, it became a potentially serious disease of flue-cured tobacco that led to the complete loss of the harvest in southwest China, especially in Guizhou, the second largest tobacco production province. Under high humidity and warm temperatures during curing, leaf rot can damage the whole leaf within the first 2 days in the curing chamber. The first symptoms are white fuzzy spots on the petiole and a watery brown soft rot. Afterward, dark fruiting bodies are formed. These fruiting bodies are filled with spores easily released by the wind (Kortekamp et al., 2003). It frequently occurs during the curing stage, and the disease incidence rate can reach 100% (Surhone et al., 2010).

The fungus of Rhizopus oryzae is widely studied, it commonly used for production of some oriental traditional foods, it is mainly recognized as a good producer of lactic acid (Londoño-Hernández et al., 2017). Meanwhile, the pathogen is also the primary cause of mucormycosis, an emerging, life-threatening infection characterized by rapid angioinvasive growth with an overall mortality rate that exceeds 50% (Ma et al., 2009). R. oryzae, the causative agent of tobacco pole rot, has also received much attention in recent years. Many studies on the biological characteristics of R. oryzae have been conducted. It has a wide range of temperature adaptability, ranging from 25–45°C, with an optimum temperature of 38°C (Chen et al., 2021). Early studies showed that R. oryzae was a heat-resistant pathogenic fungus, exhibiting higher growth rates at 25–37°C, and much lower rates were observed at temperatures higher than 40°C (Gayed et al., 1972). Previous studies showed that tobacco pole rot usually occurred during the flue-cured stage, and the termination temperature was about 45°C (Su et al., 2018; Zhang et al., 2018). The pathogen mainly infected petioles and leaves, the pathogenicity varied among different tissues, and the petiole was more conducive to disease (Chen et al., 2021). Carbons such as ribitol, D-arabitol, and ß-cyclodextrin (Wang et al., 2018a), as well as pH (Chen et al., 2021) affect the growth of R. oryzae. However, the metabolic basis for the absence of host specificity by R. oryzae is unknown. This includes the absorption and utilization of carbon, nitrogen, phosphorus, and sulfur and whether there is a significant difference in R. oryzae infection of tobacco under adverse conditions. Knowing the metabolic phenotype of R. oryzae will be valuable to understanding its biochemical properties. It may also help develop potential measures to decrease the overall effect of tobacco pole rot.

Genome sequencing is an important tool for studying the pathogenicity mechanism of plant pathogens. Genomic data are a useful resource to understand the mechanism of plant-pathogen interaction and are used in the phylogenetic analyses of the species (Ailloud et al., 2015). Beyond the isolate of R. oryzae from tobacco, many strains of R. oryzae were isolated from sweet potato, mulberry, lily (Holmes et al., 2002) and the human body (Nguyen et al., 2020). Information obtained from the National Center for Biotechnology Information (NCBI) could be learned that a total of 43 strains of R. oryzae had been sequenced, and there were less differences of genomic data with different R. oryzae strains. The genome size ranged from 37.46–55.79 Mb except for strain GL39, with a size of 72.36 Mb. The GC content of most strains was 34%. However, R. oryzae has a wide range of hosts. Therefore, more genomic sequences are required to analyze the entire species, especially those isolates from tobacco. To better understand the functions of pathogenicity determinants and the traits of aggressiveness of R. oryzae under different ecological environments, the entire genome of the pathogen isolated from tobacco must be sequenced.

Therefore, the objectives of this study were to (i) characterize the metabolic phenotype of R. oryzae and (ii) sequence the genome of R. oryzae. The genome combined with the metabolic phenotype of the pathogen could provide a reference for the study of fungal biological characteristics, trait-specific genes determination, pathogen-host interaction, pathogen evolution, and population genetic diversity. The data provided by this study will be valuable in expanding our knowledge of the biochemical and metabolic phenomics of R. oryzae. It would aid in developing more effective control measures for tobacco pole rot.

Materials and methods

Fungal strain and culture conditions

One isolate of R. oryzae strain Y5 (Wang et al., 2016) (GenBank Accession Nos. KT390187) from tobacco was chosen randomly among the isolates of the pathogen (Chen et al., 2020) for analysis from the laboratory of Guizhou Academy of Tobacco Science. It was also conserved at the China Typical Microorganism Conservation Center with conservation No. CCTCC M2015720. The isolate was maintained on potato dextrose agar (PDA), in a controlled climate cabinet at 25°C in darkness. After 5 days of incubation on PDA, conidia were produced (Figure 1), based on the observed colony attributes, the presence of pale brown sporangiospores (5 to 8 μm in diameter) with bluish stripes (Watanabe, 2002). Sterile cotton swabs were moistened with sterile FF Inoculating Fluid (FF-IF), spores were collected by rotating the cotton swabs on the surface of the colony, and then the swabs were mixed into 12 ml FF-IF inoculation solution. The suspension was filtered through a double-layer of sterile cheesecloth (Grade # 40: 24 × 20 threads per inch) to remove mycelial fragments, and the resulting conidia suspension was diluted to a final concentration of 1 × 105 spores mL−1.

Figure 1

Table 1

WellSubstrateWellSubstrateWellSubstrate
PM1
A02L-ArabinoseC07D-FructoseH06L-Lyxose
A03N-Acetyl-D-GlucosamineC09α-D-GlucoseH09L-Galactonic acid-γ-Lactone
A06D-GalactoseC10MaltoseH10D-Galacturonic acid
A08L-ProlineD08α-Methyl-D-Galactoside
A10D-TrehaloseE08β-Methyl-D-Glucoside
A11D-MannoseE09Adonitol
B02D-SorbitolE10Maltotriose
B08D-XyloseF11D-Cellobiose
C04D-RiboseG05L-Alanyl glycine
PM2
A05γ-CyclodextrinB04AmygdalinD08Xylitol
A06DextrinB05D-ArabinoseG04L-Arginine
A07GelatinB06D-ArabitolH01L-Ornithine
A08GlycogenB08ArbutinH09Dihydroxy acetone
A10LaminarinC01Gentiobiose
A12PectinD02Salicin

Substrates in PM 1–2 carbon source Micro plates significantly supported the growth of Rhizopus oryzae.

Phenotypic characterization

The metabolic capacity of R. oryzae was tested using the Phenotype MicroArray (PM) system (Biolog, Hayward, CA, United States) to determine its phenotype (Bochner et al., 2001; Zhou et al., 2003; Von Eiff et al., 2006). The PM system involved 758 different growth conditions, including 190 diverse carbon sources (PM 1–2), 95 nitrogen sources (PM 3), 285 nitrogen pathways (PM 6–8), 59 phosphorus sources (PM 4), 35 sulfur sources (PM 4) and 94 biosynthetic pathways (PM 5). All materials, media, and reagents for the PM system were purchased from Biolog corporation. In total, 8 PM plates were used in this study. Carbon, nitrogen, phosphorus, sulfur, and biosynthetic pathways were tested for catabolic pathways in Plates 1–8. R. oryzae conidia suspension was prepared as detailed above and suspended in a suitable medium containing sterile FF-IF. The turbidity of conidial suspension was tested and was modified to a density of 62% T (transmittance). FF-IF was used for PM plates 1 and 2, FF-IF plus 100 mM D-glucose, 5 mM potassium phosphate (pH 6.0), and 2 mM sodium sulfate was used for PM plates 3 and 5–8. FF-IF plus 100 mM D-glucose was used for PM plate 4 (Wang M. S. et al., 2015). Plates containing 100 μl of the specified mixture were incubated in an OmniLog at 28°C for 1 week, and readings were taken every 15 min. Incubation and recording of phenotypic data were performed in the OmniLog station by capturing digital images of microarrays and storing turbidity values. Kinetic and Parametric software (Biolog, Hayward, CA, United States) was used to analyze the data. The phenotype was estimated according to the area of each well under the staining formation kinetics curve. The experiment was repeated twice.

Phylogenetic analyses of the sequences of genomic strains of R. oryzae

Reference sequences (Supplementary Table S1) from Nguyen et al. (2020) were downloaded from GenBank, the evolutionary history was inferred using the Neighbor-Joining method (Saitou and Nei, 1987), evolutionary analyses were conducted in MEGA7 (Kumar et al., 2016).

Whole genome sequencing of R. oryzae strain of Y5

Genome sequencing

Extraction of genome DNA

Genomic DNA was extracted with the SDS method (Lim et al., 2016). The harvested DNA was detected by the agarose gel electrophoresis and quantified by Qubit® 2.0 Fluorometer (Thermo Scientific).

Library construction

Illumina NovaSeq platform

A total amount of 1 μg DNA per sample was used as input material for the DNA sample preparations. Sequencing libraries were generated using NEBNext® Ultra DNA Library Prep Kit for Illumina (NEB, United States) following manufacturer’s recommendations and index codes were added to attribute sequences to each sample. Briefly, the DNA sample was fragmented by sonication to a size of 350 bp, then DNA fragments were end-polished, A-tailed, and ligated with the full-length adaptor for Illumina sequencing with further PCR amplification. At last, PCR products were purified (AMPure XP system) and libraries were analyzed for size distribution by Agilent2100 Bioanalyzer and quantified using real-time PCR.

PacBio sequel platform

Libraries for single-molecule real-time (SMRT) sequencing was constructed with an insert size of 20 kb using the SMRT bell TM Template kit, version 1.0. Briefly, the process was that fragment and concentrate DNA, repair DNA damage and ends, prepare blunt ligation reaction, purify SMRTbell Templates with 0.45X AMPure PB Beads, size-selection using the BluePippin System, repair DNA damage after size-selection. At last, the library quality was assessed on the Qubit® 2.0 Fluorometer (Thermo Scientific) and detected the insert fragment size by Agilent 2,100 (Agilent Technologies).

Sequencing

The whole genome of strain Y5 was sequenced using PacBio Sequel platform and Illumina NovaSeq PE150 at the Beijing Novogene Bioinformatics Technology Co., Ltd. The Illumina reads was only used to polish the assembly generated by PacBio reads and helped to reduce gaps and merge contigs.

Genome assembly

Preliminary assembly with SMRT Link v5.0.1 (Ardui et al., 2018; Reiner et al., 2018).

In order to ensure the accuracy of the subsequent analysis results, the low-quality reads were filtered (less than 500 bp) to obtain clean data. Using the automatic error correction function of SMRT portal, the long reads were selected (more than 6,000 bp) as the seed sequence, and the other shorter reads were aligned to the seed sequence by Blasr, so that the accuracy of the seed sequence could be improved further. After assembling we got an initial result.

Correct the results of the preliminary assembly

By the variant Caller module of the SMRT Link software, the arrow algorithm was used to correct and count the variant sites in the preliminary assembly results.

Genome component prediction

Genome component prediction included the prediction of the coding gene, repetitive sequences and non-coding RNA. The available steps were proceeded as follows:

For Fungi, by default, the Augustus (Stanke et al., 2008) 2.7 program to retrieve the related coding gene was used. Homology reference gene sequences and transcript sequencing data were provided, a complete annotation pipeline, PASA, as implemented at the Broad Institute, involves the following steps: (A)ab initio gene finding using a selection of the following software tools: GeneMarkHMM, FGENESH, Augustus, and SNAP, GlimmerHMM. (B) protein homology detection and intron resolution using the GeneWise software and the uniref90 non-redundant protein database. (C) alignment of known ESTs, full-length cDNAs, and most recently, Trinity RNA-Seq assemblies to the genome. (D) PASA alignment assemblies based on overlapping transcript alignments from step (C). (E) use of EVidenceModeler (EVM) to compute weighted consensus gene structure annotations based on the above (A, B, C, D). (F) use of PASA to update the EVM consensus predictions, adding UTR annotations and models for alternatively spliced isoforms (leveraging D and E).

The interspersed repetitive sequences were predicted using the RepeatMasker (Saha et al., 2008).1 The tandem Repeats were analyzed by the TRF (Tandem repeats finder; Benson, 1999).

Transfer RNA (tRNA) genes were predicted by the tRNAscan-SE (Lowe and Eddy, 1997). Ribosome RNA (rRNA) genes were analyzed by the rRNAmmer (Lagesen et al., 2007). sRNA, snRNA and miRNA were predicted by BLAST against the Rfam database (Gardner et al., 2009; Nawrocki and Kolbe, 2009).

Gene function prediction

Seven databases were used to predict gene functions. They were respective GO (Gene Ontology; Ashburner et al., 2000), KEGG (Kyoto Encyclopedia of Genes and Genomes; Kanehisa et al., 2004, 2006), KOG (Clusters of Orthologous Groups), NR (Non-Redundant Protein Database databases; Li et al., 2002), TCDB (Transporter Classification Database; Milton et al., 2014), P450 (Crešnar and Petrič, 2011) and Swiss-Prot (Amos and Rolf, 2000). A whole genome Blast search (E-value less than 1e-5, minimal alignment length percentage larger than 40%) was performed against above seven databases. The secretory proteins were predicted by the Signal P database (Petersen et al., 2011). Meanwhile, we analyzed the secondary metabolism gene clusters by the antiSMASH (Medema et al., 2011). For pathogenic fungi, we added the pathogenicity and drug resistance analyses. We used the PHI (Martin et al., 2015; Pathogen Host Interactions), DFVF (database of fungal virulence factors) to perform the above analyses. Carbohydrate-Active enzymes were predicted by the Carbohydrate-Active enZYmes Database (Cantarel et al., 2009).

Results

Phenotypic characterization of R. oryzae strain Y5

Using the OmniLog PM system, a kinetic response curve which parallels microbial growth can be generated for each well, allowing growth to be compared between samples through multiple parameters such as lag, slope, and area under the curve (Figure 2). The isolate Y5 of R. oryzae tested presented a representative phenotypic fingerprint. The fungus was able to metabolize 54.21% of tested carbon sources (51/95 in plate PM1 and 52/95 in plate PM2), 86.84% of nitrogen sources (61/95 in plate PM3, 93/95 in plate PM6, 94/95 in plate PM7, and 82/95 in plate PM8), 100% of sulfur sources (35/35 in plate PM4, wells F02-H12) and 98.31% of phosphorus sources (58/59 in plate PM4, wells A02-E12). The efficient metabolism of carbon sources rate was 19.27%, and the opportune metabolism of carbon sources rate was 17.19%. The efficient metabolism of nitrogen sources rate was 53.91%, and the opportune metabolism of carbon sources rate was 27.34%. The efficient metabolism of sulfur sources rate was 97.14%. The efficient metabolism of phosphorus sources rate was 76.27%, and the opportune metabolism of carbon sources rate was 16.49%. The pathogen presented 94 different biosynthetic pathways (94/94 tested, plate PM5, wells A3-H12).

Figure 2

Table 2

WellSubstrateWellSubstrateWellSubstrate
PM3
A05UreaE11N-Acetyl-D-GlucosamineH05Ala-His
A07L-AlanineF03AdenosineH06Ala-Leu
A08L-ArginineF06GuanineH07Ala-Thr
A09L-AsparagineF07GuanosineH08Gly-Asn
A10L-Aspartic acidG01XanthineH09Gly-Gln
A12L-Glutamic acidG03Uric acidH10Gly-Glu
B01L-GlutamineG05AllantoinH11Gly-Met
B02GlycineG08γ-Amino-N-Butyric acidH12Met-Ala
B09L-ProlineH01Ala-Asp
C12L-OrnithineH02Ala-Gln
D11PutrescineH03Ala-Glu
D12AgmatineH04Ala-Gly
PM6
A02Positive control: L-GlutamineB12Arg-LysE03Gly-Arg
A03Ala-AlaC01Arg-MetE07Gly-Leu
A04Ala-ArgC02Arg-PheE12Gly-Ser
A05Ala-AsnC03Arg-SerF01Gly-Thr
A06Ala-GluC04Arg-TrpF02Gly-Trp
A07Ala-GlyC05Arg-TyrF04Gly-Val
A08Ala-HisC06Arg-ValF12His-Trp
A09Ala-LeuC07Asn-GluG04Ile-Arg
A10Ala-LeuC08Asn-ValG05Ile-Gln
A11Ala-PheC11Asp-LeuG09Ile-Met
A12Ala-ProC12Asp-LysG12Ile-Ser
B01Ala-SerD01Asp-PheH01lle-Trp
B02Ala-ThrD02Asp-TrpH02Ile-Tyr
B03Ala-TrpD03Asp-ValH04Leu-Ala
B05Arg-AlaD04Cys-GlyH05Leu-Arg
B06Arg-ArgD05Gln-GlnH06Leu-Asp
B07Arg-AspD06Gln-GlyH07Leu-Glu
B08Arg-GlnD09Glu-GlyH08Leu-Gly
B09Arg-GluD11Glu-TrpH09Leu-Ile
B10Arg-IleE01Glu-ValH11Leu-Met
B11Arg-LeuE02Gly-AlaH12Leu-Phe
PM7
A02Positive control:L-GlutamineE04Ser-LeuH03Val-Arg
A04Leu-TrpE07Ser-ProH04Val-Asn
A05Leu-ValE10Ser-ValH05Val-Gly
A06Lys-AlaE11Thr-AlaH06Val-Gly
A07Lys-ArgE12Thr-ArgH07Val-His
A09Lys-IleF01Thr-GluH08Val-Ile
A10Lys-LeuF02Thr-GlyH09Val-Leu
A12Lys-PheF03Thr-LeuH10Val-Tyr
B07Met-ArgF06Trp-AlaH11Val-Val
B09Met-GlnF07Trp-Arg
B12Met-HisF08Trp-Asp
C01Met-IleF10Trp-Gly
C02Met-LeuF11Trp-Leu
C08Met-ProG02Trp-Ser
C11Phe-IleG06Tyr-Gln
D04Pro-AlaG10Tyr-Leu
D06Pro-GlnG11Tyr-Lys
D09Pro-LeuG12Tyr-Phe
D12Pro-TyrH01Tyr-Trp
E01Ser-AlaH02Tyr-Tyr
PM8
A02Positive Control:L-GlutamineC01Lys-GlyE04Tyr-lle
A03Ala-AspC03Met-PhrE06Val-Ala
A04Ala-GlnC04Met-TyrE07Val-Gln
A05Ala-lleC05Phe-AspE08Val-Glu
A06Ala-MetC07Gln-GluE09Val-Lys
A07Ala-ValC08Phe-MetE10Val-Met
A08Asp-AlaC10Phe-ValE11Val-Phe
A09Asp-GlnC11Pro-ArgE12Val-Pro
A10Ala-GlyC12Pro-AsnF01Val-Ser
A11Glu-AlaD01Pro-GluG11Ala-Ala-Ala
A12Gly-AsnD02Pro-lleH01Gly-Gly-Ala
B01Gly-AspD04Pro-SerH03Gly-Gly-Gly
B02Gly-lleD06Pro-ValH04Gly-Gly-IIe
B03His-AlaD07Ser-AsnH05Gly-Gly-Leu
B05His-HisD08Ser-AspH06Gly-Gly-Phe
B06lle-AsnD09Ser-GlnH07Val-Tyr-Val
B07Ile-LeuD10Ser-GluH08Gly-Phe-Phe
B08Leu-AsnD11Thr-AspH09Leu-Gly-Gly
B10Leu-ProD12Thr-GlnH10Leu-Leu-Leu
B11Leu-TyrE01Thr-PheH11Phe-Gly-Gly
B12Lys-AspE03Trp-ValH12Tyr-Gly-Gly

Substrates in PM 3, 6–8 nitrogen source Micro plates significantly supported the growth of Rhizopus oryzae.

Carbon source utilization characteristics of strain Y5

Based on data from PM1 and PM2 (carbon sources), the isolate of R. oryzae from tobacco could use 103 different carbon sources and about 37 compounds (Table 1), including D-xylose, N-acetyl-D-glucosamine, D-sorbitol, β-methyl-D-glucoside, D-galactose, L-arabinose, D-cellobiose, D-Mannose, D-Galacturonic acid, maltotriose, D-trehalose, L-alanyl glycine, D-fructose, maltose, α-D-glucose, L-proline, D-ribose, adonitol, L-lyxose, L-galactonic acid-γ-lactone, α-methyl-D-galactoside, γ-cyclodextrin, pectin, dextrin, laminarin, gentiobiose, salicin, D-arabinose, D-arabitol, dihydroxy acetone, arbutin, glycogen, L-ornithine, xylitol, amygdalin, gelatin, and L-arginine significantly supported the growth of the pathogen. In comparison, around 87 compounds significantly inhibited the growth of the pathogen. Therefore, the utilization rate of carbon was lower than that of other sources.

Nitrogen sources utilization characteristics of strain Y5

Based on data from the PM3 plate, the isolate was tested for its ability to grow on 95 different nitrogen sources (amino acids). Sixty one compounds supported the growth of the pathogen, typical compounds included Gly-Asn, Ala-Asp., Ala-Gln, uric acid, Ala-Gly, Gly-Gln, Met-Ala, Gly-Glu, L-arginine, L-glutamine, L-ornithine, Ala-Glu, L-asparagine, Ala-Thr, Ala-Leu, allantoin, Gly-Met, agmatine, urea, guanine, Ala-His, guanosine, putrescine, xanthine, L-glutamic acid, L-aspartic acid, adenosine, L-proline, L-alanine, N-acetyl-D-glucosamine, glycine, and γ-amino-N-butyric acid. In comparison, 34 out of 95 nitrogen sources supported growth in the negative control, indicating that R. oryzae cannot metabolize these compounds. Based on PM6 to PM8 (nitrogen pathway) data, R. oryzae showed 285 different nitrogen pathways, indicating that different combinations of different amino acids support the growth of the pathogen. The result showed that 260 nitrogen pathways supported the growth of the pathogen, and more than 175 efficient nitrogen pathways, including L-glutamine, Arg-Arg, Ala-Arg, Arg-Lys, Ala-Asn, Arg-Met, Arg-Gln, Arg-Ala, Arg-Ile, Arg-Val, Arg-Asp., Ile-Arg, Arg-Glu, Leu-Phe, Arg-Leu, Arg-Ser, Ala-Pro, Arg-Trp, Leu-Arg, Ile-Gln, Gly-Arg, Asn-Glu, Asn-Val, Arg-Phe, Arg-Tyr, Ala-Gly, Ile-Ser, Ala-Trp, Asp-Lys, Ala-Ala, and lle-Trp. In comparison, around 25 compounds significantly inhibited the growth of the pathogen (Table 2). Therefore, the utilization rate of nitrogen sources was higher than that of carbon sources.

Phosphorus and sulfur sources utilization characteristics of strain Y5

The pathogen presented unapparent growth in the negative control without any phosphorus source (plate PM4, Well A01). Meanwhile, the pathogen could assimilate all S-containing compounds tested (35/35 tested, plate PM4, Wells F1-H12) (Table 3). Typical compounds included tetramethylene sulfone, butane sulfonic acid, L-methionine sulfone, hypotaurine, methane sulfonic acid, thiourea, L-cysteine sulfinic acid, D, L-lipoamide, L-cysteic acid, ρ-amino benzene sulfonic acid, 1-thio-β-D-glucose, glycyl-L-methionine, taurine, L-cysteinylglycine, 2-hydroxyethane sulfonic acid, cysteamine, thiosulfate, and cystathionine. The pathogen presented 94 different biosynthetic pathways (94/94 tested, plate PM5, wells A3-H12). Typical biosynthetic pathways included Tween 80, caprylic acid, myo-inositol, Tween 20, 2′-deoxy inosine, D, L-carnitine, 2′-deoxy adenosine, 2′-deoxy uridine, D, L-α-lipoic acid (oxidized form), Tween 60, thymidine, butyric acid, and L-glutamine (Table 4).

Phylogenetic analyses

The optimal tree with the sum of branch length = 31.57 was shown. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1,000 replicates) were shown next to the branches (Felsenstein, 1985). The tree was drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the Maximum Composite Likelihood method (Tamura et al., 2004) and were in the units of the number of base substitutions per site. The analysis involved 37 nucleotide sequences. Codon positions included were 1st + 2nd + 3rd + Noncoding. All positions containing gaps and missing data were eliminated. There was a total of 276 positions in the final dataset (Supplementary Figure S1).

Whole genome sequencing and statistical analysis

Whole genome sequencing was performed using single-molecule real-time (SMRT) sequencing on the PacBio RS II (Eid et al., 2009) and HiSeq PE150 platforms. PacBio RS II platform yielded 283,154 reads encompassing 2,129,349,468 bp, with an N50 value of 10,563 bp, using SMRT Link v5.1.0 software2 (Ardui et al., 2018; Reiner et al., 2018) for genome assembly to obtain the preliminary result which can reflect the basic condition of the sample genome. A genome sequence of 50.3 Mb was polished and assembled into 53 contigs with an N50 length of 1,785,794 bp, maximum contig length of 3,223,184 bp, and a total contig length of 51,182,778 bp. After the quality control, the number of contigs was 41 with an N50 length of 1,791,927 bp, maximum contig length of 3,235,075 bp, and total length of 50,257,186 bp (~45× genome average coverage; Table 5), and GC content of R. oryzae genome was found to be 35.57% which is similar to other forty-three sequenced R. oryzae (Figure 3; Supplementary Table S1).

Table 3

WellSubstrateWellSubstrateWellSubstrate
PM4
A02PhosphateD02D-Mannose-6-PhosphateG02S-Methyl-L-Cysteine
A03PyrophosphateD03Cysteamine-S-PhosphateG03Cystathionine
A04Trimeta-PhosphateD04Phospho-L-ArginineG04Lanthionine
A08Adenosine-2’-MonophosphateD05O-Phospho-D-SerineG05Glutathione
A09Adenosine-3’-MonophosphateD06O-Phospho-L-SerineG08D-Methionine
A10Adenosine-5’-MonophosphateD07O-Phospho-L-ThreonineG09Glycyl-L-Methionine
A11Adenosine-2′,3’-Cyclic monophosphateD08Uridine-2’-MonophosphateG10N-Acetyl-D,L-Methionine
A12Adenosine-3′,5’-Cyclic monophosphateD09Uridine-3’-MonophosphateG11L-Methionine sulfoxide
B02DithiophosphateD10Uridine-5’-MonophosphateG12L-Methionine sulfone
B03D,L-α-Glycerol phosphateD11Uridine-2′,3’-Cyclic monophosphateH01L-Djenkolic acid
B04β-Glycerol phosphateD12Uridine-3′,5’-Cyclic monophosphateH02Thiourea
B05Carbamyl phosphateE01O-Phospho-D-TyrosineH031-Thio-β-D-Glucose
B06D-2-Phospho-Glyceric acidE02O-Phospho-L-TyrosineH04D,L-Lipoamide
B07D-3-Phospho-Glyceric acidE03PhosphocreatineH05Taurocholic acid
B08Guanosine-2’-MonophosphateE04Phosphoryl cholineH06Taurine
B09Guanosine-3’-MonophosphateE05O-Phosphoryl-EthanolaminH07Hypotaurine
B10Guanosine-5’-MonophosphateE09Thymidine-3’-MonophosphateH08p-Amino benzene sulfonic acid
B11Guanosine-2′,3’-Cyclic monophosphateE10Thymidine-5’-MonophosphateH09Butane sulfonic acid
B12Guanosine-3′,5’-Cyclic monophosphateF01Negative controlH102-Hydroxyethane sulfonicacid
C01Phosphoenol pyruvateF02SulfateH11Methane sulfonic acid
C02Phospho-Glycolic acidF03ThiosulfateH12Tetramethylene sulfone
C03D-Glucose-1-PhosphateF04Tetrathionate
C052-Deoxy-D-Glucose 6-PhosphateF06Dithiophosphate
C06D-Glucosamine-6-PhosphateF07L-Cysteine
C076-Phospho-Gluconic acidF08D-Cysteine
C08Cytidine-2’-MonophosphatF09L-Cysteinyl_x005f glycine
C09Cytidine-3’-MonophosphateF10L-Cysteic acid
C10Cytidine-5’-MonophosphateF11Cysteamine
C11Cytidine-2′,3’-Cyclic monophosphateF12L-Cysteine sulfinic acid
C12Cytidine- 3′,5’-Cyclic monophosphateG01N-Acetyl-L-Cysteine

Substrates in PM 4 phosphorus and sulfur source Micro plates significantly supported the growth of Rhizopus oryzae.

Figure 3

Table 4

WellSubstrateWellSubstrateWellSubstrate
PM5
A01Negative controlC09(5) 4-Amino-Imidazole-4 (5)-CarboxamideF07Deferoxamine mesylate
A02Positive controlC11InosineF08D-(+)-Glucose
A03L-AlanineC122’-Deoxy inosineF09N-Acetyl D-Glucosamine
A04L-ArginineD01L-OrnithineF10Thymine
A05L-AsparagineD02L-CitrullineF11Glutathione (reduced form)
A06L-Aspartic acidD03Chorismic acidF12Thymidine
A07L-CysteineD04(−)Shikimic acidG01Oxaloacetic acid
A08L-Glutamic acidD05L-Homoserine lactoneG02D-Biotin
A09Adenosine-3′,5′-cyclic monophosphateD06D-AlanineG03Cyano-Cobalamine
A10AdenineD07D-Aspartic acidG04P-Amino-Benzoic acid
A11AdenosineD08D-AlanineG05Folic acid
A122’-Deoxy adenosineD09D,L-α,ε-Diamino-pimelic acidG06Inosine+Thiamine
B01L-GlutamineD10CytosineG07Thiamine
B02GlycineD11CytidineG08Thiamine pyrophosphate
B03L-HistidineD122’-DeoxyCytidineG09Riboflavin
B04L-IsoleucineE01PutrescineG10Pyrrolo-Quinoline Quinone
B05L-LeucineE02SpermidineG11Menadione
B06L-LysineE03SpermineG12Myo-Inositol
B07L-MethionineE04PyridoxineH01Butyric acid
B08L PhenylalanineE05PyridoxalH02D, L-α-Hydroxy-Butyric acid
B09Guanosine-3′,5′-cyclic monophosphateE06PyridoxamineH03α-Ketobutyric acid
B10GuanineE07β-AlanineH04Caprylic acid
B11GuanosineE08D-Pantothenic acidH05D, L-α-Lipoic acid (oxidized form)
B122’-Deoxy guanosinesE10UracilH06D, L-Mevalonic acid
C01L-ProlineE11UridineH07D, L-Carnitine
C02L-SerineE122’-Deoxy UridineH08Choline
C03L-ThreonineF01Quinolinic acidH09Tween20
C04L-TryptophanF02Nicotinic acidH10Tween40
C05L-TyrosineF03NicotinamideH11Tween60
C06L-ValineF04β-Nicotinamide adenine dinucleotideH12Tween80
C07L-isoleucine + L-ValineF05δ-Amino-Levulinic acid
C08trans-4-HydroxyL-ProlineF06Hematin

Substrates in PM 5 nutrient supplements Micro plates significantly supported the growth of Rhizopus oryzae.

Table 5

Assembly parametersY5
Sequencing platformPacBio and Illumina PE150
Assembly methodSMRT Link v5.0.1
Genome size (Mb)50.26
Sequencing coverage45.2 x
Number of contigs41
Average contig length (bp)1,225,785
Contig N50 (bp)1,791,927
Maximum contig length (Mp)3.24
Number of all Contigs53
Maximum Contig length(bp)3,223,184
Contig N50 length (bp)1,785,794
G + C content (%)35.57

Genome characteristics and predicted features of the assembled Rhizopus oryzae strain Y5.

Table 6

Gene prediction
Gene number12,680
Gene total length (bp)17,290,559
Gene average length (bp)1,364
Gene/genome (%)34.40
Repeat
TypeNumberTotal Length (bp)In Genome (%)Average length (bp)
LTR9,5564,996,7009.9423527
DNA4,6661,615,8423.2151350
LINE2,387191,4920.381089
SINE996,3220.012664
Unknown292,1230.004273
RC877,8510.015690
Total16,8246,789,81213.5101410
TypeNumberRepeat Size (bp)Total Length (bp)In Genome (%)
TR14,8661 ~ 1,821750,5931.4935
Minisatellite DNA8,43010 ~ 60389,7430.7755
Microsatellite DNA9612 ~ 644,5350.0886
ncRNATypeNumberAverage length (bp)Total Length (bp)
tRNA2767520,768
rRNA5 s (denovo)3115345
5.8 s (denovo)000
18 s (denovo)21,8223,645
28 s (denovo)34,50513,514
snRNA391184,630

Genomic component statistics and transposable element repeat class analysis of Rhizopus oryzae strain Y5.

Genome component analysis of coding genes, repeat sequence and ncRNA

The gene prediction was annotated, resulting in 12,680 genes. The total length of genes was 17,290,559 bp, and the average length was 1,364 bp, accounting for 34.4% of the genome. The TEs (transposable elements) represented 13.51% of the genome assembly with a total length of 6,787,689 bp. The total number of TE families analyzed with RepeatMasker in the genome assembly was 16,795 of which 16,708 (99.0%) belonged to the known TEs, including 12,042 retrotransposons (Class I) and 4,666 DNA transposons (Class II; Table 6). Class I retrotransposons could be mainly divided into three groups of TEs, including LINE, LTR, and SINE. Owing to the redundancy of repeats, these duplications were more tolerant of mutations, such as transposon insertions and sequence rearrangements, and might therefore act as a hotspot for genome expansion (Lippman et al., 2004). Tandem Repeat (TR) units with species composition specificity could be used as genetic traits of species for the study of evolutionary relationships. The TR represented 1.49% of the genome assembly with a total length of 750,593 bp. Non-coding RNAs (ncRNA) resembled mRNA in structure and function, they could regulate the transcription and translation of mRNAs in close proximity to them, except that unlike mRNAs that could be translated into proteins. The total number of ncRNA was 323, with a total length of 42,902 bp.; this suggested that ncRNA formed only a small proportion of the overall genome size (Table 6).

Genome function analysis of pathogenicity related genes

The annotation result statistics of the encoded genes were shown in the Supplementary Table S2, and 9.4% (3,531 genes) could be annotated and classified into different functional categories using COG (Cluster of Orthologous Groups of proteins; Figure 4), and 3,531 predicted genes could be assigned to 23 COG families. Except for the genes predicted to have general (382 genes) or unknown functions (133 genes), the largest group of genes were involved in “translation, ribosomal structure, and biogenesis” (525 genes, 8.93%). In addition, 5,408 predicted genes had KEGG orthologs, and 4,013 predicted genes had Swiss-Prot orthologs. A total of 12,167 predicted genes had NR orthologs, the largest proportion. The PHI database (Martin et al., 2015), mainly derived from fungi, oomycetes, and bacterial pathogens, indicated that the infected hosts include animals, plants, fungi, and insects. Complete proteome of R. oryzae was aligned to PHI database to reveal the pathogenicity related proteins. We observed a total of 1,147 (9.05%) PHI genes were classified into different classes such as “chemistry target: resistance to chemical” “effector (plant a virulence determinant)” “increased virulence (hypervirulence)” “lethal” “loss of pathogenicity” “no data found” “reduced virulence” and “unaffected pathogenicity.” As shown in Figure 5, we observed 10 genes associated with chemistry target about resistance to chemical. Furthermore, 115 genes belonged to lethal and 25 to increased virulence class. Remaining three classes viz., loss of pathogenicity, reduced virulence and unaffected pathogenicity were having 152, 589 and 253 genes, respectively. Pathogenicity related genes identified in this study have high relevance in future fungicide designing. The database is important for finding target genes for drug interventions, and it also includes antifungal compounds and corresponding target genes. The whole-genome sequence and annotation of Rhizopus oryzae isolate Y5 have been deposited at NCBI3 with accession PRJNA814049; BioSample SAMN26535981.

Figure 4

Figure 5

Discussion

Rhizopus oryzae is a destructive fungal pathogen during flue-cured tobacco. The pathogen’s extensive host range includes Rosaceae, Cucurbitaceae, Solanaceae, Brassicaceae, and Umbelliferae (Xu et al., 2020). Many molecular biology, genetic, and genomic studies have been conducted on R. oryzae (Ellis, 1985; Ma et al., 2009). PMs have been used to analyze many microorganisms’ phenotypes, including Botrytis cinerea (Wang et al., 2018b), Phytophthora parasitica (Wang M. S. et al., 2015), and Alternaria alternata (Wang H. C. et al., 2015), comparisons were made with these pathogens. In this study, the metabolic ability of a R. oryzae isolate obtained from flue-cured tobacco was systematically studied using PMs, and important metabolic diversity information was obtained. So far, many Zygomycetes genomes have been sequenced and are publicly available, including R. oryzae from different hosts. We sequenced the genome of strain Y5 from flue-cured tobacco using Illumina HiSeq and Pacific Biosciences (PacBio) technologies. The genome combined with the metabolic phenotype of the pathogen could provide a reference for the study of fungal biological characteristics and population genetic diversity. Genome sequencing can reveal role of each individual gene and their networks responsible for plant pathogen interaction, growth, evolutionary relationship and genes for pathogenicity (Iquebal et al., 2017). Whole genome sequencing of R. oryzae is imperative not only to study the host-pathogen (HP) interaction but such knowledge discovery may lead to more effective disease combating strategy. Annotated genes/ predicted proteins can be directly used as new targets in fungicides development using computational approach (Acero et al., 2011).

Our study revealed that R. oryzae had a comparatively narrow utilization range of carbon sources. In comparison to carbon, nitrogen utilization efficiency was comparatively higher, which explained the R. oryzae infection mechanism for tobacco. Higher levels of an unutilized carbon source or less accessible nitrogen sources may affect pathogen infection and subsequently restrain disease. The PM 4 and PM 6–8 plates showed high levels of metabolic activity; this is comparable to findings from other researchers studying R. oryzae (Wang et al., 2018a). In our study, for the carbon or nitrogen substrates, carbohydrates or amino acids and peptides were greatly utilized by R. oryzae. The utilized substrates may sustain the continued existence of R. oryzae in different hosts and thus affect the pathogenicity of the pathogen. Compared with other pathogens, A. alternata had a comparatively small range of accessible carbon compounds, and most nitrogen, sulfur, and phosphorus sources are metabolized. Pseudomonas syringae had a comparatively small range of accessible carbon compounds, and most nitrogen, sulfur, and phosphorus sources could not be metabolized. B. cinerea had a small range of accessible carbon compounds with different crops, and most nitrogen, sulfur, and phosphorus sources were metabolized comparatively. Most pathogens made full use of nitrogen sources and used carbon sources to a larger or smaller extent. The prevention and control of tobacco pole rot through exploring the absorption and utilization of different nutritional elements by the pathogenic fungus, combined with the nutritional elements absorbed by tobacco, are discussed further.

We report the genomic analysis of R. oryzae, one of the most widely used extracellular enzyme producers, which could also cause disease in plants and humans. A genome sequence of 50.3 Mb was assembled into 41 contigs with an N50 length of 1,791,927 bp, maximum contig length of 3,223,184 bp, and total contig length of 50,257,186 bp. Genome information of all R. oryzae strains is supplied in Supplementary Table S1. Small differences were recorded in the genome sizes of the R. oryzae strains, ranging from 37.5–55.8 Mb except for strain GL39 (72.36 Mb), and the average GC content of the genomes was 34.7%, which was lower than that of strain Y5 of R. oryzae GC content. The genome host of the R. oryzae strains were “Homo sapiens,” “urine,” and “unknown.” Most of the strains were derived from the medical environment, for instance, “lung transplant,” “sinus,” “tracheal biopsy,” “nasal cavity,” “bone marrow,” “ethmoid sinus of diabetic” and “bronchial wash.” The Y5 strain isolated from tobacco was infrequently mentioned in genomic. In this study, the total number of predicted protein-coding genes was 12,680, compared to approximately 14,000 as the highest number from other strains, with a total length of 17,290,559 bp, average gene length of 1,364 bp, and gene length/genome (%) of 34.4%. Pathogenicity related genes identified in this study have high relevance in future fungicide designing using PHI database (Cools and Hammond-Kosack, 2013). Our enlisted lethal and virulence genes can be used in future research of fungal disease management, especially by designing of new generation genomic based fungicide (Seringhaus et al., 2006), the number of lethal genes in our study was found to be 115. In other ascomycotina, for example in S. cerevisiae, it was found to have 900 lethal genes (Miklos and Rubin, 1996).

The characteristics of each strain depend on which genes had changed and how that affected the phenotype. Although numerous studies had investigated the genome-wide properties of R. oryzae and the differences between multiple strains, few reports had compared the strain’s genome with its metabolic phenotype, despite advances in genetic techniques. Further studies of the effects of such metabolism in strains using gene knockout methods are needed. The deluge of data generated by genome sequencing has led to an increasing reliance on bioinformatic predictions, since the traditional experimental approach of characterizing gene function one at a time cannot possibly keep pace with the sequence-based discovery of novel genes (Johnson et al., 2008). One common application of the PM system is to detect phenotypic changes associated with gene knockouts, and this strategy has been used to study gene function and to assess and improve genome annotation (Loh et al., 2006). We have further explored the linkage between metabolic phenotypes and genomes in the next study. Aspergillus luchuensis (Hong et al., 2013) could grow on carbon sources other than glucose. Further RNA sequencing studies are needed to provide quantitative evidence to support our results and elucidate expression levels under stress conditions. In addition, Orthologous Groups (OGs) containing multiple genes also affect the traits of the lines; thus, these OGs should be further studied. Further Studies to know the relationship between metabolic activities under stress conditions are needed. This study demonstrated the power of whole genome analysis to reveal novel gene sequences between different strains. Whole genome analysis of R. oryzae should reveal the unique genes and non-coding regions of each strain. Therefore, the genome sequences of R. oryzae provided a unique resource for studying pathogenicity determinants in this pathogen, understanding host adaptation mechanisms, and designing specific disease management control strategies.

Conclusion

This study demonstrates that phenotypic characterization of a strain of the pathogen R. oryzae isolated from tobacco was conducted to provide basic biological and pathological information using Biolog Phenotype MicroArray (PM). In addition, the Y5 strain of R. oryzae was sequenced using Illumina HiSeq and Pacific Biosciences (PacBio) technologies. It indicated that R. oryzae could metabolize 54.21% of tested carbon sources, 86.84% of nitrogen sources, 100% of sulfur sources, and 98.31% of phosphorus sources. And also, genome sequencing results showed that the R. oryzae Y5 strain had raw data assembled into of 2,271 Mbp with an N50 value of 10,563 bp. A genome sequence of 50.3 Mb was polished and assembled into 53 contigs with an N50 length of 1,785,794 bp, maximum contig length of 3,223,184 bp, and a sum of contig lengths of 51,182,778 bp. A total of 12,680 protein-coding genes were predicted. This study demonstrated the power of whole genome analysis to reveal novel gene sequences between different strains. Whole genome analysis of R. oryzae should reveal the unique genes and non-coding regions of each strain. Therefore, the genome sequences of R. oryzae provided a unique resource for studying pathogenicity determinants in this pathogen, understanding host adaptation mechanisms, and designing specific disease management control strategies.

Funding

This work was supported by the China National Tobacco Corporation [110202101048(LS-08), 110202001035(LS-04)], the National Natural Science Foundation of China (31960550, 32160522), the Hundred Level Innovative Talent Foundation of Guizhou Province [GCC(2022)028–1], the Guizhou Science Technology Foundation (ZK[2021]Key036), the International Science and Technology Cooperation Base ([2020]4102), Guizhou Provincial Academician Workstation of Microbiology and Health([2020]4004), and the Guizhou Tobacco Company (2020XM22, 2020XM03). The authors declare that this study received funding from the China National Tobacco Corporation and Guizhou Tobacco Company. The funders were not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication. All authors declare no other competing interests.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.1031023/full#supplementary-material

SUPPLEMENTARY FIGURE S1

Evolutionary relationships of Rhizopus oryzae. Phylogenetic tree showing the phylogenetic relationship amongst difference belonging to the species Rhizopus oryzae (Rhizopus oryzae, also known as Rhizopus arrhizus), is a filamentous fungus that is the most common cause of mucormycosis, also referred to as zygomycosis. An opportunistic pathogen, R. oryzae causes disease primarily in immunocompromised people, such as those with diabetes mellitus, cancer, or AIDS. R. oryzae is found in soil, decaying fruit and vegetables, old bread, and animal dung. It is used in the preparation of fermented foods and alcoholic beverages in Asia. R. oryzae is also a destructive pathogen that frequently causes tobacco pole rot in curing chambers.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: https://www.ncbi.nlm.nih.gov/, PRJNA814049.

Author contributions

ZL, C-hS, YH, H-cW, W-hL and L-tC contributed to conceptualize and design of the study. ZL organized the database and performed the statistical analysis. ZL and H-cW wrote the first draft of the manuscript. H-cW and YH revised the manuscript and wrote some sections. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    AceroF. J. F.CarbúM.El-AkhalM. R.GarridoC.González-RodríguezV. E.CantoralJ. M. (2011). Development of proteomics-based fungicides: new strategies for environmentally friendly control of fungal plant diseases. Int. J. Mol. Sci.12, 795816. doi: 10.3390/ijms12010795

  • 2

    AilloudF.LoweT.CellierG.RocheD.AllenC.PriorP. (2015). Comparative genomic analysis of Ralstonia solanacearum reveals candidate genes for host specificity. BMC Genomics16:270. doi: 10.1186/s12864-015-1474-8

  • 3

    AmosB.RolfA. (2000). The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. NAR.28, 4548.

  • 4

    ArduiS.AmeurA.VermeeschJ. R.HestandM. S. (2018). Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics. Nucleic Acids Res.46, 21592168. doi: 10.1093/nar/gky066

  • 5

    AshburnerM.BallC. A.BlakeJ. A.BotsteinD.ButlerH.CherryJ. M.et al. (2000). Gene ontology: tool for the unification of biology. Nat. Genet.25, 2529. doi: 10.1038/75556

  • 6

    BensonG. (1999). Tandem repeats finder: a program to analyze DNA sequences. NAR.27, 573580. doi: 10.1093/nar/27.2.573

  • 7

    BochnerB. R.GadzinskiP.PanomitrosE.GadzinskiP.PanomitrosE. (2001). Phenotype microarrays for high-throughput phenotypictesting and assay of gene function. Genome Res.11, 12461255. doi: 10.1101/gr.186501

  • 8

    CantarelB. L.CoutinhoP. M.RancurelC.BernardT.LombardV.HenrissatB. (2009). The carbohydrate-active EnZymes database (CAZy): an expert resource for glycogenomics. NAR37, D233D238. doi: 10.1093/nar/gkn663

  • 9

    ChenQ. L.CaiL.WangH. C.CaiL. T.GoodwinP.MaJ.et al. (2020). Fungal composition and diversity of the tobacco leaf phyllosphere during curing of leaves. Front. Microbiol.11:2136. doi: 10.3389/fmicb.2020.554051

  • 10

    ChenQ. L.WangH. C.HuangY.et al. (2021). Pathogenicity and metabolic phenotypic characteristics of Rhizopus oryzae in tobacco under different osmotic pressure and pH environments. Tobacco Science & Technology.2022, 1623. doi: 10.16135/j.issn1002-0861.2021.0081

  • 11

    CoolsH. J.Hammond-KosackK. E. (2013). Exploitation of genomics in fungicide research: current status and future perspectives. Mol. Plant Pathol.14, 197210. doi: 10.1111/mpp.12001

  • 12

    CrešnarB.PetričS. (2011). Cytochrome P450 enzymes in the fungal kingdom. Biochim. Biophys. Acta1814, 2935. doi: 10.1016/j.bbapap.2010.06.020

  • 13

    EidJ.FehrA.GrayJ.LuongK.LyleJ.OttoG.et al. (2009). Real-timeDNA sequencing from single polymerase molecules. Science323, 133138. doi: 10.1126/science.1162986

  • 14

    EllisJ. J. (1985). Species and varieties in the Rhizopus arrhizus-Rhizopus oryzae group as indicated by their dna complementarity. Mycologia77, 243247. doi: 10.1080/00275514.1985.12025091

  • 15

    FelsensteinJ. (1985). Confidence limits on phylogenies: an approach using the bootstrap. Evolution39, 783791. doi: 10.1111/j.1558-5646.1985.tb00420.x

  • 16

    GardnerP. P.DaubJ.TateJ. G.NawrockiE. P.KolbeD. L.LindgreenS.et al. (2009). Rfam: updates to the RNA families database. NAR.37, D136D140. doi: 10.1093/nar/gkn766

  • 17

    GayedS. K.et al. (1972). Rhizopus arrhizus causing pole rot of flue-cured tobacco in Ontario. Can. J. Plant Sci.52, 103106. doi: 10.4141/cjps72-013

  • 18

    HolmesG. J.StangeR. R.et al. (2002). Influence of wound type and storage duration on susceptibility of sweet potatoes to Rhizopus soft rot. Plant Dis.86, 345348. doi: 10.1094/PDIS.2002.86.4.345

  • 19

    HongS. B.LeeM.KimD. H.VargaJ.FrisvadJ. C.PerroneG.et al. (2013). Aspergillus luchuensis, an industrially important black Aspergillus in East Asia. PLoS One8:e63769. doi: 10.1371/journal.pone.0063769

  • 20

    IquebalM. A.TomarR. S.ParakhiaM. V.SinglaD.JaiswalS.RathodV. M.et al. (2017). Draft whole genome sequence of groundnut stem rot fungus Athelia rolfsii revealing genetic architect of its pathogenicity and virulence. Sci. Rep.7, 110. doi: 10.1038/s41598-017-05478-8

  • 21

    JohnsonD. A.TetuS. G.PhillippyK.ChenJ.RenQ.PaulsenI. T. (2008). High-throughput phenotypic characterization of Pseudomonas aeruginosa membrane transport genes. PLoS Genet.4:e1000211. doi: 10.1371/journal.pgen.1000211

  • 22

    KanehisaM.GotoS.HattoriM.Aoki-KinoshitaK. F.ItohM.KawashimaS.et al. (2006). From genomics to chemical genomics: new developments in KEGG. NAR.34, D354D357. doi: 10.1093/nar/gkj102

  • 23

    KanehisaM.GotoS.KawashimaS.OkunoY.HattoriM. (2004). The KEGG resource for deciphering the genome. NAR.32, 277D2280D. doi: 10.1093/nar/gkh063

  • 24

    KortekampA.SchmidtkeM.SerrA.et al. (2003). Infection and decay of tobacco caused by Rhizopus oryzae/die Infektion und Fäulnis von Tabak verursacht durch Rhizopus oryzae. J. Plant. Dis. Protect.110, 535543.

  • 25

    KumarS.StecherG.TamuraK. (2016). MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol.33, 18701874. doi: 10.1093/molbev/msw054

  • 26

    LagesenK.HallinP.RødlandE. A.StærfeldtH. H.RognesT.UsseryD. W. (2007). RNAmmer: consistent and rapid annotation of ribosomal RNA genes. NAR.35, 31003108. doi: 10.1093/nar/gkm160

  • 27

    LiW.JaroszewskiL.GodzikA. (2002). Tolerating some redundancy significantly speeds up clustering of large protein databases. CCF TCBI.18, 7782. doi: 10.1093/bioinformatics/18.1.77

  • 28

    LimH. J.LeeE. H.YoonY.ChuaB.SonA. (2016). Portable lysis apparatus for rapid single-step DNA extraction of Bacillus subtilis. J. Appl. Microbiol.120, 379387. doi: 10.1111/jam.13011

  • 29

    LippmanZ.GendrelA. V.BlackM.VaughnM. W.DedhiaN.Richard McCombieW.et al. (2004). Role of transposable elements in heterochromatin and epigenetic control. Nature430, 471476. doi: 10.1038/nature02651

  • 30

    LiuL. P.QuJ. W.YiX. Q.HuangH. H.et al. (2017). Genome-wide identification, classification and expression analysis of the mildew resistance locus o (MLO) gene family in sweet orange (Citrus sinensis). Braz. Arch. Biol.Technol.60, 16784324. doi: 10.1590/1678-4324-2017160474

  • 31

    LohK. D.GyaneshwarP.Markenscoff PapadimitriouE.FongR.KimK. S.ParalesR.et al. (2006). A previously undescribed pathway for pyrimidine catabolism. Proc. Natl. Acad. Sci. U. S. A.103, 51145119. doi: 10.1073/pnas.0600521103

  • 32

    Londoño-HernándezL.Ramírez-ToroC.RuizH. A.Ascacio-ValdésJ. A.Aguilar-GonzalezM. A.Rodríguez-HerreraR.et al. (2017). Rhizopus oryzae–ancient microbial resource with importance in modern food industry. Int. J. Food Microbiol.257, 110127. doi: 10.1016/j.ijfoodmicro.2017.06.012

  • 33

    LoweT. M.EddyS. R. (1997). tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. NAR.25, 955964. doi: 10.1093/nar/25.5.955

  • 34

    MaL. J.IbrahimA. S.SkoryC.GrabherrM. G.BurgerG.ButlerM.et al. (2009). Genomic analysis of the basal lineage fungus Rhizopus oryzae reveals a whole-genome duplication. PLoS Genet.5:e1000549. doi: 10.1371/journal.pgen.1000549

  • 35

    MartinU.RashmiP.ArathiR.et al. (2015). “The pathogen-host interactions database (phi-base): additions and future developments,” in Nucleic Acids Research (NAR). Oxford, England: Oxford University Press.

  • 36

    MedemaM. H.BlinK.CimermancicP.de JagerV.ZakrzewskiP.FischbachM. A.et al. (2011). antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. NAR.39, W339W346. doi: 10.1093/nar/gkr466

  • 37

    MiklosG. L. G.RubinG. M. (1996). The role of the genome project in determining gene function: insights from model organisms. Cells86, 521529. doi: 10.1016/S0092-8674(00)80126-9

  • 38

    MiltonS. J.VamseeS. R.DorjeeG. T.et al. (2014). “The transporter classification database,” in Nucleic Acids Research (NAR). Oxford, England: Oxford University Press.

  • 39

    NawrockiE. P.KolbeD. L. (2009). Eddy SR: infernal 1.0: inference of RNA alignments. CCF TCBI.25, 13351337. doi: 10.1093/bioinformatics/btp157

  • 40

    NguyenM. H.KaulD.MutoC.ChengS. J.RichterR. A.BrunoV. M.et al. (2020). Genetic diversity of clinical and environmental Mucorales isolates obtained from an investigation of mucormycosis cases among solid organ transplant recipients. Microb. Genomics.6:e000473. doi: 10.1099/mgen.0.000473

  • 41

    PetersenT. N.BrunakS.Von HeijneG.et al. (2011). SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat. Methods8, 785786. doi: 10.1038/nmeth.1701

  • 42

    ReinerJ.PisaniL.QiaoW.SinghR.YangY.ShiL.et al. (2018). Cytogenomic identification and long-read single molecule real-time (SMRT) sequencing of aBardet–Biedl syndrome 9(BBS9). NPJ Genom. Med.3, 15. doi: 10.1038/s41525-017-0042-3

  • 43

    SahaS.BridgesS.MagbanuaZ. V.PetersonD. G. (2008). Empirical comparison of ab initio repeat finding programs. NAR36, 22842294. doi: 10.1093/nar/gkn064

  • 44

    SaitouN.NeiM. (1987). The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol.4, 406425. PMID:

  • 45

    SeringhausM.PaccanaroA.BornemanA.SnyderM.GersteinM. (2006). Predicting essential genes in fungal genomes. Genome Res.16, 11261135. doi: 10.1101/gr.5144106

  • 46

    StankeM.DiekhansM.BaertschR.HausslerD. (2008). Using native and syntenically mapped cDNA alignments to improve de novo gene finding. CCF TCBI.24, 637644. doi: 10.1093/bioinformatics/btn013

  • 47

    SuJ. E.PanF. L.MaM.et al. (2018). Effects of different curing process on tobacco mildew disease. Tianjin Agricultural Sciences.24, 5155. doi: 10.3969/j.issn.1006–6500.2018.02.013

  • 48

    SurhoneL. M.TimpledonM. T.MarsekenS. F.et al. (2010). Quicksand: hydrogel, granular matter, mercury (element), water circulation, non-newtonian fluid, viscosity. United Kingdom: Betascript Publishing.

  • 49

    TamuraK.NeiM.KumarS. (2004). Prospects for inferring very large phylogenies by using the neighbor-joining method. PNAS101, 1103011035. doi: 10.1073/pnas.0404206101

  • 50

    Von EiffC.McNamaraP.BeckerK.BatesD.LeiX. H.et al. (2006). Phenotype microarray profiling of Staphylococcus aureus menD and hemB mutants with the small-colony-variant phenotype. J. Bacteriol.188, 687693. doi: 10.1128/JB.188.2.687-693.2006

  • 51

    WangH. C.HuangY. F.TangX. G.CaiL. T.ChenQ. Y.WangM. S.et al. (2016). Leaf and stem rot of tobacco (Nicotiana tabacum) caused by Rhizopus oryzae in closed curing barns in Guizhou Province of China. Plant Dis.100, 536536. doi: 10.1094/PDIS-08-15-0935-PDN

  • 52

    WangH. C.HuangY. F.XiaH. Q.WangJ.WangM. S.ZhangC. Q.et al. (2015). Phenotypic analysis of Alternaria alternata, the causal agent of tobacco Brown spot. Plant Pathology J.14, 7985. doi: 10.3923/ppj.2015.79.85

  • 53

    WangH. C.LiL. C.CaiB.CaiL. T.ChenX. J.YuZ. H.et al. (2018b). Metabolic phenotype characterization of Botrytis cinerea, the causal agent of Gray Mold. Front. Microbiol.13, 9470. doi: 10.3389/fmicb.2018.00470

  • 54

    WangM. S.WangH. C.HuangY. F.et al. (2015). Phenotypic analysis of Phytophthora parasitica by using high throughput phenotypic microarray. Acta Microbiol Sin.55, 13561363. doi: 10.13343/j.cnki.wsxb.20150036

  • 55

    WangH. C.ZhangM.ZhangC. Q.et al. (2018a). Pathogen detection, carbon metabolic phenotype analysis of Rhizopus oryzae from tobacco and its sensitivity to seven fungicides. J. Pestic. Sci.20, 743748. doi: 10.16801/j.issn.1008-7303.2018.0095

  • 56

    WatanabeT. (2002). Pictorial atlas of soil and seed fungi: Morphologies of cultured fungi and key to species. United States: CRC press.

  • 57

    XuD.LiR. G.XiaoS. J.et al. (2020). Host range of the lily pathogen Rhizopus oryzae. J. yi chun University.42, 9599.

  • 58

    ZhangY. Q.LiQ. S.WangC. Y.et al. (2018). Study on color parameters and curing characteristics of flue-cured tobacco leaves in process ofmaturity. Southwest China J Agric Sci.31, 6267. doi: 10.16213/j.cnki.scjas.2018.1.011

  • 59

    ZhouL.LeiX. H.BochnerB. R.WannerB. L. (2003). Phenotype microarray analysis of Escherichia coli K-12mutants with deletions of all two-component systems. J. Bacteriol.185, 49564972. doi: 10.1128/JB.185.16.4956-4972.2003

Summary

Keywords

Rhizopus oryzae, whole genome sequence, tobacco pole rot, biolog phenotype MicroArray, metabolic fingerprint

Citation

Li Z, Shi C, Huang Y, Wang H, Li W and Cai L (2023) Phenotypic analysis and genome sequence of Rhizopus oryzae strain Y5, the causal agent of tobacco pole rot. Front. Microbiol. 13:1031023. doi: 10.3389/fmicb.2022.1031023

Received

29 August 2022

Accepted

07 December 2022

Published

04 January 2023

Volume

13 - 2022

Edited by

Stepan Toshchakov, Kurchatov Institute, Russia

Reviewed by

Feng Zhang, Nanjing Agricultural University, China; Alexander N. Ignatov, Peoples' Friendship University of Russia, Russia

Updates

Copyright

*Correspondence: Cai-hua Shi, Han-cheng Wang,

This article was submitted to Microbial Physiology and Metabolism, a section of the journal Frontiers in Microbiology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics