Anaerobic digestion (AD) is important in treating of food waste, and thousands of metagenome-assembled genomes (MAGs) have been constructed for the microbiome in AD. However, due to the limitations of the short-read sequencing and assembly technologies, most of these MAGs are grouped from hundreds of short contigs by binning algorithms, and the errors are easily introduced.
In this study, we constructed a total of 60 non-redundant microbial genomes from 64.5 Gb of PacBio high-fidelity (HiFi) long reads, generated from the digestate samples of a full-scale biogas plant fed with food waste. Of the 60 microbial genomes, all genomes have at least one copy of rRNA operons (16S, 23S, and 5S rRNA), 54 have ≥18 types of standard tRNA genes, and 39 are circular complete genomes. In comparison with the published short-read derived MAGs for AD, we found 23 genomes with average nucleotide identity less than 95% to any known MAGs. Besides, our HiFi-derived genomes have much higher average contig N50 size, slightly higher average genome size and lower contamination. GTDB-Tk classification of these genomes revealed two genomes belonging to novel genus and four genomes belonging to novel species, since their 16S rRNA genes have identities lower than 95 and 97% to any known 16S rRNA genes, respectively. Microbial community analysis based on the these assembled genomes reveals the most predominant phylum was
High-fidelity sequencing not only generated microbial genomes with obviously improved quality but also recovered a substantial portion of novel genomes missed in previous short-read based studies, and the novel genomes will deepen our understanding of the microbial composition in AD of food waste.