AUTHOR=Giraud-Gatineau Audrey , Texier Gaetan , Garnotel Eric , Raoult Didier , Chaudet Hervé TITLE=Insights Into Subspecies Discrimination Potentiality From Bacteria MALDI-TOF Mass Spectra by Using Data Mining and Diversity Studies JOURNAL=Frontiers in Microbiology VOLUME=11 YEAR=2020 URL=https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2020.01931 DOI=10.3389/fmicb.2020.01931 ISSN=1664-302X ABSTRACT=
Bacterial identification at subspecies level is critical in clinical care and epidemiological investigations due to the different epidemic potentialities of a species. For this purpose, matrix-assisted laser desorption ionization – time-of-flight mass spectrometry (MALDI-TOF MS) has been proposed in place of molecular genotyping, but with some result discrepancies. The aim of this work is to methodically mine the expression diversities of MALDI-TOF bacterial species spectra and their possible latent organization in order to evaluate their subspecies specific expression. Peak expression diversities of MALDI-TOF spectra coming from routine identifications have been analyzed using Hill numbers, rarefaction curves, and peak clustering. Some size effect critical thresholds were estimated using change point analyses. We included 167,528 spectra corresponding to 405 species. Species spectra diversities have a broad size-dependent variability, which may be influenced by the kind of sampling. Peak organization is characterized by the presence of a main cluster made of the most frequently co-occurring peaks and around 20 secondary clusters grouping less frequently co-occurring peaks. The 35 most represented species in our sample are distributed in two groups depending on the focusing of their protein synthesis activity on the main cluster or not. Our results may advocate some analogy with genomics studies of bacteria, with a main species-related cluster of co-occurring peaks and several secondary clusters, which may host peaks able to discriminate bacterial subgroups. This systematic study of the expression diversities of MALDI-TOF spectra shows that latent organization of co-occurring peaks supports subspecies discrimination and may explain why studies on MALDI-TOF-based typing exhibit some result divergences.