AUTHOR=Wang Mengqi , Tu Qichao TITLE=Effective data filtering is prerequisite for robust microbial association network construction JOURNAL=Frontiers in Microbiology VOLUME=13 YEAR=2022 URL=https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2022.1016947 DOI=10.3389/fmicb.2022.1016947 ISSN=1664-302X ABSTRACT=

Microorganisms do not exist as individual population in the environment. Rather, they form complex assemblages that perform essential ecosystem functions and maintain ecosystem stability. Besides the diversity and composition of microbial communities, deciphering their potential interactions in the form of association networks has attracted many microbiologists and ecologists. Much effort has been made toward the methodological development for constructing microbial association networks. However, microbial profiles suffer dramatically from zero values, which hamper accurate association network construction. In this study, we investigated the effects of zero-value issues associated with microbial association network construction. Using the TARA Oceans microbial profile as an example, different zero-value-treatment approaches were comparatively investigated using different correlation methods. The results suggested dramatic variations of correlation coefficient values for differently treated microbial profiles. Most specifically, correlation coefficients among less frequent microbial taxa were more affected, whichever method was used. Negative correlation coefficients were more problematic and sensitive to network construction, as many of them were inferred from low-overlapped microbial taxa. Consequently, microbial association networks were greatly differed. Among various approaches, we recommend sequential calculation of correlation coefficients for microbial taxa pairs by excluding paired zero values. Filling missing values with pseudo-values is not recommended. As microbial association network analyses have become a widely used technique in the field of microbial ecology and environmental science, we urge cautions be made to critically consider the zero-value issues in microbial data.