AUTHOR=Zhao Xinye , Du Alexander , Qiu Peng
TITLE=scMODD: A model-driven algorithm for doublet identification in single-cell RNA-sequencing data
JOURNAL=Frontiers in Systems Biology
VOLUME=2
YEAR=2023
URL=https://www.frontiersin.org/journals/systems-biology/articles/10.3389/fsysb.2022.1082309
DOI=10.3389/fsysb.2022.1082309
ISSN=2674-0702
ABSTRACT=
Single-cell RNA sequencing (scRNA-seq) data often contain doublets, where a doublet manifests as 1 cell barcode that corresponds to combined gene expression of two or more cells. Existence of doublets can lead to spurious biological interpretations. Here, we present single-cell MOdel-driven Doublet Detection (scMODD), a model-driven algorithm to detect doublets in scRNA-seq data. ScMODD achieved similar performance compared to existing doublet detection algorithms which are primarily data-driven, showing the promise of model-driven approach for doublet detection. When implementing scMODD in simulated and real scRNA-seq data, we tested both the negative binomial (NB) model and the zero-inflated negative binomial (ZINB) model to serve as the underlying statistical model for scRNA-seq count data, and observed that incorporating zero inflation did not improve detection performance, suggesting that consideration of zero inflation is not necessary in the context of doublet detection in scRNA-seq.