MINI REVIEW article

Front. Microbiol., 15 November 2022
Sec. Systems Microbiology
This article is part of the Research Topic Artificial Intelligence in Forensic Microbiology

Advances in microbial metagenomics and artificial intelligence analysis in forensic identification

  • 1Department of Dermatology, The First Hospital of China Medical University, Shenyang, China
  • 2Key Laboratory of Immunodermatology, Ministry of Education and NHC, National Joint Engineering Research Center for Theranostics of Immunological Skin Diseases, Shenyang, China
  • 3Institute of Respiratory Disease, China Medical University, Shenyang, China

Microorganisms, which are widely distributed in nature and the human body, show unique application value in forensic identification. Recent advances in high-throughput sequencing technology and significant reductions in analysis costs have markedly promoted the development of forensic microbiology and metagenomics. Rapidly progressing artificial intelligence (AI) methods and computational approaches have likewise shown unique application value in forensics and the potential to address relevant forensic questions. Here, we summarize the current status of microbial metagenomics and AI analysis in forensic microbiology, including postmortem interval inference, individual identification, geolocation, and tissue/fluid identification.

Introduction

“Microorganism” is a general term for tiny organisms, mainly bacteria, viruses, and fungi, that are invisible to the naked eye or cannot be observed clearly. Microorganisms are small, simple in structure, and widely present in nature and the human body. Microbiome research and metagenomics are advancing rapidly owing to progress in genome sequencing technology, improved microbial sampling methods, and the rise of bioinformatics. In the era of big data, artificial intelligence and its related technologies continue to be developed and refined, and the results have been widely used in many disciplines, including forensics (Rahaman et al., 2020; Zhang et al., 2021a; Chen et al., 2022).

Postmortem interval estimation

Inference of the time of death, or the postmortem interval (PMI), is an important task during forensic examination. Host- and environment-related microbial community succession during postmortem decay, which occurs in a regular, clock-like manner after human death, provides novel ideas for PMI inference (Metcalf et al., 2016). Johnson et al. (2016) sampled the skin microbiota in the nasal and ear canals of decomposing human cadavers to establish an algorithm for predicting the PMI, and thereby successfully demonstrated that the skin microbiota is a promising tool in forensic death investigations. The application of microbial community changes for PMI estimation has gradually become a topic of major interest in forensic research.

The oral cavity is a key site for research on human microbial communities: its microbial community is among the richest in the body and the second most complex after that of the gastrointestinal tract. Adserias-Garriga et al. (2017) monitored the oral microbiota of donated human bodies within 12 days after death. They found that Firmicutes and Actinobacteria are the predominant phyla in the fresh stage, Tenericutes is the predominant phylum in the bloat stage, and Firmicutes is the predominant phylum in advanced decay. Dong et al. (2019) found that when the PMI was 0 h, the dominant phyla in the oral cavity of mice were Proteobacteria, Firmicutes, Actinobacteria, and Bacteroidetes. Within 240 h after the death of the mice, Proteobacteria and Firmicutes remained dominant throughout. The oral microbiota changes in mice thus differ from those in decaying human bodies. By constructing linear regression models between relative abundance and postmortem interval, the authors identified Gammaproteobacteria and Proteus species as the best candidates for inferring the PMI, especially the late PMI. The R² value of both linear models was 0.99.
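The regression step described for the mouse study can be illustrated with a minimal sketch. It fits an ordinary least-squares line between PMI and the relative abundance of one taxon, reports R², and inverts the line to estimate a PMI from a new abundance value. The data points and the function names (`fit_line`, `r_squared`, `predict_pmi`) are hypothetical and chosen only for illustration; they are not values or code from Dong et al. (2019).

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    y_bar = mean(y)
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


def predict_pmi(abundance, slope, intercept):
    """Invert the fitted line to estimate PMI from a relative abundance."""
    return (abundance - intercept) / slope


# Hypothetical observations: PMI in hours and relative abundance of one taxon.
pmi_hours = [0, 48, 96, 144, 192, 240]
rel_abundance = [0.05, 0.14, 0.26, 0.35, 0.44, 0.56]

slope, intercept = fit_line(pmi_hours, rel_abundance)
r2 = r_squared(pmi_hours, rel_abundance, slope, intercept)
estimated_pmi = predict_pmi(0.30, slope, intercept)
```

A high R² on the fitting data only shows that the trend is close to linear in that dataset; it does not by itself indicate how well the line predicts PMI for new cadavers.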

Microorganisms play a vital role in the decomposition process. However, relatively few studies are available on the postmortem migration behavior of microbial communities inside cadavers. Liu et al. (2020) assessed the microbial community structure in the brain, heart, and cecum of mice over 15 days postmortem and found that an artificial neural network (ANN) combined with the postmortem microbial dataset from the cecum was the optimal model, with a mean absolute error of 1.5 ± 0.8 h within 24-h decomposition and 14.5 ± 4.4 h within 15-day decomposition. This model has the potential to serve as a useful technique for PMI inference; however, further verification is needed.

The above studies exposed cadavers to the air during decomposition. However, the microbial community in buried decomposing cadavers may differ from that in cadavers exposed to air because of different conditions such as oxygen content, humidity, light, and soil composition. Zhang et al. (2021b) analyzed postmortem microorganisms in the gravesoil, rectum, and skin of buried rats using the random forest algorithm to predict the PMI. The mean absolute errors (MAEs) of the predictions based on microorganisms in the rectum, cadaver skin, and gravesoil were 2.06, 2.13, and 1.82 days, respectively, within 60 days after death. This study developed the first model to predict the PMI of buried bodies based on microbial community succession and machine learning algorithms, which can provide timing information for forensic investigations of buried body cases.
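The error metric quoted for these PMI models, the mean absolute error, is the average of the absolute differences between the true and predicted PMI. The following minimal sketch computes it, together with the standard deviation of the absolute errors, for a set of hypothetical predictions; the numbers are invented for illustration and do not come from any of the cited studies.

```python
from statistics import mean, stdev


def absolute_errors(true_pmi, predicted_pmi):
    """Per-sample absolute difference between true and predicted PMI."""
    if len(true_pmi) != len(predicted_pmi):
        raise ValueError("true and predicted PMI lists must have equal length")
    return [abs(t - p) for t, p in zip(true_pmi, predicted_pmi)]


def mean_absolute_error(true_pmi, predicted_pmi):
    """Mean of the per-sample absolute errors (same unit as the PMI)."""
    return mean(absolute_errors(true_pmi, predicted_pmi))


# Hypothetical true and model-predicted PMIs, in days.
true_days = [1, 7, 14, 30, 45, 60]
predicted_days = [2.5, 6.0, 16.0, 27.5, 47.0, 58.5]

mae = mean_absolute_error(true_days, predicted_days)
spread = stdev(absolute_errors(true_days, predicted_days))
```

Because the MAE is expressed in the same unit as the PMI, an MAE of about 2 days within a 60-day window and an MAE of 1.5 h within a 24-h window describe very different absolute precisions and should be compared relative to the time span studied.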

Deel et al. (2021) placed the remains of six human donors outdoors to decompose on the soil surface, three in spring and three in summer. Microorganisms in the skin and soil can naturally decompose the corpse until the ribs are exposed. The investigators developed a PMI prediction model using the microbial communities on the ribs in combination with the random forest algorithm. Within 9 months, the PMI prediction error was approximately ±34 days. This study represents a preliminary attempt to study the succession of microbial communities in postmortem skeletal remains, which may provide a tool for forensic investigators to estimate the time since death of skeletal remains. However, limitations remain, such as small sample sizes; differences between seasons, including differences in soil moisture, inorganic salts, and microbial contents; and variations in the organic composition of bones and other skeletal degradation indicators.

Individual identification

Individual identification is one of the most important tasks in forensic science. A number of studies have shown that microecosystems such as the skin, oral cavity, and intestine have obvious polymorphisms and individual differences. The differences in microbial community composition and abundance in human microecosystems constitute the basis of microbial use for individual identification. In theory, each individual carries a unique set of microorganisms that differs from those of other individuals, which can be identified through microbiome analysis, and this particular microbial community can persist over long periods. Therefore, microbiome characterization is potentially applicable to forensic human identification.

Franzosa et al. (2015) found that an individual's microbiome can specifically identify its source host in a population of more than 100 people. The gut microbiome was especially stable, with more than 80% of individuals still accurately identified after 1 year. Another study found that the genotypic composition of the 16S rRNA of Cutibacterium acnes is individual-specific. The random forest machine learning method was used to combine the 16S rRNA genotype of C. acnes with the skin microbiome profile data, and the accuracy of individual identification was ~90% (Yang et al., 2019). Over time, the 16S rRNA genotype of C. acnes was more stable than the skin microbiome profile.

The Budowle team conducted a series of studies on forensic individual identification using skin microorganisms (Schmedes et al., 2017, 2018). The core microbiota of the skin was determined, and clade-specific markers were identified. A novel targeted sequencing panel, the hidSkinPlex, was developed, which contains 286 markers covering a range of taxonomies of specific microorganisms that are highly abundant on human skin. Schmedes et al. (2018) achieved accuracy rates between 54.20% and 100.00% when classifying eight individuals with samples from three body sites (i.e., foot, hand, and manubrium) using regularized multinomial logistic regression and 1-nearest-neighbor classification. Woerner et al. (2019) used the same panel to classify 51 individuals across three body sites with nearest-neighbor machine learning approaches. The accuracy rates obtained using phylogenetic distance and nucleotide diversity were 78.00% and 83.70%, respectively. As the number of individuals increased, the classification accuracy decreased.
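Nearest-neighbor classification of the kind mentioned above assigns a query sample to the individual whose reference sample is closest under some distance. The sketch below is a minimal illustration with leave-one-out evaluation. It uses Euclidean distance on small hypothetical feature vectors as a stand-in; the cited studies used phylogenetic distance or nucleotide diversity on hidSkinPlex markers, which are not reproduced here, and all names and values are assumptions.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_neighbor_label(query, references):
    """Return the label of the reference sample closest to the query.

    references is a list of (label, feature_vector) pairs.
    """
    best_label, _ = min(references, key=lambda item: euclidean(query, item[1]))
    return best_label


def leave_one_out_accuracy(samples):
    """Classify each sample against all the others; return fraction correct."""
    correct = 0
    for i, (label, profile) in enumerate(samples):
        others = samples[:i] + samples[i + 1:]
        if nearest_neighbor_label(profile, others) == label:
            correct += 1
    return correct / len(samples)


# Hypothetical skin microbiome feature vectors: two samples per individual.
samples = [
    ("A", [0.9, 0.1, 0.0]),
    ("A", [0.8, 0.2, 0.0]),
    ("B", [0.1, 0.8, 0.1]),
    ("B", [0.2, 0.7, 0.1]),
    ("C", [0.0, 0.2, 0.8]),
    ("C", [0.1, 0.1, 0.8]),
]

accuracy = leave_one_out_accuracy(samples)
```

With only three well-separated individuals this toy example is easy; as more individuals are added, their nearest neighbors crowd together, which is consistent with the reported drop in accuracy as the number of individuals grows.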

Sherier et al. (2021, 2022) proposed that single nucleotide polymorphism (SNP) genetic markers are more individualizing than taxonomic markers. They designed an improved “hidSkinPlex+” system, which comprises 365 SNPs residing in 135 markers, fewer markers than the original hidSkinPlex. Eliminating the markers that do not contribute to classification accuracy can improve the enrichment process and increase the efficiency of machine learning. They reanalyzed the same sequencing data as Woerner et al. (2019) and found that selecting markers with the highest Wright’s fixation index (FST) estimates and combining them with a support vector machine (SVM) achieved higher accuracy in individual identification (p = 0.03, chi-squared test).
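To make the marker-selection idea concrete, the sketch below computes a simplified textbook form of the fixation index for a biallelic SNP, FST = (HT - HS) / HT, where HT is the expected heterozygosity of the pooled groups and HS is the mean within-group heterozygosity, and then ranks SNPs by that value. Treating each individual's samples as one group, weighting groups equally, and the SNP names and frequencies are all assumptions made for illustration; this is not necessarily the estimator used by Sherier et al., and the SVM step is not shown.

```python
def fst_biallelic(allele_freqs):
    """Simplified FST for one biallelic SNP.

    allele_freqs holds the frequency of one allele in each group
    (here, hypothetically, one group per individual), equally weighted.
    """
    n = len(allele_freqs)
    p_bar = sum(allele_freqs) / n
    h_total = 2 * p_bar * (1 - p_bar)  # heterozygosity of pooled groups
    if h_total == 0:
        return 0.0  # SNP is fixed everywhere, so it carries no signal
    h_within = sum(2 * p * (1 - p) for p in allele_freqs) / n
    return (h_total - h_within) / h_total


def rank_snps(snp_freqs, top_k):
    """Return the names of the top_k SNPs with the highest FST."""
    scored = sorted(
        snp_freqs.items(),
        key=lambda kv: fst_biallelic(kv[1]),
        reverse=True,
    )
    return [name for name, _ in scored[:top_k]]


# Hypothetical allele frequencies for three SNPs across three individuals.
snp_freqs = {
    "snp_a": [0.0, 1.0, 0.0],  # strongly differentiated between individuals
    "snp_b": [0.5, 0.5, 0.5],  # identical in every individual
    "snp_c": [0.2, 0.8, 0.5],  # moderately differentiated
}

selected = rank_snps(snp_freqs, top_k=2)
```

SNPs with high FST vary more between individuals than within them, which is why they are attractive features for a downstream classifier.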

Tissue/fluid identification

During forensic reconstruction of crime scene activities, identification of biological traces and their bodily origin provides valuable evidence that can be presented in court. However, traces and stains at the crime scene are often exposed to the environment outside the human body for a period before being processed in the laboratory. Dobay et al. (2019) detected characteristic, highly abundant microorganisms in semen, saliva, vaginal secretions, menstrual blood, peripheral blood, and skin. The study found that samples exposed indoors for 30 days still harbor a microbial signature that can be used to identify their bodily origin. The dominant microbial signatures in skin, saliva, and semen are Propionibacterium, Prevotella, and Bacteroides, respectively. Vaginal fluid and menstrual blood share a microbial signature, as Lactobacillus makes up on average 75% and 86% of the bacterial reads, respectively.

Hanssen et al. (2017) used standard pattern recognition based on principal component analysis in combination with linear discriminant analysis and found that the microbial community was well differentiated between saliva and skin, and that the saliva microorganisms of different individuals were specific. The cross-validation accuracy was 94%. Based on massively parallel sequencing of the microbiome, Díez López et al. (2019) achieved accurate tissue-type classification of skin, saliva, and vaginal secretions by using taxonomy-independent deep learning networks. Body-site classification accuracy for the test samples was very high, as indicated by AUC values of 0.99 for skin, 0.99 for oral, and 1 for vaginal secretion. The approach can also classify forensically relevant blood samples (e.g., menstrual blood, nasal blood, fingertip blood, and venous blood) according to their body site of origin (Díez López et al., 2020).

By analyzing sequencing data from different body sites and from samples mixed with soil, Tackmann et al. (2018) identified a core set of ecologically informed microbial biomarkers for human body sites. Using Generalized Local Learning, 635 operational taxonomic units (OTUs) were reported as biomarkers, ranging from 92 (nostril) to 326 (skin) per body site. Bacteroidetes, Firmicutes, Proteobacteria, and Actinobacteria were dominant in all investigated body sites. They found high fractions of positive Firmicutes and Bacteroidetes biomarkers in feces and of Proteobacteria biomarkers in skin.
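The AUC values quoted for body-site classification are areas under the receiver operating characteristic curve, computed one class against the rest. An AUC equals the probability that a randomly chosen positive sample receives a higher classifier score than a randomly chosen negative sample, with ties counted as one half. The minimal sketch below computes it directly from that definition; the labels and scores are hypothetical and are not taken from the cited studies.

```python
def roc_auc(labels, scores):
    """AUC for a binary problem via pairwise comparison of scores.

    labels: 1 for the class of interest (e.g., "skin"), 0 for all others.
    scores: classifier score for the class of interest, higher = more likely.
    """
    pos = [s for label, s in zip(labels, scores) if label == 1]
    neg = [s for label, s in zip(labels, scores) if label == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical one-vs-rest scores for the class "skin".
is_skin = [1, 1, 1, 0, 0, 0]
skin_scores = [0.95, 0.80, 0.40, 0.60, 0.30, 0.10]

auc_skin = roc_auc(is_skin, skin_scores)
```

An AUC of 1 means every sample of the class was scored above every sample of the other classes; a value of 0.5 corresponds to random ranking.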

Geolocation

The International Metagenomics and Metadesign of Subways and Urban Biomes (MetaSUB) Consortium, which was launched in 2015, is a global network of scientists and clinicians developing knowledge of urban microbiomes by studying mass transit systems, the built environment, and hospitals. Forensic casework often depends on establishing links among a crime scene, a suspect, and an object, location, or victim. The study of environmental metagenomics therefore introduces the potential for new forensic applications such as geographical identification.

Researchers and volunteers of the MetaSUB Consortium collected ~5,000 samples from the mass transit systems of 60 cities around the world. The samples were analyzed using next-generation sequencing, yielding the largest set of global urban microbial metagenomics results reported to date (Danko et al., 2021). Multiple research teams have used public data from the MetaSUB Consortium to infer geographic origin with various bioinformatics and artificial intelligence algorithms. Huang et al. (2020) extracted features from metagenomic abundance profiles. Using logistic regression with L2 normalization, their model inferred city affiliation with a prediction accuracy of 86%. Walker and Datta (2019) analyzed whole-genome sequenced microbiota sampled from 12 cities in seven different countries. The authors applied machine learning techniques to identify the geographical provenance of the microbiome samples. Up to 90% of the samples were correctly classified, demonstrating the potential of machine learning applications in biogeography, although further evidence is necessary to extend these applications to an evidentiary context. Ryan (2019) constructed a random forest classifier based on a dataset of 311 urban microbiome samples and correctly classified 83.3% of the samples.

Conclusion

Microorganisms are widely distributed in nature and the human body. Microbial traces from the human body or crime scenes can be used effectively in forensic medicine to help solve crimes, showing great potential and unique application value. The rapid progress of artificial intelligence and its related technologies has markedly promoted the development of forensic microbiology and has introduced novel ideas and tools for solving problems in forensic practice. Research is still at a preliminary stage, and many challenges remain to be addressed, for example, limited sample sizes, limited model accuracy, and unrealistic environmental settings. As artificial intelligence analysis in forensic identification is a novel innovation, only a limited number of relevant research reports are available. Many researchers conducted a single study from a single perspective, and there were insufficient data to cross-verify the accuracy of these results. Although we have summarized these reports, it is not known how accurate the studies are. Nonetheless, it is foreseeable that microbiome-based evidence could contribute to forensic investigations in the future.

Author contributions

QH: searched and analyzed the published literature, drafted the manuscript. XN: searched and analyzed the published literature. R-QQ and ML: reviewed and edited the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This study was funded and supported by the National Natural Science Foundation of China (82173401 and U1908206).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Adserias-Garriga, J., Quijada, N. M., Hernandez, M., Rodríguez Lázaro, D., Steadman, D., and Garcia-Gil, L. J. (2017). Dynamics of the oral microbiota as a tool to estimate time since death. Mol Oral Microbiol 32, 511–516. doi: 10.1111/omi.12191

Chen, H., Li, C., Wang, G., Li, X., Mamunur Rahaman, M., Sun, H., et al. (2022). GasHis-transformer: a multi-scale visual transformer approach for gastric histopathological image detection. Pattern Recogn. 130:108827. doi: 10.1016/j.patcog.2022.108827

Danko, D., Bezdan, D., Afshin, E. E., Ahsanuddin, S., Bhattacharya, C., Butler, D. J., et al. (2021). A global metagenomic map of urban microbiomes and antimicrobial resistance. Cell 184, 3376–3393.e17. doi: 10.1016/j.cell.2021.05.002

Deel, H., Emmons, A. L., Kiely, J., Damann, F. E., Carter, D. O., Lynne, A., et al. (2021). A pilot study of microbial succession in human rib skeletal remains during terrestrial decomposition. mSphere 6:e0045521. doi: 10.1128/mSphere.00455-21

Díez López, C., Montiel González, D., Haas, C., Vidaki, A., and Kayser, M. (2020). Microbiome-based body site of origin classification of forensically relevant blood traces. Forensic Sci. Int. Genet. 47:102280. doi: 10.1016/j.fsigen.2020.102280

Díez López, C., Vidaki, A., Ralf, A., Montiel González, D., Radjabzadeh, D., Kraaij, R., et al. (2019). Novel taxonomy-independent deep learning microbiome approach allows for accurate classification of different forensically relevant human epithelial materials. Forensic Sci. Int. Genet. 41, 72–82. doi: 10.1016/j.fsigen.2019.03.015

Dobay, A., Haas, C., Fucile, G., Downey, N., Morrison, H. G., Kratzer, A., et al. (2019). Microbiome-based body fluid identification of samples exposed to indoor conditions. Forensic Sci. Int. Genet. 40, 105–113. doi: 10.1016/j.fsigen.2019.02.010

Dong, K., Xin, Y., Cao, F., Huang, Z., Sun, J., Peng, M., et al. (2019). Succession of oral microbiota community as a tool to estimate postmortem interval. Sci. Rep. 9:13063. doi: 10.1038/s41598-019-49338-z

Franzosa, E. A., Huang, K., Meadow, J. F., Gevers, D., Lemon, K. P., Bohannan, B. J., et al. (2015). Identifying personal microbiomes using metagenomic codes. Proc. Natl. Acad. Sci. U. S. A. 112, E2930–E2938. doi: 10.1073/pnas.1423854112

Hanssen, E. N., Avershina, E., Rudi, K., Gill, P., and Snipen, L. (2017). Body fluid prediction from microbial patterns for forensic application. Forensic Sci. Int. Genet. 30, 10–17. doi: 10.1016/j.fsigen.2017.05.009

Huang, L., Xu, C., Yang, W., and Yu, R. (2020). A machine learning framework to determine geolocations from metagenomic profiling. Biol. Direct 15:27. doi: 10.1186/s13062-020-00278-z

Johnson, H. R., Trinidad, D. D., Guzman, S., Khan, Z., Parziale, J. V., DeBruyn, J. M., et al. (2016). A machine learning approach for using the postmortem skin microbiome to estimate the postmortem interval. PLoS One 11:e0167370. doi: 10.1371/journal.pone.0167370

Liu, R., Gu, Y., Shen, M., Li, H., Zhang, K., Wang, Q., et al. (2020). Predicting postmortem interval based on microbial community sequences and machine learning algorithms. Environ. Microbiol. 22, 2273–2291. doi: 10.1111/1462-2920.15000

Metcalf, J. L., Xu, Z. Z., Weiss, S., Lax, S., van Treuren, W., Hyde, E. R., et al. (2016). Microbial community assembly and metabolic function during mammalian corpse decomposition. Science 351, 158–162. doi: 10.1126/science.aad2646

Rahaman, M. M., Li, C., Yao, Y., Kulwa, F., Rahman, M. A., Wang, Q., et al. (2020). Identification of COVID-19 samples from chest X-ray images using deep learning: a comparison of transfer learning approaches. J. Xray Sci. Technol. 28, 821–839. doi: 10.3233/XST-200715

Ryan, F. J. (2019). Application of machine learning techniques for creating urban microbial fingerprints. Biol. Direct 14:13. doi: 10.1186/s13062-019-0245-x

Schmedes, S. E., Woerner, A. E., and Budowle, B. (2017). Forensic human identification using skin microbiomes. Appl. Environ. Microbiol. 83:e01672-17. doi: 10.1128/AEM.01672-17

Schmedes, S. E., Woerner, A. E., Novroski, N. M. M., Wendt, F. R., King, J. L., Stephens, K. M., et al. (2018). Targeted sequencing of clade-specific markers from skin microbiomes for forensic human identification. Forensic Sci. Int. Genet. 32, 50–61. doi: 10.1016/j.fsigen.2017.10.004

Sherier, A. J., Woerner, A. E., and Budowle, B. (2021). Population informative markers selected using Wright’s fixation index and machine learning improves human identification using the skin microbiome. Appl. Environ. Microbiol. 87:e0120821. doi: 10.1128/AEM.01208-21

Sherier, A. J., Woerner, A. E., and Budowle, B. (2022). Determining informative microbial single nucleotide polymorphisms for human identification. Appl. Environ. Microbiol. 88:e0005222. doi: 10.1128/aem.00052-22

Tackmann, J., Arora, N., Schmidt, T. S. B., Rodrigues, J. F. M., and von Mering, C. (2018). Ecologically informed microbial biomarkers and accurate classification of mixed and unmixed samples in an extensive cross-study of human body sites. Microbiome 6:192. doi: 10.1186/s40168-018-0565-6

Walker, A. R., and Datta, S. (2019). Identification of city specific important bacterial signature for the MetaSUB CAMDA challenge microbiome data. Biol. Direct 14:11. doi: 10.1186/s13062-019-0243-z

Woerner, A. E., Novroski, N. M. M., Wendt, F. R., Ambers, A., Wiley, R., Schmedes, S. E., et al. (2019). Forensic human identification with targeted microbiome markers using nearest neighbor classification. Forensic Sci. Int. Genet. 38, 130–139. doi: 10.1016/j.fsigen.2018.10.003

Yang, J., Tsukimi, T., Yoshikawa, M., Suzuki, K., Takeda, T., Tomita, M., et al. (2019). Cutibacterium acnes (Propionibacterium acnes) 16S rRNA genotyping of microbial samples from possessions contributes to owner identification. mSystems 4:e00594-19. doi: 10.1128/mSystems.00594-19

Zhang, J., Li, C., Kosov, S., Grzegorzek, M., Shirahama, K., Jiang, T., et al. (2021a). LCU-net: a novel low-cost U-net for environmental microorganism image segmentation. Pattern Recogn. 115:107885. doi: 10.1016/j.patcog.2021.107885

Zhang, J., Wang, M., Qi, X., Shi, L., Zhang, J., Zhang, X., et al. (2021b). Predicting the postmortem interval of burial cadavers based on microbial community succession. Forensic Sci. Int. Genet. 52:102488. doi: 10.1016/j.fsigen.2021.102488

Keywords: artificial intelligence, microbiome, machine learning, forensic microbiology, forensic science, microbial forensics

Citation: He Q, Niu X, Qi R-Q and Liu M (2022) Advances in microbial metagenomics and artificial intelligence analysis in forensic identification. Front. Microbiol. 13:1046733. doi: 10.3389/fmicb.2022.1046733

Received: 17 September 2022; Accepted: 31 October 2022;
Published: 15 November 2022.

Edited by:

Chen Li, Northeastern University, China

Reviewed by:

Hu ShanQing, Beijing Institute of Technology, China
Fenglin Zhuo, Capital Medical University, China

Copyright © 2022 He, Niu, Qi and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rui-Qun Qi, xiaoqiliumin@163.com; Min Liu, liuminxiaoqi@163.com
