AUTHOR=Noor Fatima , Ashfaq Usman Ali , Bakar Abu , ul Haq Waqar , Allemailem Khaled S. , Alharbi Basmah F. , Al-Megrin Wafa Abdullah I. , Tahir ul Qamar Muhammad TITLE=Discovering common pathogenic processes between COVID-19 and HFRS by integrating RNA-seq differential expression analysis with machine learning JOURNAL=Frontiers in Microbiology VOLUME=14 YEAR=2023 URL=https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2023.1175844 DOI=10.3389/fmicb.2023.1175844 ISSN=1664-302X ABSTRACT=

Zoonotic virus spillover in human hosts including outbreaks of Hantavirus and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) imposes a serious impact on the quality of life of patients. Recent studies provide a shred of evidence that patients with Hantavirus-caused hemorrhagic fever with renal syndrome (HFRS) are at risk of contracting SARS-CoV-2. Both RNA viruses shared a higher degree of clinical features similarity including dry cough, high fever, shortness of breath, and certain reported cases with multiple organ failure. However, there is currently no validated treatment option to tackle this global concern. This study is attributed to the identification of common genes and perturbed pathways by combining differential expression analysis with bioinformatics and machine learning approaches. Initially, the transcriptomic data of hantavirus-infected peripheral blood mononuclear cells (PBMCs) and SARS-CoV-2 infected PBMCs were analyzed through differential gene expression analysis for identification of common differentially expressed genes (DEGs). The functional annotation by enrichment analysis of common genes demonstrated immune and inflammatory response biological processes enriched by DEGs. The protein–protein interaction (PPI) network of DEGs was then constructed and six genes named RAD51, ALDH1A1, UBA52, CUL3, GADD45B, and CDKN1A were identified as the commonly dysregulated hub genes among HFRS and COVID-19. Later, the classification performance of these hub genes were evaluated using Random Forest (RF), Poisson Linear Discriminant Analysis (PLDA), Voom-based Nearest Shrunken Centroids (voomNSC), and Support Vector Machine (SVM) classifiers which demonstrated accuracy >70%, suggesting the biomarker potential of the hub genes. To our knowledge, this is the first study that unveiled biological processes and pathways commonly dysregulated in HFRS and COVID-19, which could be in the next future used for the design of personalized treatment to prevent the linked attacks of COVID-19 and HFRS.