AUTHOR=Zhang Boyu , Liu Haiyan , Wu Fengxia , Ding Yuhong , Wu Jiarun , Lu Lu , Bajpai Akhilesh K. , Sang Mengmeng , Wang Xinfeng TITLE=Identification of hub genes and potential molecular mechanisms related to drug sensitivity in acute myeloid leukemia based on machine learning JOURNAL=Frontiers in Pharmacology VOLUME=15 YEAR=2024 URL=https://www.frontiersin.org/journals/pharmacology/articles/10.3389/fphar.2024.1359832 DOI=10.3389/fphar.2024.1359832 ISSN=1663-9812 ABSTRACT=

Background: Acute myeloid leukemia (AML) is the most common form of leukemia among adults and is characterized by uncontrolled proliferation and clonal expansion of hematopoietic cells. There has been a significant improvement in the treatment of younger patients, however, prognosis in the elderly AML patients remains poor.

Methods: We used computational methods and machine learning (ML) techniques to identify and explore the differential high-risk genes (DHRGs) in AML. The DHRGs were explored through multiple in silico approaches including genomic and functional analysis, survival analysis, immune infiltration, miRNA co-expression and stemness features analyses to reveal their prognostic importance in AML. Furthermore, using different ML algorithms, prognostic models were constructed and validated using the DHRGs. At the end molecular docking studies were performed to identify potential drug candidates targeting the selected DHRGs.

Results: We identified a total of 80 DHRGs by comparing the differentially expressed genes derived between AML patients and normal controls and high-risk AML genes identified by Cox regression. Genetic and epigenetic alteration analyses of the DHRGs revealed a significant association of their copy number variations and methylation status with overall survival (OS) of AML patients. Out of the 137 models constructed using different ML algorithms, the combination of Ridge and plsRcox maintained the highest mean C-index and was used to build the final model. When AML patients were classified into low- and high-risk groups based on DHRGs, the low-risk group had significantly longer OS in the AML training and validation cohorts. Furthermore, immune infiltration, miRNA coexpression, stemness feature and hallmark pathway analyses revealed significant differences in the prognosis of the low- and high-risk AML groups. Drug sensitivity and molecular docking studies revealed top 5 drugs, including carboplatin and austocystin-D that may significantly affect the DHRGs in AML.

Conclusion: The findings from the current study identified a set of high-risk genes that may be used as prognostic and therapeutic markers for AML patients. In addition, significant use of the ML algorithms in constructing and validating the prognostic models in AML was demonstrated. Although our study used extensive bioinformatics and machine learning methods to identify the hub genes in AML, their experimental validations using knock-out/-in methods would strengthen our findings.