AUTHOR=Chen Qing , Zhang Ji , Bao Banghe , Zhang Fan , Zhou Jie TITLE=Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree JOURNAL=Frontiers in Molecular Biosciences VOLUME=8 YEAR=2022 URL=https://www.frontiersin.org/journals/molecular-biosciences/articles/10.3389/fmolb.2021.815243 DOI=10.3389/fmolb.2021.815243 ISSN=2296-889X ABSTRACT=

The early clinical symptoms of gastric cancer are not obvious, and metastasis may have occurred at the time of treatment. Poor prognosis is one of the important reasons for the high mortality of gastric cancer. Therefore, the identification of gastric cancer-related genes can be used as relevant markers for diagnosis and treatment to improve diagnosis precision and guide personalized treatment. In order to further reveal the pathogenesis of gastric cancer at the gene level, we proposed a method based on Gradient Boosting Decision Tree (GBDT) to identify the susceptible genes of gastric cancer through gene interaction network. Based on the known genes related to gastric cancer, we collected more genes which can interact with them and constructed a gene interaction network. Random Walk was used to extract network association of each gene and we used GBDT to identify the gastric cancer-related genes. To verify the AUC and AUPR of our algorithm, we implemented 10-fold cross-validation. GBDT achieved AUC as 0.89 and AUPR as 0.81. We selected four other methods to compare with GBDT and found GBDT performed best.