Skip to main content

METHODS article

Front. Genet.
Sec. Computational Genomics
Volume 15 - 2024 | doi: 10.3389/fgene.2024.1381851
This article is part of the Research Topic Computational Genomic and Precision Medicine View all 3 articles

SAFE-MIL: A statistically interpretable framework for screening potential targeted therapy patients based on risk estimation

Provisionally accepted
  • 1 School of Computer Science and Technology, Xi’an Jiaotong University, Xi'an, China
  • 2 Geneplus Beijing Institute, Huilongguan Town, Beijing, China
  • 3 Shanghai Chest Hospital, Shanghai Jiao Tong University, Shanghai, Beijing Municipality, China
  • 4 Department of Oncology, Shanghai Chest Hospital, Shanghai Jiao Tong University, Shanghai, China

The final, formatted version of the article will be published soon.

    Patients with the target gene mutation frequently derive significant clinical benefits from target therapy. However, differences in the abundance level of mutations among patients resulted in varying survival benefits, even among patients with the same target gene mutations. Currently, there is a lack of rational and interpretable models to assess the risk of treatment failure. In this study, we investigated the underlying coupled factors contributing to variations in medication sensitivity and established a statistically interpretable framework, named SAFE-MIL, for risk estimation. We first constructed an effectiveness label for each patient from the perspective of exploring the optimal grouping of patients' positive judgment values and sampled patients into 600 and 1000 groups, respectively, based on multiinstance learning (MIL). A novel and interpretable loss function was further designed based on the Hosmer-Lemeshow test for this framework. By integrating multi-instance learning with the Hosmer-Lemeshow test, SAFE-MIL is capable of accurately estimating the risk of drug treatment failure across diverse patient cohorts and providing the optimal threshold for assessing the risk stratification simultaneously. We conducted a comprehensive case study involving 457 non-small cell lung cancer patients with EGFR mutations treated with EGFR tyrosine kinase inhibitors. Results demonstrate that SAFE-MIL outperforms traditional regression methods with higher accuracy and can accurately assess patients' risk stratification. This underscores its ability to accurately capture inter-patient variability in risk while providing statistical interpretability. SAFE-MIL is able to effectively guide clinical decisionmaking regarding the use of drugs in targeted therapy and provides an interpretable computational framework for other patient stratification problems. The SAFE-MIL framework has proven its effectiveness in capturing inter-patient variability in risk and providing statistical interpretability. It outperforms traditional regression methods and can effectively guide clinical decision-making in the use of drugs for targeted therapy. SAFE-MIL offers a valuable interpretable computational framework that can be applied to other patient stratification problems, enhancing the precision of risk assessment in personalized medicine. The source code for SAFE-MIL is available for further exploration and application at https://github.com/Nevermore233/SAFE-MIL.

    Keywords: epidermal growth factor receptor, Non-small cell lung cancer, target therapy, Risk estimation, Hosmer-Lemeshow test, Multi-instance learning

    Received: 04 Feb 2024; Accepted: 31 Jul 2024.

    Copyright: © 2024 Guan, Xue, Wang, Ai, Chen, Yi, Lu and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Shun Lu, Department of Oncology, Shanghai Chest Hospital, Shanghai Jiao Tong University, Shanghai, 200030, China
    Yuqian Liu, School of Computer Science and Technology, Xi’an Jiaotong University, Xi'an, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.