Skip to main content

ORIGINAL RESEARCH article

Front. Environ. Sci.
Sec. Environmental Informatics and Remote Sensing
Volume 12 - 2024 | doi: 10.3389/fenvs.2024.1424988

Quantitative Evaluation of Uncertainty and Interpretability in Machine Learning-Based Landslide Susceptibility Mapping Through Feature Selection and Explainable AI

Provisionally accepted
  • 1 Department of Advanced Science and Technology Convergence, Kyungpook National University, Daegu, Republic of Korea
  • 2 Department of Forest Environment and Conservation, National Institute of Forest Sicence (NIFoS), Seoul, Republic of Korea

The final, formatted version of the article will be published soon.

    Landslide susceptibility mapping (LSM) is essential for determining risk regions and guiding mitigation strategies. Machine learning (ML) techniques have been broadly utilized, but the uncertainty and interpretability of these models have not been well-studied. This study conducted a comparative analysis and uncertainty assessment of five ML algorithms—Random Forest (RF), Light Gradient-Boosting Machine (LGB), Extreme Gradient Boosting (XGB), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM)—for LSM in Inje area, South Korea. We optimized these models using Bayesian optimization, a method that refines model performance through probabilistic model-based tuning of hyperparameters. The performance of these algorithms was evaluated using accuracy, Kappa score, and F1 score, with accuracy in detecting landslide-prone locations ranging from 0.916 to 0.947. Among them, the tree-based models (RF, LGB, XGB) showed competitive performance and outperformed the other models. Prediction uncertainty was quantified using bootstrapping and Monte Carlo simulation methods, with the latter providing a more consistent estimate across models. Further, the interpretability of ML predictions was analyzed through sensitivity analysis and SHAP values. We also expanded our investigation to include both the inclusion and exclusion of predictors, providing insights into each significant variable through a comprehensive sensitivity analysis. This paper provides insights into the predictive uncertainty and interpretability of ML algorithms for LSM, contributing to future research in South Korea and beyond.

    Keywords: Bayesian optimization, bootstrapping, Landslide susceptibility, monte carlo, Shap, uncertainty analysis

    Received: 29 Apr 2024; Accepted: 03 Jul 2024.

    Copyright: © 2024 Le, Choi, Eu, Yeon and Lee. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Giha Lee, Department of Advanced Science and Technology Convergence, Kyungpook National University, Daegu, Republic of Korea

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.