Skip to main content

ORIGINAL RESEARCH article

Front. Earth Sci.
Sec. Geohazards and Georisks
Volume 13 - 2025 | doi: 10.3389/feart.2025.1511413

Tunnel squeezing prediction based on partially missing dataset and optimized machine learning models

Provisionally accepted
Peng Guan Peng Guan 1Guangzhao Ou Guangzhao Ou 2*Feng Liang Feng Liang 3Weibang Luo Weibang Luo 3Qingyong Wang Qingyong Wang 4Chengyuan Pei Chengyuan Pei 4Xuan Che Xuan Che 1
  • 1 China University of Geosciences Wuhan, Wuhan, China
  • 2 Hunan University of Finance and Economics, Changsha, Hunan, China
  • 3 Xinjiang Survey and Design Institute for Water Resources and Hydropower, Urumqi, China
  • 4 Xinjiang Water Conservancy Development and Construction Group Co., Ltd., Urumqi, China

The final, formatted version of the article will be published soon.

    Abstract:Accurate prediction of tunnel squeezing, one of the common geological hazards during tunnel construction, is of great significance for ensuring construction safety and reducing economic losses. To achieve precise prediction of tunnel squeezing, this study constructed six reliable machine learning (ML) classification models for this purpose, including Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), and K-Nearest Neighbors (KNN). The parameters of these six ML models were optimized using the Whale Optimization Algorithm (WOA) in conjunction with five-fold crossvalidation. A total of 305 tunnel squeezing sample data were collected to train and test the models. KNN and Synthetic Minority Over-sampling Technique (SMOTE) methods were employed to handle the missing and imbalanced data sets. An input feature system for tunnel squeezing prediction was established, comprising tunnel burial depth (H), tunnel diameter (D), strength-to-stress ratio (SSR), and support stiffness (K). The XGBoost model optimized with WOA demonstrated the highest prediction accuracy of 0.9681. The SHAP method was utilized to interpret the XGBoost model, indicating that the contribution rank of the input features to tunnel squeezing prediction was SSR > K > D > H, with average SHAP values of 2.93, 1.49, 0.82, and 0.69, respectively. The XGBoost model was applied to predict tunnel squeezing in 10 sections of the Qinghai Huzhu Beishan Tunnel. The prediction results were highly consistent with the actual outcomes.

    Keywords: Tunnel squeezing prediction, machine learning, Whale optimization algorithm, Model interpretation, Missing Dataset

    Received: 15 Oct 2024; Accepted: 09 Jan 2025.

    Copyright: © 2025 Guan, Ou, Liang, Luo, Wang, Pei and Che. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Guangzhao Ou, Hunan University of Finance and Economics, Changsha, Hunan, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.