Skip to main content

ORIGINAL RESEARCH article

Front. Earth Sci.
Sec. Geohazards and Georisks
Volume 12 - 2024 | doi: 10.3389/feart.2024.1514591
This article is part of the Research Topic Failure Analysis and Risk Assessment of Natural Disasters Through Machine Learning and Numerical Simulation: Volume IV View all 18 articles

Impact of a multiple oversampling technique-based assessment framework on shallow rockburst prediction models

Provisionally accepted
Guozhu Rao Guozhu Rao *Yunzhang Rao Yunzhang Rao *Yangjun Xie Yangjun Xie *Qiang Huang Qiang Huang *Jiazheng Wan Jiazheng Wan *Jiyong Zhang Jiyong Zhang
  • Jiangxi University of Science and Technology, Ganzhou, China

The final, formatted version of the article will be published soon.

    The occurrence of class-imbalanced datasets is a frequent observation in natural science research, emphasizing the paramount importance of effectively harnessing them to construct highly accurate models for rockburst prediction. Initially, genuine rockburst incidents within a burial depth of 500 meters were sourced from literature, revealing a small dataset imbalance issue.Utilizing various mainstream oversampling techniques, the dataset was expanded to generate six new datasets, subsequently subjected to 12 classifiers across 84 classification processes. The model incorporating the highest-scoring model from the original dataset and the top two models from the expanded dataset, yielded a high-performance model. Findings indicate that the KMeansSMOTE oversampling technique exhibits the most substantial enhancement across the combined 12 classifiers, whereas individual classifiers favor ET+SVMSMOTE and RF+SMOTENC. Following multiple rounds of hyper parameter adjustment via random cross-validation, the ET+SVMSMOTE combination attained the highest accuracy rate of 93.75%, surpassing mainstream models for rockburst prediction. Moreover, the SVMSMOTE technique, augmenting samples with fewer categories, demonstrated notable benefits in mitigating overfitting, enhancing generalization, and improving Recall and F1 score within RF classifiers.Validated for its high generalization performance, accuracy, and reliability. This process also provides an efficient framework for model development.

    Keywords: Oversampling techniques, machine learning, Shallow rockburst intensity prediction, assessment, generalization capability

    Received: 21 Oct 2024; Accepted: 31 Dec 2024.

    Copyright: © 2024 Rao, Rao, Xie, Huang, Wan and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Guozhu Rao, Jiangxi University of Science and Technology, Ganzhou, China
    Yunzhang Rao, Jiangxi University of Science and Technology, Ganzhou, China
    Yangjun Xie, Jiangxi University of Science and Technology, Ganzhou, China
    Qiang Huang, Jiangxi University of Science and Technology, Ganzhou, China
    Jiazheng Wan, Jiangxi University of Science and Technology, Ganzhou, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.