Skip to main content

ORIGINAL RESEARCH article

Front. Mater.
Sec. Computational Materials Science
Volume 12 - 2025 | doi: 10.3389/fmats.2025.1542655

Explainable AutoML Models for Predicting the Strength of High-Performance Concrete Using Optuna, SHAP and Ensemble Learning

Provisionally accepted
  • 1 College of Civil Engineering, Department of Bridge Engineering, Tongji University, Shanghai 200092, China
  • 2 State Key Laboratory of Disaster Reduction in Civil Engineering, Tongji University, Shanghai 200092, China
  • 3 Department of Civil Engineering, University of Engineering and Technology Peshawar (Bannu Campus), Bannu 28100, Pakistan
  • 4 Institute of Energy Infrastructure, Universiti Tenaga Nasional, Kajang 43000, Malaysia
  • 5 The Key Laboratory of Road and Traffic Engineering, Ministry of Education, Tongji University, 4800 Cao’an Road, Jiading, Shanghai 201804, China
  • 6 Peter the Great St. Petersburg Polytechnic University, 195251 St. Petersburg, Russia
  • 7 Muhayil Asir, Applied College , King Khalid University, Abha 62529, Saudi Arabia

The final, formatted version of the article will be published soon.

    Accurately predicting key engineering properties, such as compressive and tensile strength, remains a significant challenge in high-performance concrete (HPC) due to its complex and heterogeneous composition. Early selection of optimal components and the development of reliable machine learning (ML) models can significantly reduce the time and cost associated with extensive experimentation. This study introduces four explainable Automated Machine Learning (AutoML) models that integrate Optuna for hyperparameter optimization, SHapley Additive exPlanations (SHAP) for interpretability, and ensemble learning algorithms such as Random Forest (RF), Extreme Gradient Boosting (XGB), Light Gradient Boosting Machine (LGB), and Categorical Gradient Boosting (CB).The resulting interpretable AutoML models O-RF, O-XGB, O-LGB, and O-CB are applied to predict the compressive and tensile strengths of HPC. Compared to a baseline model from the literature, O-LGB achieved significant improvements in predictive performance. For compressive strength, it reduced the Mean Absolute Error (MAE) by 87.69% and the Root Mean Squared Error (RMSE) by 71.93%. For tensile strength, it achieved a 99.41% improvement in MAE and a 96.67% reduction in RMSE, along with increases in R². Furthermore, SHAP analysis identified critical factors influencing strength, such as cement content, water, and age for compressive strength, and curing age, water-binder ratio, and water-cement ratio for tensile strength. This approach provides civil engineers with a robust and interpretable tool for optimizing HPC properties, reducing experimentation costs, and supporting enhanced decision-making in structural design, risk assessment, and other applications.

    Keywords: High performance concrete, machine learning, Ensemble learning algorithm, Shap, Optuna, Compressive Strength, Split tensile strength

    Received: 10 Dec 2024; Accepted: 02 Jan 2025.

    Copyright: © 2025 Khan, Peng, Khan, Khan, Ahmad, Aziz, Sabri Sabri and Abd El-Gawaad. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Muhammad Adeel Khan, College of Civil Engineering, Department of Bridge Engineering, Tongji University, Shanghai 200092, China
    Asad Khan, College of Civil Engineering, Department of Bridge Engineering, Tongji University, Shanghai 200092, China
    Mahmood Ahmad, Department of Civil Engineering, University of Engineering and Technology Peshawar (Bannu Campus), Bannu 28100, Pakistan

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.