Skip to main content

ORIGINAL RESEARCH article

Front. Artif. Intell.
Sec. Machine Learning and Artificial Intelligence
Volume 8 - 2025 | doi: 10.3389/frai.2025.1504281

Transfer Learning Based Hybrid VGG16-Machine Learning Approach for Heart Disease Detection with Explainable Artificial Intelligence

Provisionally accepted
  • 1 University of Gondar, Gondar, Ethiopia
  • 2 Department of Information System, College of Informatics, University of Gondar, Gondar, Amhara Region, Ethiopia
  • 3 College of Natural and Computational Science, Mekdela Amba University, Tulu Awuliya, Ethiopia
  • 4 department of computer science, college of informatics, university of gondar, gondar ethiopia, Gondar, Ethiopia
  • 5 Department of Surgery, School of Medicine, College of Medicine and Health sciences, University of Gondar, Gondar, Amhara Region, Ethiopia

The final, formatted version of the article will be published soon.

    Heart disease is a leading cause of mortality worldwide, and accurate early detection is crucial for effective treatment and management. In this study, we propose a novel hybrid machine learning approach that leverages transfer learning using the VGG16 convolutional neural network (CNN) combined with multiple machine learning classifiers for heart disease detection. Conditional tabular generative adversarial network (CTGAN) was employed to generate synthetic data samples from the actual datasets; statistical metrics, correlation analysis and domain experts evaluation were incorporated to evaluate the quality of synthetic datasets. The dataset comprises tabular data with 13 features, which is reshaped into image like format and resized to 224x224x3 to be compatible with the input requirements of the VGG16 model. Feature extraction is performed using VGG16, and the extracted features are fused with the original tabular data. This combined feature set is then used to train various machine learning models, including Support Vector Machines (SVM), Gradient Boosting, Random Forest, Logistic Regression, K-Nearest Neighbours (KNN), and Decision Tree. Among these models, the VGG16-Random Forest hybrid achieved a promising result across all evaluation metrics, including 92% of accuracy, 91.3% precision, 92.2% recall, 91.82% specificity, 92.2% sensitivity, and 91.75% F1-score. The hybrid models were also evaluated by unseen datasets to determine generalizability capabilities of the proposed approaches and relatively promising results are achieved by VGG16-Random Forest. Additionally, explainability is integrated into the model using SHAP values, providing insights into the contribution of each feature on the model prediction. This hybrid VGG16-ML approach demonstrates the potential for highly accurate heart disease detection with interpretable outcomes, offering valuable assistance in clinical decision-making processes.

    Keywords: machine learning, deep learning, Heart disease, artificial intelligence explainability, feature extraction, VGG16-random forest, CTGAN

    Received: 30 Sep 2024; Accepted: 31 Jan 2025.

    Copyright: © 2025 Addisu, Yirga, Yirga and Yehuala. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Eshetie Gizachew Addisu, University of Gondar, Gondar, Ethiopia

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.