The final, formatted version of the article will be published soon.
ORIGINAL RESEARCH article
Front. Artif. Intell.
Sec. Medicine and Public Health
Volume 7 - 2024 |
doi: 10.3389/frai.2024.1499530
This article is part of the Research Topic Artificial Intelligence for Diabetes Related Complications and Metabolic Health View all 3 articles
Robust Predictive Framework for Diabetes Classification Using Optimized Machine Learning on Imbalanced Datasets
Provisionally accepted- 1 Dept. of Information Technology, Faculty of Computers and Information Technology, University of Tabuk, tabuk, Saudi Arabia
- 2 Dept. of Electronics and Electrical Comm., Higher Institute of Engineering and Technology, Kafrelsheikh, Egypt, Kafr El Sheikh, Egypt
- 3 Dept. of Computer and Automatic Control, Faculty of Engineering, Tanta University, Tanta, Egypt
Diabetes prediction from clinical datasets is a critical challenge in medical data analysis, particularly due to the frequent imbalance in class distributions, where non-diabetic cases often dominate. Such imbalances can compromise the predictive performance of machine learning models, leading to biased outcomes and reduced generalization. To address these challenges, we present a novel predictive framework that leverages cuttingedge machine learning algorithms and sophisticated imbalance handling techniques. Our approach systematically evaluates various machine learning models, applying advanced feature engineering and resampling strategies to enhance predictive accuracy. We conduct rigorous testing on three widely recognized datasets-PIMA, Diabetes Dataset 2019, and BIT 2019 -demonstrating the robustness and adaptability of our methodology across different data environments. This work introduces a comprehensive experimental setup that underscores the importance of model selection and imbalance mitigation in achieving reliable and generalizable diabetes prediction outcomes. The findings contribute significant insights to the field of medical informatics, offering a robust, data-driven framework that improves prediction accuracy in the presence of challenging class imbalances, thereby advancing the state-of-the-art in diabetes prediction.
Keywords: Diabetes detection, Imbalance handling methods, Imbalanced datasets, machine learning, Statistical analysis Robust Predictive Framework for Diabetes Classification Using Optimized Machine Learning on Imbalanced Datasets 3
Received: 21 Sep 2024; Accepted: 12 Dec 2024.
Copyright: © 2024 Abousaber, Abdallah and El-Ghaish. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Haitham F. Abdallah, Dept. of Electronics and Electrical Comm., Higher Institute of Engineering and Technology, Kafrelsheikh, Egypt, Kafr El Sheikh, Egypt
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.