Skip to main content

ORIGINAL RESEARCH article

Front. Educ.
Sec. Digital Education
Volume 9 - 2024 | doi: 10.3389/feduc.2024.1390892

Educational Data Mining (EDM): Influencing Factors on Medical Student Success; Exploring Visualization Techniques

Provisionally accepted
Ploywarong Rueangket Ploywarong Rueangket 1*Juraluck Taebunphakul Juraluck Taebunphakul 2*Boonsub Sakboonyarat Boonsub Sakboonyarat 2*Akara Prayote Akara Prayote 3*
  • 1 Department of obstetrics and gynecology, Phramongkutklao Hospital, Bangkok, Thailand
  • 2 Phramongkutklao Hospital, Bangkok, Thailand
  • 3 Department of Computer and Information Science, Faculty of Applied Science, King Mongkut’s University of Technology North Bangkok, Bangkok, Thailand

The final, formatted version of the article will be published soon.

    Objectives: Medical student education plays a vital role in enabling graduated doctors to contribute significantly to patient healthcare and the national public health system. This study aimed to identify influential factors on student academic success (Honors level or High-grade group) using data mining techniques for multidimensional educational data.A retrospective cohort study was conducted using a standardized questionnaire administered to 145 medical students. Thirteen factors attributed to four domains; academic activity, demographics, environment, and psychology or learning style were examined. Analysis of prevalence ratio (PR) and adjusted prevalence ratio (APR) was performed using multivariate logistic regression. Unsupervised learning techniques, including cluster analysis and association rules, were used to identify hidden patterns.Visualization techniques, such as heatmaps and centroid plots, based on cluster analysis, were employed to illustrate data relationships and enhance the interpretation of key trends. Internal validation was evaluated.Results: Of the 13 factors analyzed, logistic regression identified pre-med GPAX ≥ 3.75 and an interest in internal medicine as statistically significant predictors of high academic performance, with adjusted prevalence ratios (APRs) of 1.73 (95% CI, 1.02-2.91, p = 0.040) and 1.52 (95% CI, 1.14-2.03, p = 0.005), respectively. Cluster analysis revealed typical characteristics of highgrade students, including a metropolis residence, a very high pre-med GPAX, and a preference for kinesthetic and reading learning styles. Association rules analysis further emphasized the importance of environmental factors, identifying transportation time to school and access to learning resources as influential in supporting academic success.Educational Data Mining (EDM) visually highlighted essential factors in medical student success. Logistic regression identified pre-med GPAX and interest in internal medicine as key predictors; cluster analysis uncovered learning style patterns associated with performance; and association rules emphasized environmental factors, such as proximity to school and access to resources. Together, these methods provide a comprehensive, visual framework to support a holistic, data-driven approach in educational planning, potentially offering new insights for addressing broader medical challenges and advancing clinical practice.

    Keywords: educational data mining, Medical student, Logistic regression, Unsupervised learning techniques, Visualization techniques

    Received: 24 Feb 2024; Accepted: 18 Nov 2024.

    Copyright: © 2024 Rueangket, Taebunphakul, Sakboonyarat and Prayote. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Ploywarong Rueangket, Department of obstetrics and gynecology, Phramongkutklao Hospital, Bangkok, Thailand
    Juraluck Taebunphakul, Phramongkutklao Hospital, Bangkok, 10400, Thailand
    Boonsub Sakboonyarat, Phramongkutklao Hospital, Bangkok, 10400, Thailand
    Akara Prayote, Department of Computer and Information Science, Faculty of Applied Science, King Mongkut’s University of Technology North Bangkok, Bangkok, Thailand

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.