The final, formatted version of the article will be published soon.
ORIGINAL RESEARCH article
Front. Digit. Health
Sec. Health Informatics
Volume 7 - 2025 |
doi: 10.3389/fdgth.2025.1538793
Equitable Hospital Length of Stay Prediction for Patients with Learning Disabilities and Multiple Long-term Conditions Using Machine Learning
Provisionally accepted- 1 Department of Computer Science, School of Science, Loughborough University, Loughborough, United Kingdom
- 2 Faculty of Medicine, Health and Life Science, Swansea University Medical School, Swansea, Wales, United Kingdom
- 3 Leicester Real World Evidence Unit, Diabetes Research Centre, College of Life Sciences, University of Leicester, Leicester, Leicestershire, United Kingdom
- 4 Leicester Real World Evidence Unit,, Diabetes Research Centre, College of Life Sciences, University of Leicester, Leicester, Leicestershire, United Kingdom
- 5 School of Design, Loughborough University, Loughborough, Leicestershire, United Kingdom
- 6 Leicestershire Partnership NHS Trust, Leicester, England, United Kingdom
Purpose: Individuals with learning disabilities (LD) often face higher rates of premature mortality and prolonged hospital stays compared to the general population. Predicting the length of stay (LOS) for patients with LD and multiple long-term conditions (MLTCs) is critical for improving patient care and optimising medical resource allocation. However, there is limited research on the application of machine learning (ML) models to this population. Furthermore, approaches designed for the general population often lack generalisability and fairness, particularly when applied across sensitive groups within their cohort.Method: This study analyses hospitalisations of 9,618 patients with LD in Wales using electronic health records (EHR) from the SAIL Databank. A Random Forest (RF) ML model was developed to predict hospital LOS, incorporating demographics, medication history, lifestyle factors, and 39 long-term conditions. To address fairness concerns, two bias mitigation techniques were applied: a post-processing threshold optimizer and an in-processing reductions method using an exponentiated gradient. These methods aimed to minimise performance discrepancies across ethnic groups while ensuring robust model performance.The Random Forest (RF) model outperformed other state-of-the-art models, achieving an Area Under the Curve (AUC) of 0.759 for males and 0.756 for females, a false negative rate (FNR) of 0.224 for males and 0.229 for females, and a balanced accuracy of 0.690 for males and 0.689 for females. Bias mitigation algorithms reduced disparities in prediction performance across ethnic groups, with the threshold optimizer yielding the most notable 1 Emeka Abakasanga et al.improvements. Performance metrics, including false positive rate and balanced accuracy, showed significant enhancements in fairness for the male cohort.This study demonstrates the feasibility of applying ML models to predict LOS for patients with LD and MLTCs, while addressing fairness through bias mitigation techniques. The findings highlight the potential for equitable healthcare predictions using EHR data, paving the way for improved clinical decision-making and resource management.
Keywords: Learning disabilities, Length of Stay, Bias mitigation, Threshold Optimiser, Exponentiated gradient
Received: 03 Dec 2024; Accepted: 27 Jan 2025.
Copyright: © 2025 Cosma, Abakasanga, Kousovista, Akbari, Zaccardi, Kaur, Fitt, Jun, Kiani and Gangadharan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Georgina Cosma, Department of Computer Science, School of Science, Loughborough University, Loughborough, United Kingdom
Emeka Abakasanga, Department of Computer Science, School of Science, Loughborough University, Loughborough, United Kingdom
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.