Skip to main content

ORIGINAL RESEARCH article

Front. Pharmacol.
Sec. Pharmacoepidemiology
Volume 15 - 2024 | doi: 10.3389/fphar.2024.1395707
This article is part of the Research Topic Methods and Metrics to Measure Medication Adherence View all 6 articles

Machine Learning Methods for Propensity and Disease Risk Score Es9ma9on in High-Dimensional Data: a Plasmode Simula9on and Real-world Data Cohort Analysis

Provisionally accepted
  • 1 Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Medical Sciences Division, University of Oxford, Oxford, England, United Kingdom
  • 2 Boehringer Ingelheim, Ingelheim, Germany
  • 3 Department of Medical Informatics, Erasmus Medical Center, Rotterdam, Netherlands

The final, formatted version of the article will be published soon.

    Machine learning (ML) methods are promising and scalable alternatives for propensity score (PS) estimation, but their comparative performance in disease risk score (DRS) estimation remains unexplored. We used real-world data comparing antihypertensive users to non-users with 69 negative control outcomes, and plasmode simulations to study the performance of ML methods in PS and DRS estimation. We conducted a cohort study using UK primary care records. Further, we conducted a plasmode simulation with synthetic treatment and outcome mimicking empirical data distributions. We compared four PS and DRS estimation methods: 1. Reference: Logistic regression including clinically chosen confounders. 2. Logistic regression with L1 regularisation (LASSO). 3. Multi-layer perceptron (MLP). 4. Extreme Gradient Boosting (XgBoost). Covariate balance, coverage of the null effect of negative control outcomes (real-world data) and bias based on the absolute difference between observed and true effects (for plasmode) were estimated. 632,201 antihypertensive users and non-users were included. ML methods outperformed the reference method for PS estimation in some scenarios, both in terms of covariate balance and coverage/bias. Specifically, XgBoost achieved the best performance. DRS-based methods performed worse than PS in all tested scenarios. We found that ML methods could be reliable alternatives for PS estimation. ML-based DRS methods performed worse than PS ones, likely given the rarity of outcomes.

    Keywords: treatment effect, observa6onal research, machine learning, Propensity scores, Disease risk scores, nega6ve control

    Received: 04 Mar 2024; Accepted: 17 Oct 2024.

    Copyright: © 2024 Guo, Strauss, Català, Jödicke, Khalid and Prieto-Alhambra. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Yuchen Guo, Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Medical Sciences Division, University of Oxford, Oxford, OX3 7HE, England, United Kingdom

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.