Skip to main content

ORIGINAL RESEARCH article

Front. Digit. Health
Sec. Connected Health
Volume 6 - 2024 | doi: 10.3389/fdgth.2024.1500811
This article is part of the Research Topic Digital Remote Patient Monitoring in Neurodegenerative Diseases View all 8 articles

A Novel Machine Learning Based Framework for Developing Composite Digital Biomarkers of Disease Progression

Provisionally accepted
  • 1 Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, United States
  • 2 Nuffield Department of Clinical Neurosciences, Medical Sciences Division, University of Oxford, Oxford, England, United Kingdom
  • 3 Nuffield Department of Surgical Sciences, Medical Sciences Division, University of Oxford, Oxford, England, United Kingdom
  • 4 Division of Clinical Neurology, Nuffield Department of Clinical Neurosciences, Medical Sciences Division, University of Oxford, Oxford, England, United Kingdom
  • 5 Digital Clinical Measures, Translational Medicine, Merck & Co., Inc., Rahway, United States

The final, formatted version of the article will be published soon.

    Background: Current methods of measuring disease progression of neurodegenerative disorders, including Parkinson's disease (PD), largely rely on composite clinical rating scales, which are prone to subjective biases and lack the sensitivity to detect progression signals in a timely manner. Digital health technology (DHT)-derived measures offer potential solutions to provide objective, precise, and sensitive measures that address these limitations. However, the complexity of DHT datasets and the potential to derive numerous digital features that were not previously possible to measure pose challenges, including in selection of the most important digital features and construction of composite digital biomarkers. Methods: We present a comprehensive machine learning based framework to construct composite digital biomarkers for progression tracking. This framework consists of a marginal (univariate) digital feature screening, a univariate association test, digital feature selection, and subsequent construction of composite (multivariate) digital disease progression biomarkers using Penalized Generalized Estimating Equations (PGEE). As an illustrative example, we applied this framework to data collected from a PD longitudinal observational study. The data consisted of Opal TM sensor-based movement measurements and MDS-UPDRS Part III scores collected at 3-month intervals for 2 years in 30 PD and 10 healthy control participants. Results: In our illustrative example, 77 out of 235 digital features from the study passed univariate feature screening, with 11 features selected by PGEE to include in construction of the composite digital measure. Compared to MDS-UPDRS Part III, the composite digital measure exhibited a smoother and more significant increasing trend over time in PD groups with less variability, indicating improved ability for tracking disease progression. This composite digital measure also demonstrated the ability to classify between de novo PD and healthy control groups. Conclusion: Measures from DHTs show promise in tracking neurodegenerative disease progression with increased sensitivity and reduced variability as compared to traditional clinical scores. Herein, we present a novel framework and methodology to construct composite digital measure of disease progression from high-dimensional DHT datasets, which may have utility in accelerating the development and application of composite digital biomarkers in drug development.

    Keywords: Composite digital biomarker, Parkinson's disease, disease progression, Linear mixed effects model, machine learning, Penalized generalized estimating equations

    Received: 24 Sep 2024; Accepted: 16 Dec 2024.

    Copyright: © 2024 Zhai, Liaw, Shen, Xu, Svetnik, FitzGerald, Antoniades, Holder, Dockendorf, Ren and Baumgartner. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Jie Ren, Digital Clinical Measures, Translational Medicine, Merck & Co., Inc., Rahway, United States
    Richard Baumgartner, Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, United States

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.