Hypoglycemia event prediction from CGM using ensemble learning

Fleischer, Jesper; Hansen, Troels Krarup; Cichosz, Simon Lebech

doi:10.3389/fcdhc.2022.1066744

BRIEF RESEARCH REPORT article

Front. Clin. Diabetes Healthc., 09 December 2022

Sec. Diabetes Self-Management

Volume 3 - 2022 | https://doi.org/10.3389/fcdhc.2022.1066744

This article is part of the Research Topic: Exploring the Benefits of Digital Health Technologies in Diabetes Management

Jesper Fleischer¹,²

Troels Krarup Hansen¹

Simon Lebech Cichosz³*

¹Steno Diabetes Center Aarhus, Aarhus, Denmark
²Steno Diabetes Center Zealand, Holbæk, Denmark
³Department of Health Science and Technology, Aalborg University, Aalborg, Denmark

This work sought to explore the potential of using standalone continuous glucose monitor (CGM) data for the prediction of hypoglycemia utilizing a large cohort of type 1 diabetes patients during free-living. We trained and tested an algorithm for the prediction of hypoglycemia within 40 minutes on 3.7 million CGM measurements from 225 patients using ensemble learning. The algorithm was also validated using 11.5 million synthetic CGM measurements. The results yielded a receiver operating characteristic area under the curve (ROC AUC) of 0.988 and a precision-recall area under the curve (PR AUC) of 0.767. In an event-based analysis for predicting hypoglycemic events, the algorithm had a sensitivity of 90%, a lead-time of 17.5 minutes and a false-positive rate of 38%. In conclusion, this work demonstrates the potential of using ensemble learning to predict hypoglycemia using only CGM data. This could help alert patients to an impending hypoglycemic event so that countermeasures can be initiated.

Introduction

Hypoglycemia is related to increased physical and mental health problems and is a major risk factor for mortality (1, 2). Hypoglycemia can result from exogenous or endogenous insulin excess alone. The clinical manifestation is often characteristic, but the neurogenic and neuroglycopenic symptoms of hypoglycemia are nonspecific and relatively insensitive (3). Consequently, many episodes of hypoglycemia are not recognized or are treated late in their progression (3). It is very important to prevent, identify and treat hypoglycemic events secondary to the use of insulin. Additionally, it is safer for the patients and more effective to prevent hypoglycemia than to treat it after it occurs (4).

Hypoglycemia is common among patients with insulin-dependent diabetes. Patients who aim for a strict glycemic target experience frequent episodes of asymptomatic hypoglycemia and severe hypoglycemia (5). Studies suggest that plasma glucose levels may be less than 60 mg/dL (3.3 mmol/L) up to 10% of the day (6, 7). Furthermore, patients with type 1 diabetes suffer from an average of two weekly incidents of symptomatic hypoglycemia (6, 7).

However, newer studies on patients utilizing continuous glucose monitoring (CGM) have shown that time below range (< 3.9 mmol/L) was estimated to be 5.4% with a mean HbA1c of 7.0% (52 mmol/mol) (8).

Blood glucose prediction is about forecasting a patient’s future blood glucose levels using current and past information and is also an important constituent of blood glucose anomaly classification approaches. One potential method to reduce episodes of hypoglycemia is prediction models that alert patients early enough to begin countermeasures. Such models can be implemented directly into the CGM systems or as an add-on in the patient’s smartphone applications connected to the systems (9).

In previous studies (10–13), we investigated the potential of using a continuous glucose monitor (CGM) combined with heart rate variability (HRV) to predict hypoglycemia for the purpose of early intervention. Many others have also reported the potential of predicting future glucose levels using CGM combined with multiple data sources such as insulin, physical activity, food intake, and stress response (14, 15). Obtaining these multiple data in real time is not always practical (9). Moreover, most studies that take the more practical approach of utilizing only CGM data are based on a limited number of patients and short CGM wear-time, and are not validated in external cohorts of patients (9, 14, 16). Therefore, we sought to further explore the potential of using only CGM data for the prediction of hypoglycemia in a proof-of-concept analysis using a large cohort of type 1 diabetes patients during normal daily living and validating the results in an external CGM database.

Methods

The study cohort comprised CGM data derived from individuals who were enrolled in the REPLACE-BG trial (17). The REPLACE-BG study design was a 6-month parallel group multicenter randomized clinical trial. A total of 225 patients ≥18 years of age (mean ± standard deviation or median (interquartile range): age: 44 ± 14 years, duration of diabetes: 23 ± 12 years, BMI: 27.7 ± 4.1, HbA1c: 7.1 ± 0.7% (54 mmol/mol), time in range: 63 ± 13%, time below 70 mg/dL: 2.9% (1.5–5.1)) with type 1 diabetes were enrolled from diabetes clinics and used CGM (Dexcom G4) for up to 6 months. The characteristics are presented in Table 1.

Table 1. Patient characteristics, presented as mean ± standard deviation for parametric characteristics or median (interquartile range) for non-parametric characteristics.

We trained and tested an algorithm for the prediction of hypoglycemia within 40 minutes on 3.7 million CGM measurements from 225 patients using an ensemble learning approach named RUSBoost (18). In short, ensemble learning is a general meta-approach to machine learning that seeks better performance by combining the predictions from multiple models. RUSBoost has been reported to be a fast and robust classifier for datasets with imbalanced data. For training, 70% of the data were utilized (split on a patient level) and the remaining 30% were reserved for testing the performance of the final model.
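Two ideas in this paragraph can be illustrated with a minimal Python sketch: splitting on a patient level, and the random undersampling step that RUSBoost combines with boosting to cope with class imbalance. This is not the authors' MATLAB implementation; the function names, the fixed seed and the 1:1 class balance are assumptions made for illustration only.

```python
import random
from collections import defaultdict


def split_by_patient(patient_ids, train_fraction=0.7, seed=0):
    """Return (train_ids, test_ids) so that no patient appears in both sets."""
    unique_ids = sorted(set(patient_ids))
    rng = random.Random(seed)
    rng.shuffle(unique_ids)
    n_train = round(train_fraction * len(unique_ids))
    return set(unique_ids[:n_train]), set(unique_ids[n_train:])


def random_undersample(samples, seed=0):
    """Randomly drop majority-class samples until all classes are equal in size.

    `samples` is a list of (features, label) pairs. The 1:1 target ratio is
    an assumption; the source does not state the ratio that was used.
    """
    by_label = defaultdict(list)
    for features, label in samples:
        by_label[label].append((features, label))
    n_minority = min(len(group) for group in by_label.values())
    rng = random.Random(seed)
    balanced = []
    for group in by_label.values():
        balanced.extend(rng.sample(group, n_minority))
    rng.shuffle(balanced)
    return balanced
```

Splitting on patient IDs rather than on individual samples keeps all readings from one person on the same side of the split, so the test performance reflects patients the model has never seen.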

The hyperparameters (learning cycles, learn rate, max splits) were determined using a grid search with 5-fold cross-validation on the training data. A hyperparameter is a parameter whose value controls the learning process of the prediction model. Grid search is a tuning strategy that performs an exhaustive search over a specified set of candidate values for each hyperparameter and selects the best-performing combination. Cross-validation is used in the process to ensure that the model is not over-tuned, which could result in worse performance on new patient data.
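A grid search with k-fold cross-validation can be sketched in a few lines of Python. The sketch is generic: `fit_and_score` is a hypothetical callback that trains a model with the given hyperparameters on the training fold and returns a validation score, and the candidate values in the usage note are invented, since the source does not list the grid that was searched.

```python
import itertools
import statistics


def k_fold_indices(n_items, k=5):
    """Yield (train_idx, val_idx) index lists for k contiguous folds."""
    fold_sizes = [n_items // k + (1 if i < n_items % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_items))
        yield train_idx, val_idx
        start += size


def grid_search(param_grid, items, fit_and_score, k=5):
    """Evaluate every combination in `param_grid`; return (best_params, best_mean_score)."""
    names = sorted(param_grid)
    best_params, best_score = None, float("-inf")
    for values in itertools.product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        scores = []
        for train_idx, val_idx in k_fold_indices(len(items), k):
            train = [items[i] for i in train_idx]
            val = [items[i] for i in val_idx]
            scores.append(fit_and_score(params, train, val))
        mean_score = statistics.mean(scores)  # average over the k validation folds
        if mean_score > best_score:
            best_params, best_score = params, mean_score
    return best_params, best_score
```

A call might look like `grid_search({"learning_cycles": [50, 100], "learn_rate": [0.01, 0.1], "max_splits": [10, 20]}, training_patients, fit_and_score)`, where `training_patients` is a hypothetical list of patient records so that folds do not share patients.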

Input to the model was the CGM data from the hour preceding the point of prediction. Hypoglycemia was defined as CGM values below 70 mg/dL for 15 minutes or more (sustained hypoglycemia); the definition was based on the recommendations in previous studies (19, 20). The algorithm was implemented using MATLAB R2020b (The MathWorks Inc., Natick, Massachusetts).
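The input window and event definition can be made concrete with a small Python sketch. It assumes a 5-minute sampling interval and treats three consecutive readings below 70 mg/dL as "15 minutes or more"; both are assumptions, as is the rule that a sample is labeled positive when a sustained event starts within the next 40 minutes. The authors' exact labeling rules are not given in the text.

```python
SAMPLE_MINUTES = 5                  # assumed CGM sampling interval
WINDOW = 60 // SAMPLE_MINUTES       # 12 readings of history as model input
HORIZON = 40 // SAMPLE_MINUTES      # 8 readings of look-ahead
SUSTAINED = 15 // SAMPLE_MINUTES    # 3 consecutive low readings (assumption)
THRESHOLD = 70                      # mg/dL


def sustained_hypo_onsets(glucose):
    """Return indices where a run of at least SUSTAINED readings below THRESHOLD starts."""
    onsets, run_start = [], None
    # A sentinel reading at the threshold closes a run that reaches the end of the series.
    for i, value in enumerate(list(glucose) + [THRESHOLD]):
        if value < THRESHOLD:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= SUSTAINED:
                onsets.append(run_start)
            run_start = None
    return onsets


def make_samples(glucose):
    """Build (history, label) pairs; label is 1 if a sustained event starts within HORIZON."""
    onsets = set(sustained_hypo_onsets(glucose))
    samples = []
    for t in range(WINDOW - 1, len(glucose) - HORIZON):
        history = glucose[t - WINDOW + 1 : t + 1]
        label = int(any((t + step) in onsets for step in range(1, HORIZON + 1)))
        samples.append((history, label))
    return samples
```

With this labeling, readings taken during an ongoing event are not positives; only the 40 minutes leading up to an onset are, which matches the goal of warning before the threshold is crossed.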

In addition to the data from real patients, the algorithm was also tested on 11.5 million synthetic CGM measurements from the publicly available SCGMS database (18). The database mimics CGM data from patients with type 1 diabetes and healthy individuals with different HbA1c levels using a Conditional Generative Adversarial Network (CGAN) (21). In short, CGAN is a novel method to construct a neural network which can be used to generate realistic biological signals. The external validation was conducted to determine the generalizability of the model in people with different glycemic control.

To evaluate the performance of the trained model, we conducted a sample-based assessment that comprised every datapoint in the test dataset (real patients) and the synthetic dataset. The sample-based performance was assessed using the receiver operating characteristic (ROC) curve and the precision-recall (PR) curve, with the accompanying area under the curve (AUC). The metrics from the sample-based assessment are important for between-model comparison. However, from a clinical or patient perspective, an event-based assessment is more useful for evaluating the performance.
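For readers unfamiliar with the two sample-based metrics, the following Python sketch computes them from labels and predicted scores. ROC AUC is computed through its rank interpretation (the probability that a random positive sample scores higher than a random negative one), and PR AUC is approximated by average precision. The step-wise approximation is an assumption; the source does not say how the area under the PR curve was integrated.

```python
def roc_auc(labels, scores):
    """ROC AUC via the rank-sum (Mann-Whitney) formulation; needs both classes present."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1  # tied scores share the mean rank
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def average_precision(labels, scores):
    """Step-wise area under the precision-recall curve; tied scores keep input order."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    n_pos = sum(labels)
    true_positives = 0
    total = 0.0
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            true_positives += 1
            total += true_positives / rank  # precision at this recall step
    return total / n_pos
```

Because a random classifier scores about 0.5 on ROC AUC regardless of prevalence but only about the positive-class prevalence on PR AUC, the PR curve is the more demanding of the two when events are rare.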

Therefore, we also conducted an event-based assessment on each episode of hypoglycemia to test how many episodes were detected, the lead-time (prediction time) and the number of false positives. The event-based assessment was included to assess performance in clinical in-use situations.
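One way to carry out such an event-based assessment is sketched below in Python. The matching rules are assumptions, since the text does not spell them out: an event counts as detected if any alarm fired in the 40 minutes before its onset, the lead-time is measured from the earliest such alarm, and an alarm is false if no event begins within 40 minutes after it.

```python
def event_based_metrics(alarm_times, event_onsets, horizon=40):
    """Match alarms to events; all times are minutes from a common start.

    Returns sensitivity, mean lead-time of detected events, and the
    fraction of alarms not followed by an event within `horizon` minutes.
    """
    lead_times = []
    for onset in event_onsets:
        preceding = [a for a in alarm_times if 0 < onset - a <= horizon]
        if preceding:
            lead_times.append(onset - min(preceding))  # earliest alarm in the window
    false_alarms = [
        a for a in alarm_times
        if not any(0 < onset - a <= horizon for onset in event_onsets)
    ]
    return {
        "sensitivity": len(lead_times) / len(event_onsets),
        "mean_lead_time": sum(lead_times) / len(lead_times) if lead_times else None,
        "false_alarm_fraction": len(false_alarms) / len(alarm_times) if alarm_times else None,
    }
```

For example, `event_based_metrics([100, 480, 900], [120, 500, 1300])` describes three alarms and three events: two events are preceded by an alarm 20 minutes earlier, one event is missed, and the alarm at minute 900 is not followed by any event.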

Results

Sample-based test results

From 1,110,000 samples in the test dataset, the algorithm achieved a receiver operating characteristic area under the curve (ROC AUC) of 0.988 and a precision-recall area under the curve (PR AUC) of 0.767. The ROC and PR curves are illustrated in Figure 1.

Figure 1. ROC and PR curves of the sample-based performance from the test dataset (real patients).

From the 11,500,000 samples of synthetic data, the assessment yielded a receiver operating characteristic area under the curve (ROC AUC) of 0.988 and a precision-recall area under the curve (PR AUC) of 0.879.

Event-based test results

The event-based assessment yielded a sensitivity of 90%, a lead-time of 17.5 minutes and a false-positive rate of 38%. Due to the class imbalance (few events compared to non-events), the specificity and negative predictive value are both high (>99%). The prediction was on average triggered at a glucose level of 83 mg/dL.

Translated to round estimates, this would mean that 9 out of 10 hypoglycemia events were detected on average 17 minutes prior to the first CGM value below 70 mg/dL, with about 2 out of 3 alarms being true. The metrics are calculated from a total of 3725 hypoglycemic events in the test dataset. The test dataset comprised 5,456,905 minutes of CGM wear time during daily living.
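The round estimates can be checked with a few lines of arithmetic, shown here in Python. The calculation assumes that the reported 38% false-positive rate is the share of alarms that are not followed by an event, which is the reading the round estimate of true alarms implies; the variable names are illustrative.

```python
test_minutes = 5_456_905       # CGM wear time in the test dataset
events = 3725                  # hypoglycemic events in the test dataset
sensitivity = 0.90
false_alarm_fraction = 0.38    # assumed to be false alarms / all alarms

wear_days = test_minutes / (60 * 24)
events_per_day = events / wear_days
detected_events = sensitivity * events
true_alarm_fraction = 1 - false_alarm_fraction

print(f"wear time: {wear_days:.0f} days")
print(f"events per day of wear: {events_per_day:.2f}")
print(f"events detected: {detected_events:.0f}")
print(f"share of alarms that are true: {true_alarm_fraction:.2f}")
```

The test dataset therefore corresponds to roughly 3,790 days of wear with close to one event per day, and a true-alarm share of 0.62 rounds to the "2 out of 3" quoted in the text.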

Figure 2 shows an example of prediction from three days of CGM wear. The patient would be alarmed three times, where the first two alarms are true positives, while the last is a rapid decline in glucose levels that does not lead to hypoglycemia.

Figure 2. An example of three days of continuous glucose monitoring from a patient. The dots illustrate the point in time where an alarm is activated for high hypoglycemic risk. The green dots are true positives, and the red dot is a false alarm. The red line is the threshold for hypoglycemia (70 mg/dL).

Discussion

The event-based assessment shows that it is possible to predict a large proportion of hypoglycemic events with a lead-time that makes it possible for patients to reverse the situation and potentially avoid severe hypoglycemia. Especially at night and during rapid dips in glucose, it is extremely important to be aware of the risk and start timely treatment with ingestion of fast-absorbing carbohydrates or potentially glucagon to avoid complications related to severe hypoglycemia. Early prediction of hypoglycemia, and thereby earlier intervention, could also potentially aid in reducing the time in hypoglycemia.

In comparison, the commercial Dexcom G6 system advertises hypoglycemia prediction up to 20 minutes prior to hypoglycemia, defined at a lower threshold of 55 mg/dL (18). However, without data on the average lead-time, sensitivity and false-positive rate, it is difficult to compare the predictive capabilities. Nevertheless, improving the prediction capability as in our study, with event prediction up to 40 minutes ahead (average lead-time 17.5 minutes) of an event (<70 mg/dL), would enable faster action to avoid mild or severe hypoglycemia. Accurate prediction models could also be used in a closed-loop system to suspend insulin dosing in order to avoid severe hypoglycemia.

Recent studies by Dave et al. (15, 22) have reported interesting results on the multisource prediction of hypoglycemia using a battery of features from CGM, insulin, meal intake and demographic data. From a cohort of 110 pediatric patients, they reported event prediction with >97% sensitivity and specificity and a false alert rate <25%. However, due to the differences in sensor model, methodological assessment and cohort characteristics between the studies, it is challenging to compare results head-to-head and conclude whether the use of additional data is worth the practical implications. CGM-based hypoglycemic event prediction seems like an attractive approach due to the simplicity of implementing it into already running commercial CGM sensors or analytic platforms for CGM data.

Additionally, Seo et al. (23) proposed a model for predicting postprandial hypoglycemia using CGM and meal announcement. The study explored retrospective CGM datasets of 104 people who had experienced at least one hypoglycemia event during a three-day CGM session. The best performance reported in the study was an average AUC of 0.966, average sensitivity of 89.6%, and average specificity of 91.3%. Marcus et al. (24) published results from 11 patients with type 1 diabetes; they proposed a prediction model for hypoglycemia with a sensitivity of 64% and a low false-positive rate of 4%.

This study has some limitations. The proposed model still needs to be tested in a broader spectrum of patients and CGM sensors, and we cannot generalize the performance to all CGM sensors. Many new sensors are emerging from different manufacturers with better accuracy and decision support, such as trend arrows.

Alarm fatigue is a relevant challenge, which is why the false alarm rate needs to be low. In our study, the proposed model, if implemented, would result in one alarm per ~10 days of wear time. This depends on the population and degree of glycemic control, so we cannot extrapolate this finding to a group of patients with severe glycemic control. However, the results from external validation on synthetic CGM data from people with different HbA1c levels could indicate that the model is generalizable.

In future perspectives, models such as the one proposed in this study need to be evaluated in a clinical impact study to assess the effects and clinical implications. The hypothesis is that the accurate prediction of hypoglycemic events could lead to better glycemic control with fewer events, increased time in range and less glycemic variability.

In conclusion, this work demonstrates the potential of using ensemble learning to predict hypoglycemia, using only CGM data, in a large and heterogeneous group of patients with type 1 diabetes.

Data availability statement

The datasets presented in this study can be found online. The sources are listed below: the SCGMS database: 10.1177/19322968211014255; the REPLACE-BG database: https://diabetesjournals.org/care/article/40/4/538/3687/REPLACE-BG-A-Randomized-Trial-Comparing-Continuous; the original data of this study: https://public.jaeb.org/dataset/546.

Ethics statement

The studies involving human participants were reviewed and approved as part of the original study REPLACE-BG listed on ClinicalTrials.gov under identifier NCT02258373. The patients/participants provided their written informed consent to participate in this study.

Author contributions

SC had access to all the data analyzed in this study. SC takes responsibility for the integrity and accuracy of the study data analysis and results. SC and JF were involved in the study design, concept, analysis, and interpretation of the data. SC drafted the manuscript and performed the statistical analysis. JF and TH were involved in critical revision of the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of interest

The algorithm tested in this article was developed by Medicus Engineering. SC is a consultant and JF is a consultant and co-owner of Medicus Engineering.

The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Author disclaimer

The source of the data is the T1D Exchange, but the analyses, content, and conclusions presented herein are solely the responsibility of the authors and have not been reviewed or approved by the T1D Exchange.

References

1. Parekh B. The mechanism of dead-in-Bed syndrome and other sudden unexplained nocturnal deaths. Curr. Diabetes Rev. (2009) 5(4):210–5. doi: 10.2174/157339909789804387

2. Cryer PE, Davis SN, Shamoon H. Hypoglycemia in diabetes. Diabetes Care (2003) 26(6):1902–12. doi: 10.2337/diacare.26.6.1902

3. Cryer PE. The barrier of hypoglycemia in diabetes. Diabetes (2008) 57(12):3169–76. doi: 10.2337/db08-1084

4. Yale JF, Paty B, Senior PA. Hypoglycemia. Can. J. Diabetes. (2018) 42:S104–8. doi: 10.1016/j.jcjd.2017.10.010

5. Henriksen MM, Andersen HU, Thorsteinsson B, Pedersen-Bjergaard U. Hypoglycemic exposure and risk of asymptomatic hypoglycemia in type 1 diabetes assessed by continuous glucose monitoring. J. Clin. Endocrinol. Metab. (2018) 103(6):2329–35. doi: 10.1210/jc.2018-00142

6. Donnelly LA, Morris AD, Frier BM, Ellis JD, Donnan PT, Durrant R, et al. Frequency and predictors of hypoglycaemia in type 1 and insulin-treated type 2 diabetes: a population-based study. Diabet. Med. (2005) 22(6):749–55. doi: 10.1111/j.1464-5491.2005.01501.x

7. Diabetes Control and Complications Trial Research Group, Nathan DM, Genuth S, Lachin J, Cleary P, Crofford O, et al. The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N Engl. J. Med. (1993) 329(14):977–86. doi: 10.1056/NEJM199309303291401

8. Ahmadi SS, Westman K, Pivodic A, Ólafsdóttir AF, Dahlqvist S, Hirsch IB, et al. The association between HbA1c and time in hypoglycemia during CGM and self-monitoring of blood glucose in people with type 1 diabetes and multiple daily insulin injections: A randomized clinical trial (GOLD-4). Diabetes Care (2020) 43(9):2017–24. doi: 10.2337/dc19-2606

9. Woldaregay AZ, Årsand E, Botsis T, Albers D, Mamykina L, Hartvigsen G. Data-driven blood glucose pattern classification and anomalies detection: Machine-learning applications in type 1 diabetes. J. Med. Internet Res. (2019) 21(5):e11030. doi: 10.2196/11030

10. Cichosz SL, Henriksen MM, Tarnow L, Thorsteinsson B, Pedersen-Bjergaard U, Fleischer J. Validation of an algorithm for predicting hypoglycemia from continuous glucose measurements and heart rate variability data. J. Diabetes Sci. Technol. (2019) 13:1178–9. doi: 10.1177/1932296819864625

11. Cichosz SL, Frystyk J, Hejlesen OK, Tarnow L, Fleischer J. Novel algorithm for prediction and detection of hypoglycemia based on continuous glucose monitoring and heart rate variability in patients with type 1 diabetes. J. Diabetes Sci. Technol. (2014) 8(4):731–7. doi: 10.1177/1932296814528838

12. Cichosz SL, Frystyk J, Tarnow L, Fleischer J. Combining information of autonomic modulation and CGM measurements enables prediction and improves detection of spontaneous hypoglycemic events. J. Diabetes Sci. Technol. (2014) 9(1):132–7. doi: 10.1177/1932296814549830

13. Cichosz S, Frystyk J, Tarnow L, Fleischer J. Are changes in heart rate variability during hypoglycemia confounded by the presence of cardiovascular autonomic neuropathy in patients with diabetes? Diabetes Technol. Ther. (2017) 19(2):91–5. doi: 10.1089/dia.2016.0342

14. Woldaregay AZ, Årsand E, Walderhaug S, Albers D, Mamykina L, Botsis T, et al. Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes. Artif. Intell. Med. (2019) 98:109–34. doi: 10.1016/j.artmed.2019.07.007

15. Dave D, DeSalvo DJ, Haridas B, McKay S, Shenoy A, Koh CJ, et al. Feature-based machine learning model for real-time hypoglycemia prediction. J. Diabetes Sci. Technol. (2020) 15(4):842–55. doi: 10.1177/1932296820922622

16. Cichosz SL, Jensen MH, Hejlesen O. Short-term prediction of future continuous glucose monitoring readings in type 1 diabetes: Development and validation of a neural network regression model. Int. J. Med. Inform. (2021) 151:104472. doi: 10.1016/j.ijmedinf.2021.104472

17. Aleppo G, Ruedy KJ, Riddlesworth TD, Kruger DF, Peters AL, Hirsch I, et al. REPLACE-BG: A randomized trial comparing continuous glucose monitoring with and without routine blood glucose monitoring in adults with well-controlled type 1 diabetes. Diabetes Care (2017) 40(4):538–45. doi: 10.2337/dc16-2482

18. Cichosz SL, Xylander AAP. A conditional generative adversarial network for synthesis of continuous glucose monitoring signals. J. Diabetes Sci. Technol. (2022) 16(5):1220–23. doi: 10.1177/19322968211014255

19. Battelino T, Danne T, Bergenstal RM, Amiel SA, Beck R, Biester T, et al. Clinical targets for continuous glucose monitoring data interpretation: Recommendations from the international consensus on time in range. Diabetes Care (2019) 42(8):1593–603. doi: 10.2337/dci19-0028

20. Danne T, Nimri R, Battelino T, Bergenstal RM, Close KL, DeVries JH, et al. International consensus on use of continuous glucose monitoring. Diabetes Care (2017) 40(12):1631–40. doi: 10.2337/dc17-1600

21. Mirza M, Osindero S. Conditional generative adversarial nets. arXiv (2014) 1–7. doi: 10.48550/arXiv.1411.1784

22. Dave D, Erraguntla M, Lawley M, DeSalvo D, Haridas B, McKay S, et al. Improved low glucose predictive alerts based on sustained hypoglycemia. JMIR Diabetes (2021) 6(2):1–11. doi: 10.2196/26909

23. Seo W, Lee S, Jin SM, Park SM. A machine-learning approach to predict postprandial hypoglycemia. BMC Med. Inform. Decis. Mak. (2019) 19(1):1–13. doi: 10.1186/s12911-019-0943-4

24. Marcus Y, Eldor R, Yaron M, Shaklai S, Ish-Shalom M, Shefer G, et al. Improving blood glucose level predictability using machine learning. Diabetes Metab. Res. Rev. (2020) 36(8):e3348. doi: 10.1002/dmrr.3348

Keywords: diabetes, machine learning, hypoglycaemia, type 1 diabetes, continuous glucose monitoring (CGM), event prediction, Dexcom G4 platinum, blood glucose (BG)

Citation: Fleischer J, Hansen TK and Cichosz SL (2022) Hypoglycemia event prediction from CGM using ensemble learning. Front. Clin. Diabetes Healthc. 3:1066744. doi: 10.3389/fcdhc.2022.1066744

Received: 20 October 2022; Accepted: 22 November 2022;
Published: 09 December 2022.

Edited by:

Andreas Schmitt, Diabetes Zentrum Mergentheim, Germany

Reviewed by:

Maartje De Wit, Academic Medical Center, Netherlands
Anouk Geraets, University of Luxembourg, Luxembourg

Copyright © 2022 Fleischer, Hansen and Cichosz. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Simon Lebech Cichosz, simcich@hst.aau.dk
