Blood pressure stratification using photoplethysmography and light gradient boosting machine

Hu, Xudong; Yin, Shimin; Zhang, Xizhuang; Menon, Carlo; Fang, Cheng; Chen, Zhencheng; Elgendi, Mohamed; Liang, Yongbo

doi:10.3389/fphys.2023.1072273

ORIGINAL RESEARCH article

Front. Physiol., 20 February 2023

Sec. Computational Physiology and Medicine

Volume 14 - 2023 | https://doi.org/10.3389/fphys.2023.1072273

This article is part of the Research TopicWearable Sensors Role in Promoting Health and Wellness via Reliable and Longitudinal MonitoringView all 7 articles

Blood pressure stratification using photoplethysmography and light gradient boosting machine

Xudong Hu¹

Shimin Yin¹

Xizhuang Zhang²

Carlo Menon³

Cheng Fang¹

Zhencheng Chen^1,4,5

Mohamed Elgendi³*

Yongbo Liang^1,4,5*

¹School of Life and Environmental Sciences, Guilin University of Electronic Technology, Guilin, China
²School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, China
³Biomedical and Mobile Health Technology Lab, ETH Zurich, Zurich, Switzerland
⁴Guangxi Colleges and Universities Key Laboratory of Biomedical Sensors and Intelligent Instruments, Guilin, China
⁵Guangxi Engineering Technology Research Center of Human Physiological Information Noninvasive Detection, Guilin, China

Introduction: Globally, hypertension (HT) is a substantial risk factor for cardiovascular disease and mortality; hence, rapid identification and treatment of HT is crucial. In this study, we tested the light gradient boosting machine (LightGBM) machine learning method for blood pressure stratification based on photoplethysmography (PPG), which is used in most wearable devices.

Methods: We used 121 records of PPG and arterial blood pressure (ABP) signals from the Medical Information Mart for Intensive Care III public database. PPG, velocity plethysmography, and acceleration plethysmography were used to estimate blood pressure; the ABP signals were used to determine the blood pressure stratification categories. Seven feature sets were established and used to train the Optuna-tuned LightGBM model. Three trials compared normotension (NT) vs. prehypertension (PHT), NT vs. HT, and NT + PHT vs. HT.

Results: The F1 scores for these three classification trials were 90.18%, 97.51%, and 92.77%, respectively. The results showed that combining multiple features from PPG and its derivative led to a more accurate classification of HT classes than using features from only the PPG signal.

Discussion: The proposed method showed high accuracy in stratifying HT risks, providing a noninvasive, rapid, and robust method for the early detection of HT, with promising applications in the field of wearable cuffless blood pressure measurement.

1 Introduction

Globally, cardiovascular disease (CVD) is the main cause of mortality (Al-Makki et al., 2022). The World Health Organization predicts that the death rate from CVD will increase from 246 per million people in 2015 to 264 per million people in 2030 (Roth et al., 2015; Ribas Ripoll et al., 2016). Over 25% of adults worldwide suffer from hypertension (HT), which is the major cause of CVD events and mortality. Since HT typically has no symptoms or indicators in its early stages, it is referred to as the silent killer (Mensah, 2019; Polak-Iwaniuk et al., 2019). Consequently, many people are unaware that they are suffering from high blood pressure and are not treated in a timely manner. Early diagnosis, treatment, and management of HT are crucial for preventing and treating CVDs.

Blood pressure measurement methods can be invasive or noninvasive. The former is referred to as arterial puncture measurement as it requires making a puncture or incision in a blood vessel. Thus, it is only suitable for critically ill patients and not daily monitoring. Korotkoff’s sound and oscillometric methods, which require the use of an upper arm cuff and use the cuff pressure and release process to identify systolic and diastolic blood pressure levels, are the most widely used noninvasive blood pressure measuring techniques. Medical personnel and patients appreciate the reliability of these two approaches (Martínez et al., 2018). However, the Korotkoff’s sound and oscillographic methods can only provide intermittent blood pressure measurements, with a 2-min interval between measurements. Therefore, they are not suitable for real-time prediction and assessment of HT (Slapničar et al., 2019). Developing a method to determine the classification of HT that can be applied continuously and for which the results can be obtained instantly has become a popular topic of research in the digital health industry (Hosanee et al., 2020).

In the last 10 years, photoplethysmography (PPG), a noninvasive technique for monitoring changes in microvascular blood volume, has been the method most frequently studied (Elgendi et al., 2019; Park et al., 2022). Research on PPG is typically focused on two aspects: 1) the PPG-based pulse arrival time (PAT) and 2) PPG-based time and frequency domain parameters. The PAT of the former method is calculated from the PPG and the electrocardiographic (ECG) signal. However, this method requires simultaneous measurement at two different sites on the body, which can be inconvenient and challenging for some patients (Elgendi, 2020). The latter approach models PPG time and frequency domain factors to determine blood pressure. However, the analysis and information extraction of the PPG waveform morphology is extremely demanding in terms of calculating time-frequency domain parameters. This method is extremely sensitive to noise and requires high-quality PPG signals, for example, with a high sample rate and accurate sampling, which limits its wide application.

A number of scholars have conducted research to overcome these problems. In 2020, Tjahjadi et al. classified blood pressure values using the K-nearest neighbors (KNN) method based on PPG. The suggested technique improves classification accuracy without the PPG waveform shape (Tjahjadi and Ramli, 2020). However, before KNN is applied to the dataset, feature scaling (standardization and normalization) is needed, adding a data preprocessing step. Recently, deep learning approaches have been effectively used to address this issue. Sun et al. (2021) suggested a deep learning approach for classifying blood pressure using a PPG signal as well as its first and second derivative signals. They employed a convolutional neural network based on the Hilbert–Huang transform. Compared to feature extraction approaches, the method obtained greater accuracy in HT risk categorization and demonstrated that PPG derivatives include crucial information on blood pressure. However, deep learning methods require training using large-scale data, and the training time consumption is often greater than 5 h (Tjahjadi and Ramli, 2020).

We sought to address these limitations while reducing the impact of morphological methods on model stability, and we proposed a method to classify blood pressure using Tsfresh and an Optuna-tuned light gradient boosting machine (LightGBM) based on PPG and its derivatives. The suggested technique worked well for classifying blood pressure in real time. Tsfresh is a Python tool for time series feature extraction. Optuna-tuned LightGBM has the advantages of better model training accuracy and lower memory consumption in comparison to traditional classification algorithms, making it suitable for use in wearable devices.

The key contributions of this study are:

1) We classified blood pressure as normotension (NT), prehypertension (PHT), and HT according to the seventh report of the United States Joint National Committee on Prevention, Detection, Evaluation and Treatment of High Blood Pressure (JNC7). Our proposed method allows users to instantly know their blood pressure condition and provides a warning system for patients that may possibly have hypertension.

2) With our proposed method, only one physiological pulse wave signal is needed, and our research shows that PPG and its derivatives can be used to predict blood pressure in place of the combination of ECG and PPG. This advantage has great application potential in wearable devices, as, in general, traditional smart bracelets and smart watches can easily obtain a stable PPG signal.

3) Our proposed method has no requirement in terms of PPG signal quality and does not require extraction of pulse wave morphological features.

4) To obtain a shorter training period and lower computational complexity, our proposed method uses machine learning rather than deep learning.

2 Materials and methods

2.1 Data acquisition

In this study, arterial blood pressure (ABP) signals, measured with a catheter in the radial artery, were used to categorize the PPG signals based on the blood pressure values. They were obtained from the Medical Information Mart for Intensive Care III (MIMIC-III) database (Saeed et al., 2011; Johnson et al., 2016), a large, free, public database that contains complex-parameter recordings of more than 40,000 intensive care unit (ICU) patients, including laboratory test data, demographic information data, and physical measurements. In this paper, only the PPG and ABP signal data were used as the original data source. To correctly obtain blood pressure labels, we excluded recordings, such as missing peaks, double-peaked pulses, and no signal (Liang et al., 2018a; Liang et al., 2019). Ultimately, the recordings of 121 subjects were collected, each lasting 120 s, with a sampling frequency of 125 Hz.

2.2 Signal preprocessing

Each recording included ABP and PPG signals as the target source and prediction source, respectively, and the recording was divided into 5-s segments. Next, to create the training data, we adopted the signal function extreme value search algorithm, which was mainly used to detect the peak and trough points in the ABP signal and to extract the diastolic blood pressure (DBP) and systolic blood pressure (SBP). According to JNC7 (Chobanian et al., 2003), the blood pressure conditions were classified as NT, PHT, or HT. Figure 1 illustrates the structure of the signal processing.

FIGURE 1

FIGURE 1. A signal processing structure. Note: ABP stands for arterial blood pressure; PPG, VPG, and APG refer to photoplethysmography, velocity plethysmography, and acceleration plethysmography, respectively. JNC7 stands for the seventh report of the United States Joint National Committee on Prevention, Detection, Evaluation and Treatment of High Blood Pressure.

The PPG signal was then processed by primary and secondary differentiation to obtain its first and second derivative signals, which represent velocity plethysmography (VPG) and acceleration plethysmography (APG), respectively (Elgendi et al., 2018). Because a signal collected manually or by machine is inevitably subject to disturbance by the environment and other factors, such as circuit interference, resulting in the presence of various kinds of noise in the collected signal, noise reduction was an essential part of signal processing. Current noise reduction methods include filters, digital filters, Fourier transforms, wavelet transforms, etc. In the study discussed in this paper, the noise was reduced using a 0.5–10 Hz Butterworth bandpass filter. Then, to map the data to the same scale, the filtered PPG signals were mean-variance normalized. Figure 2 shows the PPG, VPG, and APG waveforms for the three different blood pressure categories.

FIGURE 2

FIGURE 2. Signal derivatives for three different blood pressure categories. Note: PPG, VPG, and APG refer to photoplethysmography, velocity plethysmography, and acceleration plethysmography, respectively.

2.3 Feature extraction

Previous PPG research faced challenges in terms of the effective location and extraction of feature points, and the traditional manual method of extracting signal feature points did not work well for poor-quality PPG signals (Chan et al., 2019). In this study, the Python package Tsfresh was used to extract signal segment characteristics (Christ et al., 2018). Tsfresh is a feature engineering tool for relational databases dealing with time series. It provides 77 time series characterization methods, with a total of 794 time series features that can be computed with different parameters (Gedam and Paul, 2020; Hunter et al., 2020). In real-world scenarios, time series often contain noise and redundant or irrelevant information. To avoid extracting features with low relevance, we obtained p-values by performing univariate feature significance tests, which were then evaluated using the Benjamini-Hochberg procedure, retaining the features with high correlation with the classification label in order to identify the features that best explained the trend (Li and Barber, 2019; Phillips et al., 2021).

Multiple temporal subsegments with classification labels y were simultaneously imported into the Tsfresh function, and the numerical features of the temporal subsegments were extracted; thus, 794 features were extracted for each temporal subsegment (Simjanoska et al., 2020). We then filtered out the features that did not have a significant impact on the recognition result. A total of 189 features were obtained for each PPG timing subsegment, 200 features for each VPG timing subsegment, and 190 features for each APG timing subsegment. All the features can be found in the Supplementary Material.

The following are some of the Tsfresh-calculated features:

1) Absolute energy: this term refers to the absolute energy of the time series and is the sum of the squared values.

E = \sum_{i = 1, \dots, n} x_{i}^{2} (1)

2) Continuous wavelet transform (CWT) coefficients: these are used to perform a CWT on the Ricker wavelet.

c w t = \frac{2}{\sqrt{3 a} π^{\frac{1}{4}}} (1 - \frac{x^{2}}{a^{2}}) \exp (- \frac{x^{2}}{2 a^{2}}) (2)

3) Fast Fourier transform (FFT) coefficients: these are the Fourier coefficients of the one-dimensional discrete Fourier transform of real input by a fast Fourier transformation algorithm.

A_{k} = \sum_{m = 0}^{n - 1} a_{m} \exp \{- 2 π i \frac{m k}{n}\}, k = 0, \dots, n - 1 (3)

4) Mean second derivative central: this is the mean value of a central approximation of the second derivative.

A_{d} = \frac{1}{2 (n - 2)} \sum_{i = 1, \dots, n - 1} \frac{1}{2} (x_{i + 2} - 2 \cdot x_{i + 1} + x_{i}) (4)

5) Mean absolute change: this is the mean over the absolute differences between the subsequent time series values.

A_{a c} = \frac{1}{n - 1} \sum_{i = 1, \dots, n - 1} |x_{i + 1} - x_{i}| (5)

2.4 Machine learning methods and hyperparameter tuning

After the redundant features were removed, the remaining features were input into the LightGBM classifier for blood pressure classification. LightGBM is a new gradient boosting decision tree extension proposed by Microsoft (Ke et al., 2017). The algorithm incorporates exclusive feature bundling (EFB) and gradient-based one-side sampling (GOSS). To optimize the feature values, the algorithm uses the histogram-based algorithm instead of the traditional presorted traversal algorithm. LightGBM offers better model training accuracy and prevention of overfitting than traditional classification algorithms, such as decision trees (Song and Ying, 2015) and random forests (Biau, 2012). To better demonstrate the scientific nature of the experiment, we used a decision tree, AdaBoost, a gradient boosting decision tree (GBDT), random forest, XgBoost, and LightGBM to conduct comparative experiments.

To improve the predictive performance of the LightGBM model and avoid overfitting, a Bayesian optimization library, called Optuna, was employed to effectively adjust the hyperparameters and empirically benchmark its performance. Optuna is a framework created to automate and accelerate hyperparameter optimization experiments (Akiba et al., 2019). It has three core concepts: objective function, single trial, and study. Optuna continually calls for and assesses the objective function for various parameter values to arrive at the best result (Dong et al., 2020; Lacerda et al., 2021). In this study, 1,000 Bayesian optimization trials were used to maximize the accuracy score and 10 LightGBM hyperparameters. Table 1 shows the 10 LightGBM hyperparameters.

TABLE 1

TABLE 1. Hyperparameter settings of LightGBM used for the Bayesian optimization.

2.5 Hypertension classification

A recent study (Berstad et al., 2018) showed that multiple binary classifiers resulted in a more robust model than a single network multiclass implementation. In other words, examining several binary classifiers can provide more robust and increased classification accuracy. Here, we focused on formulating the classification problem into a binary classification based on clinical importance. Therefore, we implemented the one-vs-one multiclass and the one-vs-rest multiclass strategies. In the one-vs-one multiclass strategy, it is crucial to differentiate between NT from PHT and NT from HT. With regards to the one-vs-rest multiclass strategy, distinguishing NT + PH from HT is also clinically essential. Consequently, a total of 1,158 NT cases, 950 PHT cases, and 850 HT cases were obtained based on the HT classifications reported by JNC7. Three HT classification trials were established: NT vs. PHT, NT vs. HT, and NT + PHT vs. HT. Seven feature sets were used to classify the different blood pressure categories: one containing only PPG features, one containing only VPG features, one containing only APG features, one containing both PPG and VPG features, one containing both PPG and APG features, one containing both VPG, and APG features, and one containing PPG, VPG, and APG features. We designed these seven feature sets for two purposes. First, we used them to compare the three waveforms, PPG, VPG, and APG, to predict blood pressure levels. Second, we verified whether a feature set containing PPG, VPG, and APG is superior to a single PPG feature set for blood pressure prediction. Figure 3 shows the flow chart for this study.

FIGURE 3

FIGURE 3. the flow chart of this study. Note: ABP stands for arterial blood pressure, PPG, VPG, and APG denote photoplethysmography, velocity plethysmography, and acceleration plethysmography, respectively.

We used eight assessment methods in this study: specificity (SP), sensitivity (SN), accuracy (ACC), precision (PRE), F1 score, Matthew’s correlation coefficient (MCC), Cohen’s kappa coefficient (Kappa), and area under the receiver operating characteristic (ROC) curve (AUC). These methods are defined as follows:

S P = \frac{T N}{T N + F P} (6)

S N = \frac{T P}{T P + F N} (7)

A C C = \frac{T P + T N}{T P + T N + F T + F N} (8)

P R E = \frac{T P}{T P + F P} (9)

F 1 s c o r e = 2 \times \frac{T P}{2 T P + F P + F N} (10)

M C C = \frac{(T P \times T N) - (F P \times F N)}{\sqrt{(T P + F P) (T P + F N) (T N + F P) (T N + F N)}} (11)

K a p p a = \frac{P_{0} - P_{e}}{1 - P_{e}} (12)

TP, FP, TN, and FN stand for true positive, false positive, true negative, and false negative, respectively. $P_{0}$ denotes the level of observed agreement among the raters, and $P_{e}$ indicates the hypothesized likelihood of chance agreement. AUC refers to the area under the ROC curve, representing the sum of the measured classification performance across all possible thresholds.

In this study, a total of 2958 signal segments were obtained and then randomly divided the dataset into 70%, and 30%, of which 70% was the training set with a total of 2070 signal segments and 30% was the testing set with a total of 888 signal segments. The bootstrap method was used based on the recommendation by Xu and Goodacre (2018) who conducted a comparative study of cross-validation, bootstrap, and systematic sampling for estimating the generalization performance of supervised learning. They concluded that most of the resampling methods produce similar correct classification results; therefore, in this study, bootstrap method was implemented instead of the cross-validation method because of the need for parameter optimization using Optuna for LightGBM model tuning. The optimized parameters include the bagging fraction learning control parameter, which indicates the proportion of data for each bootstrap aggregating (bagging); this can improve the robustness of the model.

All signal processing, modeling, and evaluations were performed in PyCharm software (Community version 2020.2.3), developed and distributed by JetBrains (Prague, Czech Republic). Machine learning algorithms were implemented using Python 3.8 based on the following packages: LightGBM v3.2.1, Scikit-learn v1.0.1, Optuna v2.10.0, and Tsfresh 0.19.0. The code was executed on a laptop with an Intel i7-6700 as the CPU, 8 GB RAM, and NVIDIA GeForce GTX 960M as the graphics card.

3 Results

As traditional classification algorithms, the decision tree and random forest methods have made great contributions to the development of machine learning (Jordan and Mitchell, 2015). To better demonstrate the rigor of this experiment, the performance of these traditional algorithms and LightGBM were compared. Table 2 presents a summary of the performance of the various machine learning models using the PPG signal feature set extracted by Tsfresh, including a decision tree, AdaBoost, GBDT, random forest, XgBoost, and LightGBM, and provides the run times for 100 training runs. The test set included 348 NT cases, 285 PHT cases, and 255 HT cases. The best model performance is marked in bold font. As seen in Table 2, of the six models, the decision tree had the shortest running time but the worst classification performance. The classification performance was slightly better for LightGBM than XgBoost and much better than the other four tested models. The superior performance of the LightGBM model was demonstrated by the fact that it ran in much less time than the XgBoost model.

TABLE 2

TABLE 2. Classification performance of the proposed machine learning method. Note, NT, PHT, and HT denote normotension, prehypertension, and hypertension, respectively. SP stands for specificity, SN stands for sensitivity, ACC stands for accuracy, PRE stands for precision, MCC stands for Matthew’s correlation coefficient, Kappa stands for Cohen’s kappa coefficient, AUC stands for Area under curve. Values in Bold indicate highest scores achieved for each classification per evaluation metric.

3.1 Hyperparameter tuning

As seen in Figure 4, the optimized LightGBM model with Optuna performed better on the PPG signal feature set extracted by Tsfresh. In the default setting of the LightGBM model, the F1 scores of NT vs. PHT, NT vs. HT, and NT + PHT vs. HT were 0.8355, 0.9151, and 0.8968, respectively. After the Bayesian hyperparameters were modified, the improved LightGBM model performed better in multiple classification tests. The corresponding values for NT vs. PHT, NT vs. HT, and NT + PHT vs. HT were 0.8657, 0.9418, and 0.9170, respectively.

FIGURE 4

FIGURE 4. Performance of LightGBM in the classification of hypertension. Radar plot illustrations the performance of base and optimization LightGBM model in (A) NT vs. PHT (B) NT vs. HT and (C) (NT + PHT) vs. HT. Note, NT, PHT, and HT refer to normotension, prehypertension, and hypertension, respectively. SP stands for specificity, SN stands for sensitivity, ACC stands for accuracy, PRE stands for precision, MCC stands for Matthew’s correlation coefficient, Kappa stands for Cohen’s kappa coefficient, AUC stands for Area under curve.

3.2 Model performance

To investigate the effects of different feature sets extracted by Tsfresh on blood pressure classification, we used seven different feature sets and trained the Optuna-tuned LightGBM model on each. Table 3 presents a summary of the classification performance of different feature sets using Optuna-tuned LightGBM models. The best performance is marked in bold font. Based on the results presented in Table 3, we reached the following conclusions. First, VPG outperformed PPG and APG in terms of classification performance; the most significant difference in performance improvement was seen between the NT and PHT experiments. Second, the PPG, APG, and VPG datasets outperformed the single PPG dataset, and similar results were obtained in different classification experiments. Third, an increase in the dataset size contributed to an improvement in the performance of blood pressure classification.

TABLE 3

TABLE 3. Classification performance on different feature sets using Optuna-tuned LightGBM models. Note, NT, PHT, and HT denote normotension, prehypertension, and hypertension, respectively. PPG, VPG, and APG refer to photoplethysmography, velocity plethysmography, and acceleration plethysmography, respectively. SP stands for specificity, SN stands for sensitivity, ACC stands for accuracy, PRE stands for precision, MCC stands for Matthew’s correlation coefficient, Kappa stands for Cohen’s kappa coefficient, AUC stands for Area under curve. Values in Bold indicate highest scores achieved for each classification per evaluation metric.

3.3 Study comparison

In past research (Liang et al., 2018a; Liang et al., 2018b), the dataset used in this study was employed to classify blood pressure using PAT and PPG features extracted from ECG and PPG signals. The results shown in Table 4 denote that the accuracy of the model used in this study is greater than that of the classifier employing PAT and 10 PPG morphological characteristics. Moreover, the method performs better than the Google Net model using continuous wavelet transform. This suggests the potential of the model proposed in this study, which only employed PPG signals and their derivatives, as an alternative to ECG and PPG HT classification methods.

TABLE 4

TABLE 4. Classification performance of the proposed machine learning method and deep learning method and feature-based methods on the same recordings from the MIMIC-III database. Note, NT, PHT, and HT refer to normotension, prehypertension, and hypertension, respectively. PAT stands for pulse arrival time, CWT stands for continuous wavelet transform, and KNN stands for k-nearest neighbors.

4 Discussion

HT is routinely evaluated using blood pressure testing methods. Elderly individuals generally have trouble reading and managing blood pressure cuffs, limiting their use. In contrast, the PPG signal has the advantage of being easy to collect and monitor over time, making it an important tool for noninvasive cardiovascular health screening. However, in a previous study (Chan et al., 2019) based on PPG signals, the researchers had trouble recognizing and extracting feature points because patient age, motion, and respiration all interfere with PPG signals. We have provided a potential solution to these difficulties through using the Tsfresh method, which automatically and robustly extracts features from the original signal and uses the Optuna-tuned LightGBM machine learning model for classification.

LightGBM enables effective parallel training, which can speed up standard GBDT model training 20-fold. It also has reduced memory usage, improved accuracy, and rapid data processing (He et al., 2020). LightGBM has often been used to perform classification and regression tasks (Zeng et al., 2019; Xu et al., 2020). The results shown in Table 2 demonstrate the superior performance of the LightGBM model in terms of training time overhead when using different machine learning models to classify the same dataset.

However, tuning is more difficult for LightGBM than for traditional machine learning techniques, which only need the adjustment of one or two parameters to ensure model correctness and resilience. The grid-search strategy is the most common method for optimizing the 10 LightGBM parameters. However, this method has no pruning operation, resulting in a long search time. Optuna optimization techniques may be used to solve this problem by modifying the hyperparameters (Bergstra et al., 2013). As seen in Figure 3, the baseline LightGBM model could be improved via Optuna optimization by over 3.6%, 2.9%, and 2.1% (ACC value), 3.6%, 2.9%, and 2.3% (F-value), 9.2%, 6.6%, and 6.6% (MCC value), and 3.8%, 3.0%, and 3.2% (AUC value) for NT vs. PHT, NT vs. HT, and NT + PHT vs. HT, respectively, thereby verifying the effectiveness of Optuna optimization.

To study the influence of the first- and second-order derivatives of PPG on the classification results, seven different feature sets were used for the classification experiments. The results shown in Table 3 demonstrate that the combined feature set of PPG, VPG, and APG outperformed the single PPG feature set in the blood pressure classification model. VPG denotes the aortic blood flow velocity and APG indicates the change in the velocity of blood flow. Because hypertensive patients have high blood pressure, blood flows more rapidly into the aorta when the aortic valve is open. Additionally, the descending branch of the PPG signal is steeper in hypertensive patients who lack vascular elasticity than in the general population, which is reflected in the APG. Consequently, adding PPG derivative information to the dataset can make blood pressure classification more accurate.

NT, PHT, and HT are the different stages of blood pressure that the human body exhibits with age or the cause of disease, and also reflect the state of cardiovascular health. Globally, compared with PHT and HT, the number of NT is the largest, from the normotensive population screening to identify PHT and HT samples is of great significance, through one-to-one binary classification research, we can more intuitively observe the actual effect of the proposed scheme in the screening of PHT and HT. Thus, in the machine learning approach using Tsfresh and Optuna-tuned LightGBM, three classification trials for HT were conducted: NT vs. PHT, NT vs. HT, and NT + PHT vs. HT. The classification performance results of the three classification tests are shown in Table 3. The F1 scores of the tests were found to be greater than .85. The combined feature set of PPG, VPG, and APG was associated with the highest F1 scores, with .9018, .9751, and .9277 in different classification experiments, respectively. All the feature sets had F1 scores greater than .9. These findings indicate this method’s potential for detecting HT. Figure 5 shows the performance of the main models used in this study with different feature sets.

FIGURE 5

FIGURE 5. Performance of the model with different feature sets. Note, NT, PHT, and HT represent normotension, prehypertension, and hypertension, respectively. PPG, VPG, and APG denote photoplethysmography, velocity plethysmography, and acceleration plethysmography, respectively.

It is worth noting that we obtained a total of 1158 NT cases, 950 PHT cases, and 850 HT cases, and the ratios of positive and negative examples for the classification trials were: 1.22 for NT/PHT, 1.36 for NT/HT, and 2.48 for (NT + PHT)/HT. According to García’s study (García et al., 2012), they defined data with positive and negative sample ratios less than 3 as low-imbalance data. They studied the impact of imbalance ratios and classifiers on the performance of several resampling strategies for processing imbalanced datasets and found no significant differences for the low-imbalance data. The data ratios in this paper are all less than 3, which are considered low-imbalance data; therefore, upsampling and downsampling are not needed.

PAT has been examined extensively in novel cuffless blood pressure detection systems (Ding et al., 2017; Zhang et al., 2017). A previous study classified blood pressure using PAT and PPG parameters taken from ECG and PPG signals (Liang et al., 2018b), and the present study used the same dataset. The results presented in Table 4 show that the approach employed in this study has greater accuracy than the method previously utilized. Thus, using the Tsfresh and Optuna-tuned LightGBM method easily yields better performance than extracting PAT and PPG morphological features using the same low-quality signals based on those collected from elderly ICU patients. Therefore, the approach suggested in this research has more practical usefulness and prospective applications.

This research mainly focused on novel blood pressure detection and HT risk categorization, in which extracting features from physiological signals with different qualities has been difficult. It proposed a machine learning-based classification method using Tsfresh for feature extraction and Optuna-tuned LightGBM for HT classification. Experiments were conducted using machine learning techniques, and the results showed that the proposed model has good performance and strong potential for application in the field of wearable cuffless blood pressure measurement.

The method proposed in this study has some advantages and disadvantages. As for the advantages, first, the proposed method does not require the extraction of morphological characteristics. Additionally, there are no special requirements regarding the quality of the PPG signal. Third, this process can be completely automated. Finally, this process does not require a high level of processing power, the procedure is simple, and the processing time is short. These advantages will make it easier to implement the method in wearable cuffless blood pressure management devices. As for the disadvantages, this process is not suitable for the real-time processing of large-scale data and a small dataset was used. One of the next steps is test the performance of the proposed algorithm on a different dataset with bigger sample size with different ethnic groups. Ethnicity plays a major role in creating bias in the PPG signal (Sjoding et al., 2020). The MIMIC database used in this study suffer from ethnicity bias (Sinaki et al., 2022). As the algorithm proposed in the study was trained and tested to differentiate between subjects, the study was designed as an inter-subject stratification approach. The algorithm is expected to perform well in assessing each subject over time if the intra-subject stratification is explored. Although, getting subjects for intra-subject stratification whose blood pressure variation is regularly changing between NT, PH, and PHT is challenging. Such subjects are usually in critical health situations where medication is needed to move them from HT to NT. However, it is an area worth exploring.

5 Conclusion

The method proposed in the study discussed in this paper (using Tsfresh and Optuna-tuned LightGBM) increases classification accuracy without requiring the extraction of PPG morphological characteristics or a high-quality PPG signal. Comparison of the results of blood pressure classification trials in various models revealed that our proposed model has higher accuracy than decision tree, AdaBoost, GBDT, random forest, and XgBoost models. Our study also showed that the first- and second-order derivatives of PPG include significant information about blood pressure, allowing PPG, VPG, and APG to be used in place of PAT and PPG for blood pressure prediction. The proposed method automatically diagnoses HT, providing a noninvasive, rapid, and low-cost method for the early detection of HT in low- and middle-income countries.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Author contributions

XH and YL performed the statistical analysis. SY and XZ performed some signal processing work. ZC and CM partially analyzed and discussed the experimental results. XH, YL, and ME conceived the study and drafted the manuscript. YL and ME designed the study and led this investigation. All authors approved the final manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China (62101148, 82060330), Natural Science Foundation of Guangxi (2020GXNSFBA297156, 2021GXNSFBA220051) and Guangxi Innovation Driven Development Project (GuikeAA19254003).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2023.1072273/full#supplementary-material

References

Akiba, T., Sano, S., Yanase, T., Ohta, T., and Koyama, M. (2019). Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, Anchorage, AK, 2623–2631.

Al-Makki, A., DiPette, D., Whelton, P. K., Murad, M. H., Mustafa, R. A., Acharya, S., et al. (2022). Hypertension pharmacological treatment in adults: A world health organization guideline executive summary. Hypertension 79, 293–301. doi:10.1161/HYPERTENSIONAHA.121.18192

PubMed Abstract | CrossRef Full Text | Google Scholar

Bergstra, J., Yamins, D., and Cox, D. (2013). Proceedings of the 30th international conference on machine learning, Atlanta, GA, 115–123.

Berstad, T. J. D., Riegler, M., Espeland, H., de Lange, T., Smedsrud, P. H., Pogorelov, K., et al. (2018). IEEE International Symposium on Multimedia (ISM). Taichung, Taiwan: IEEE, 1–8.

Google Scholar

Biau, G. (2012). Analysis of a random forests model. J. Mach. Learn. Res. 13, 1063–1095.

Google Scholar

Chan, G., Cooper, R., Hosanee, M., Welykholowa, K., Kyriacou, P. A., Zheng, D., et al. (2019). Multi-site photoplethysmography technology for blood pressure assessment: Challenges and recommendations. J. Clin. Med. 8, 1827. doi:10.3390/jcm8111827

PubMed Abstract | CrossRef Full Text | Google Scholar

Chobanian, A. V., Bakris, G. L., Black, H. R., Cushman, W. C., Green, L. A., Izzo, J. L., et al. (2003). Seventh report of the Joint national committee on prevention, detection, evaluation, and treatment of high blood pressure. Hypertension 42, 1206–1252. doi:10.1161/01.HYP.0000107251.49515.c2

PubMed Abstract | CrossRef Full Text | Google Scholar

Christ, M., Braun, N., Neuffer, J., and Kempa-Liehr, A. W. (2018). Time series FeatuRe extraction on basis of scalable hypothesis tests (tsfresh – a Python package). Neurocomputing 307, 72–77. doi:10.1016/j.neucom.2018.03.067

CrossRef Full Text | Google Scholar

Ding, X., Yan, B. P., Zhang, Y. T., Liu, J., Zhao, N., and Tsang, H. K. (2017). Pulse transit time based continuous cuffless blood pressure estimation: A new extension and A comprehensive evaluation. Sci. Rep. 7, 11554. doi:10.1038/s41598-017-11507-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, X., Dan, X., Yawen, A., Haibo, X., Huan, L., Mengqi, T., et al. (2020). Identifying sarcopenia in advanced non-small cell lung cancer patients using skeletal muscle CT radiomics and machine learning. Thorac. Cancer 11, 2650–2659. doi:10.1111/1759-7714.13598

PubMed Abstract | CrossRef Full Text | Google Scholar

Elgendi, M., Fletcher, R., Liang, Y., Howard, N., Lovell, N. H., Abbott, D., et al. (2019). The use of photoplethysmography for assessing hypertension. NPJ Digit. Med. 2, 60–11. doi:10.1038/s41746-019-0136-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Elgendi, M., Liang, Y., and Ward, R. (2018). Toward generating more diagnostic features from photoplethysmogram waveforms. Diseases 6, 20. doi:10.3390/diseases6010020

PubMed Abstract | CrossRef Full Text | Google Scholar

Elgendi, M. (2020). PPG signal analysis: An introduction using MATLAB®. Boca Raton, FL: CRC Press.

Google Scholar

García, V., Sánchez, J. S., and Mollineda, R. A. (2012). On the effectiveness of preprocessing methods when dealing with different levels of class imbalance. Knowledge-Based Syst. 25, 13–21. doi:10.1016/j.knosys.2011.06.013

CrossRef Full Text | Google Scholar

Gedam, S., and Paul, S. (2020). International Conference on Computing, Communication and Networking Technologies (ICCCNT). Kharagpur, India: IEEE, 1–7.

Google Scholar

He, Y., Ye, Z., Liu, X., Wei, Z., Qiu, F., Li, H. F., et al. (2020). Can machine learning predict drug nanocrystals? J. Control. Release. 322, 274–285. doi:10.1016/j.jconrel.2020.03.043

PubMed Abstract | CrossRef Full Text | Google Scholar

Hosanee, M., Chan, G., Welykholowa, K., Cooper, R., Kyriacou, P. A., Zheng, D., et al. (2020). Cuffless single-site photoplethysmography for blood pressure monitoring. J. Clin. Med. 9, 723. doi:10.3390/jcm9030723

PubMed Abstract | CrossRef Full Text | Google Scholar

Hunter, R. B., Jiang, S., Nishisaki, A., Nickel, A. J., Napolitano, N., Shinozaki, K., et al. (2020). Supervised machine learning applied to automate flash and prolonged capillary refill detection by pulse oximetry. Front. Physiol. 11, 564589. doi:10.3389/fphys.2020.564589

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson, A. E., Pollard, T. J., Shen, L., Lehman, L. W. H., Feng, M., Ghassemi, M., et al. (2016). MIMIC-III, a freely accessible critical care database. Sci. Data 3, 160035. doi:10.1038/sdata.2016.35

PubMed Abstract | CrossRef Full Text | Google Scholar

Jordan, M. I., and Mitchell, T. M. (2015). Machine learning: Trends, perspectives, and prospects. Science 349, 255–260. doi:10.1126/science.aaa8415

PubMed Abstract | CrossRef Full Text | Google Scholar

Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., et al. (2017). Lightgbm: A highly efficient gradient boosting decision tree. Adv. neural Inf. Process. Syst. 30.

Google Scholar

Lacerda, P., Barros, B., Albuquerque, C., and Conci, A. (2021). Hyperparameter optimization for COVID-19 pneumonia diagnosis based on chest CT. Sensors 21, 2174. doi:10.3390/s21062174

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, A., and Barber, R. F. (2019). Multiple testing with the structure-adaptive Benjamini–Hochberg algorithm. J. R. Stat. Soc. Ser. B. Stat. Methodol. 81, 45–74. doi:10.1111/rssb.12298

CrossRef Full Text | Google Scholar

Liang, Y., Abbott, D., Howard, N., Lim, K., Ward, R., and Elgendi, M. (2019). How effective is pulse arrival time for evaluating blood pressure? Challenges and recommendations from a study using the MIMIC database. J. Clin. Med. 8, 337. doi:10.3390/jcm8030337

PubMed Abstract | CrossRef Full Text | Google Scholar

Liang, Y., Chen, Z., Ward, R., and Elgendi, M. (2018). Hypertension assessment via ECG and PPG signals: An evaluation using MIMIC database. Diagnostics 8, 65. doi:10.3390/diagnostics8030065

PubMed Abstract | CrossRef Full Text | Google Scholar

Liang, Y., Chen, Z., Ward, R., and Elgendi, M. (2018). Photoplethysmography and deep learning: Enhancing hypertension risk stratification. Biosensors 8, 101. doi:10.3390/bios8040101

PubMed Abstract | CrossRef Full Text | Google Scholar

Martínez, G., Howard, N., Abbott, D., Lim, K., Ward, R., and Elgendi, M. (2018). Can photoplethysmography replace arterial blood pressure in the assessment of blood pressure? J. Clin. Med. 7, 316. doi:10.3390/jcm7100316

PubMed Abstract | CrossRef Full Text | Google Scholar

Mensah, G. A. (2019). Commentary: Hypertension phenotypes: The many faces of a silent killer. Ethn. Dis. 29, 545–548. doi:10.18865/ed.29.4.545

PubMed Abstract | CrossRef Full Text | Google Scholar

Park, J., Seok, H. S., Kim, S.-S., and Shin, H. (2022). Photoplethysmogram analysis and applications: An integrative review. Front. Physiol. 12, 808451. doi:10.3389/fphys.2021.808451

PubMed Abstract | CrossRef Full Text | Google Scholar

Phillips, C., Liaqat, D., Gabel, M., and de Lara, E. (2021). IEEE international conference on pervasive computing and communications workshops and other affiliated events. Kassel, Germany: PerCom Workshops, 623–629.

Polak-Iwaniuk, A., Harasim-Symbor, E., Gołaszewska, K., and Chabowski, A. J. F. i. P. (2019). How hypertension affects heart metabolism. Front. Physiol. 10, 435. doi:10.3389/fphys.2019.00435

PubMed Abstract | CrossRef Full Text | Google Scholar

Ribas Ripoll, V. J., Wojdel, A., Romero, E., Ramos, P., and Brugada, J. (2016). ECG assessment based on neural networks with pretraining. Appl. Soft Comput. 49, 399–406. doi:10.1016/j.asoc.2016.08.013

CrossRef Full Text | Google Scholar

Roth, G. A., Forouzanfar, M. H., Moran, A. E., Barber, R., Nguyen, G., Feigin, V. L., et al. (2015). Demographic and epidemiologic drivers of global cardiovascular mortality. N. Engl. J. Med. 372, 1333–1341. doi:10.1056/NEJMoa1406656

PubMed Abstract | CrossRef Full Text | Google Scholar

Saeed, M., Villarroel, M., Reisner, A. T., Clifford, G., Lehman, L. W., Moody, G., et al. (2011). Multiparameter intelligent monitoring in intensive care II: A public-access intensive care unit database. Crit. Care. Med. 39, 952–960. doi:10.1097/CCM.0b013e31820a92c6

PubMed Abstract | CrossRef Full Text | Google Scholar

Simjanoska, M., Eftimov, T., Kocheva, S., Stevchev, N., and Kochev, S. (2020). Proceedings of the 13th international Joint conference on biomedical engineering systems and technologies, Valletta, Malta, 228–235.

Sinaki, F. Y., Ward, R., Abbott, D., Allen, J., Fletcher, R. R., Menon, C., et al. (2022). Ethnic disparities in publicly-available pulse oximetry databases. Commun. Med. 2, 59. doi:10.1038/s43856-022-00121-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Sjoding, M. W., Dickson, R. P., Iwashyna, T. J., Gay, S. E., and Valley, T. S. (2020). Racial bias in pulse oximetry measurement. N. Engl. J. Med. 383, 2477–2478. doi:10.1056/NEJMc2029240

PubMed Abstract | CrossRef Full Text | Google Scholar

Slapničar, G., Mlakar, N., and Luštrek, M. (2019). Blood pressure estimation from photoplethysmogram using a spectro-temporal deep neural network. Sensors 19, 3420. doi:10.3390/s19153420

PubMed Abstract | CrossRef Full Text | Google Scholar

Song, Y.-Y., and Ying, L. (2015). Decision tree methods: Applications for classification and prediction. Shanghai Arch. Psychiatry 27, 130–135. doi:10.11919/j.issn.1002-0829.215044

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, X., Zhou, L., Chang, S., and Liu, Z. (2021). Using CNN and HHT to predict blood pressure level based on photoplethysmography and its derivatives. Biosensors 11, 120. doi:10.3390/bios11040120

PubMed Abstract | CrossRef Full Text | Google Scholar

Tjahjadi, H., and Ramli, K. (2020). Noninvasive blood pressure classification based on photoplethysmography using k-nearest neighbors algorithm: A feasibility study. Information 11, 93. doi:10.3390/info11020093

CrossRef Full Text | Google Scholar

Xu, Y., and Goodacre, R. (2018). On splitting training and validation set: A comparative study of cross-validation, bootstrap and systematic sampling for estimating the generalization performance of supervised learning. J. analysis Test. 2, 249–262. doi:10.1007/s41664-018-0068-2

CrossRef Full Text | Google Scholar

Xu, Y., Ju, L., Tong, J., Zhou, C. M., and Yang, J. J. (2020). Machine learning algorithms for predicting the recurrence of stage IV colorectal cancer after tumor resection. Sci. Rep. 10, 2519. doi:10.1038/s41598-020-59115-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Zeng, H., Yang, C., Zhang, H., Wu, Z., Zhang, J., Dai, G., et al. (2019). A LightGBM-based EEG analysis method for driver mental states classification. Comput. Intell. Neurosci. 2019, 3761203. doi:10.1155/2019/3761203

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Q., Zhou, D., and Zeng, X. (2017). Highly wearable cuff-less blood pressure and heart rate monitoring with single-arm electrocardiogram and photoplethysmogram signals. Biomed. Eng. Online 16, 23. doi:10.1186/s12938-017-0317-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: blood pressure monitoring, photoplethysmography, machine learning, Optuna-tuned LightGBM, hypertension evaluation, wearable devices

Citation: Hu X, Yin S, Zhang X, Menon C, Fang C, Chen Z, Elgendi M and Liang Y (2023) Blood pressure stratification using photoplethysmography and light gradient boosting machine. Front. Physiol. 14:1072273. doi: 10.3389/fphys.2023.1072273

Received: 17 October 2022; Accepted: 06 February 2023;
Published: 20 February 2023.

Edited by:

Aniruddha Sinha, Tata Consultancy Services, India

Reviewed by:

Colin K. Drummond, Case Western Reserve University, United States
Chenxi Yang, Southeast University, China

Copyright © 2023 Hu, Yin, Zhang, Menon, Fang, Chen, Elgendi and Liang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mohamed Elgendi, bW9lLmVsZ2VuZGlAaGVzdC5ldGh6LmNo; Yongbo Liang, bGlhbmd5b25nYm9AZ3VldC5lZHUuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Blood pressure stratification using photoplethysmography and light gradient boosting machine

1 Introduction

2 Materials and methods

2.1 Data acquisition

2.2 Signal preprocessing

2.3 Feature extraction

2.4 Machine learning methods and hyperparameter tuning

2.5 Hypertension classification

3 Results

3.1 Hyperparameter tuning

3.2 Model performance

3.3 Study comparison

4 Discussion

5 Conclusion

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good