Corrigendum on: A life-threatening arrhythmia detection method based on pulse rate variability analysis and decision tree
- 1School of Electrical and Automatic Engineering, Changshu Institute of Technology, Suzhou, China
- 2School of Computer and Information Technology, Northeast Petroleum University, Daqing, China
- 3School of Computer Science and Engineering, Changshu Institute of Technology, Suzhou, China
Extreme bradycardia (EB), extreme tachycardia (ET), ventricular tachycardia (VT), and ventricular flutter (VF) are the four types of life-threatening arrhythmias, which are symptoms of cardiovascular diseases. Therefore, in this study, a method of life-threatening arrhythmia recognition is proposed based on pulse rate variability (PRV). First, noise and interference are wiped out from the arterial blood pressure (ABP), and the PRV signal is extracted. Then, 19 features are extracted from the PRV signal, and 15 features with highly important and significant variation were selected by random forest (RF). Finally, the back-propagation neural network (BPNN), extreme learning machine (ELM), and decision tree (DT) are used to build, train, and test classifiers to detect life-threatening arrhythmias. The experimental data are obtained from the MIMIC/Fantasia and the 2015 Physiology Net/CinC Challenge databases. The experimental results show that the DT classifier has the best average performance with accuracy and kappa coefficient (kappa) of 98.76 ± 0.08% and 97.59 ± 0.15%, which are higher than those of the BPNN (accuracy = 94.85 ± 1.33% and kappa = 89.95 ± 2.62%) and ELM (accuracy = 95.05 ± 0.14% and kappa = 90.28 ± 0.28%) classifiers. The proposed method shows better performance in identifying four life-threatening arrhythmias compared to existing methods and has potential to be used for home monitoring of patients with life-threatening arrhythmias.
1 Introduction
In recent years, cardiovascular diseases have the highest mortality rate and are the “number one killer” of human beings (Roberts and Fair 2021). Among them, acute cardiovascular diseases such as myocardial infarction (MI) and cerebral infarction (CI) have the high suddenness and lethality (Du et al., 2021). If MI and CI are not effectively treated within a few hours after the sudden onset, it will directly lead to the patient’s death.
Life-threatening arrhythmias are a common symptom in patients with CI and MI, and the common life-threatening arrhythmias include EB, ET, VT, and VF (Deaconu et al., 2021), and the definitions of those four life-threatening arrhythmias are given in Table 1 according to the beating rhythm of the heart rate (Alinejad et al., 2019; Paliakaitė et al., 2019). In the initial period of suddenness of life-threatening arrhythmias, patients sometimes experience sudden heart pain that is slight and rapid and disappears after a short rest such as sitting or lying down, which is called “transient” (Chorin et al., 2021). The “transient” of life-threatening arrhythmias is often ignored by patients, which can lead to the sudden illness of dangerous MI, CI, and other acute cardiovascular diseases. On the eve of acute cardiovascular diseases such as MI and CI, significant abnormal changes in physiological parameters such as electrocardiographic (ECG) and blood pressure occur (Jahmunah et al., 2021; Shuvo et al., 2021). Moreover, if these abnormalities can be monitored in time, then patients can be warned of the risk so that they can seek medical help, which would significantly reduce the rate of death from acute cardiovascular disease.
At present, the main detection method is hospital ECG, while the acquisition of ECG signal requires multiple electrodes and cable connection and the process needs professional medical staff’s guidance. If one electrode is wrongly attached, the whole signal is no longer valuable. In addition, the prolonged electrode connection can cause skin irritation (Chou et al., 2019). It is difficult for short-time ECG monitoring to effectively recognize life-threatening arrhythmias with transient; thus, long-term tracking and detection of physiological signals is required to achieve recognition of acute cardiovascular disease outbreaks.
The beat rhythm of the heart is transmitted to the pulse with the blood, and both ECG and pulse period sequences can effectively reflect heartbeat rhythm changes (Mitchell and Schwarzwald, 2021). Heart rate variability (HRV) is calculated from ECG, which reflects the rate of the heartbeat and is used to assess the autonomic nervous system of the heart (Ishaque et al., 2021); thus, HRV can be used for the diagnosis of cardiovascular diseases (Saul and Valenza, 2021). The PRV is extracted from the ABP signal, which reflects the subtle changes in the vascular pulse cycle (Jan et al., 2019). Moreover, the PRV can be utilized to assess cardiovascular autonomic activity (Mejía-Mejía et al., 2021). Studies have shown that the PRV extracted from the ABP signal and HRV obtained from the ECG signal have a strong correlation and are interchangeable in cardiovascular disease monitoring in the supine or resting state (Mejía-Mejía et al., 2020; Hejjel and Béres, 2021). Compared with the ECG signal, the ABP signal acquisition does not require the affixing of multiple electrodes and can be easily affixed to multiple parts of the body, which is easy to operate and can be self-measured (Jan et al., 2019; Mejía-Mejía et al., 2022). Thus, ABP signals are widely used in wearable devices such as bracelets and smart watches (Zhu et al., 2021). The study of the life-threatening arrhythmia detection method based on the PRV signal is expected to be used for home monitoring of life-threatening arrhythmias.
Therefore, based on the PRV signal, this study studies techniques for the recognition of four life-threatening arrhythmias: EB, ET, VT, and VF. First, the interference and noise in the pulse signal are filtered out, and then, the PRV signal is extracted from ABP. Next, the parameters of physiological and pathological changes caused by these four life-threatening arrhythmias are extracted from the PRV signal, and the parameters with high importance and contribution are obtained as feature vectors by RF to train classifiers of BPNN, ELM, and DT to detect these four life-threatening arrhythmias.
This study is structured as follows: Section 2 gives the experimental data we used and describes the process and methods of the experiments; Section 3 describes the experimental results, including signal preprocessing, PRV extraction, feature parameter extraction and dimensionality reduction, and classification results; the discussion of the experimental results is given in Section 4; and the conclusions of the study are presented in Section 5.
2 Materials and methods
2.1 Materials
The experimental data consisted of two groups, both of them from the international physiological signal database: PhysioBank. One group has 10 young (aged 21–31) and 10 elderly (aged 70–85) healthy subjects with equal males and females, which comes from the sub-database “MIMIC/Fantasia” (Iyengar et al., 1996; Goldberger et al., 2000). The other group has patients with four life-threatening arrhythmias consisting of 17 EB, 39 ET, 47 VT and 6 VF subjects, which comes from the sub-database “2015 Physiology Net/CinC Challenge” (Clifford et al., 2015).
The data of healthy subjects: the data of healthy subjects are the PRV signal extracted from the ABP signal in the MIMIC/Fantasia database. Before the data recording, every non-smoking subject underwent a physical examination, and only the healthy subjects were allowed to participate. In addition, each recording includes the continuous ECG, respiration, ABP signals with a sampling rate of 250 Hz, and a duration of 2 h.
The data of patients with four life-threatening arrhythmias: the data are the PRV signal extracted from the ABP signal in the 2015 Physiology Net/CinC Challenge database, which was recorded from patients in the intensive care unit of hospitals. During data recording, two ECGs and one ABP signal were collected from the patients, and all signals were sampled at 250 Hz with a duration of 5 or 5.5 min.
The simulation software is MATLAB 2020b, installed on an Intel(R) Core (TM) i5-6300HQ CPU at 2.30 GHz, Windows-10 64-bit operating system, and installed on a laptop with 8 GB RAM.
2.2 Methods
Figure 1 depicts the processing of the intelligent recognition of those four life-threatening arrhythmias, which includes six steps: the preprocessing of the ABP signal, extraction of PRV, extraction of features, dimensionality reduction of features by RF, life-threatening arrhythmia recognition, and evaluation of results. The details are displayed in the following subsections.
2.2.1 The preprocessing of the arterial blood pressure signal
Noise such as electromyographic (EMG) interference, alternating current (AC) interference, and baseline drift can be generated in the ABP signal acquisition, for example, the ABP signal from an ET patient with noise is displayed in Figure 2. The purpose of ABP signal preprocessing is to wipe out these noises and obtain a clean ABP signal in order to improve the accuracy of PRV extraction. According to the range of frequencies, an integer coefficient notch filter with a stop frequency of 0 Hz, 50 Hz, and its integer multiples are used for de-noising the AC interference and the baseline drift, and an integral coefficient low-pass filter is utilized to eliminate the EMG interference in this study (Chou et al., 2020).
The transfer function F1(Z) of the integer coefficient notch filter is,
In Equation 1, FAP(Z) is the transfer function of the all-pass filter; FBP(Z) is the transfer function of the band pass filter; N is the order of the filter; R and P are the order of the numerator polynomial and denominator polynomial of the transfer function, respectively, where P = fs/f1, fs is the sampling rate of the signal and is 250Hz and f1 is the notch frequency and is 50 Hz here; and Q is the gain of the filter (i.e., the amplification) and should be 2N, which is proportional to the steepness of the notch band, and Q = R/P. In this study, N = 2, P = 5, and Q = 64 by trial and error, and R = PQ = 320. Therefore, Equation 1 becomes
Then, Equation 3 is the difference equation, which is calculated to eliminate the ABP signal containing the AC interference and the baseline drift in real time by Equation 2.
where x1(n) is the latest data of the ABP signal, x1 (n-r) is the r-th sampling data before x1(n), and y1(n) is the output of the integer coefficient notch filter.
The frequency response of the integer coefficient notch filter is
The frequency response is illustrated in Figure 3. The filter with notch frequencies of 0Hz, 50Hz, and 100 Hz can effectively de-noise the AC interference and the baseline drift, and it has linear phase in the passband.
The transfer function F2(Z) of integer coefficient low-pass filter is
where N is the order of the filter, fs is the sampling frequency, f2 is the first-order cut-frequency, and C must be an integer and is fs/f2. Here, fs = 250Hz, f2 = 62.5Hz, N = 2, so C = 4. Therefore, Equation 5 becomes
Then, as displayed in Equation 7, the difference equation is calculated to de-noise the ABP signal containing the EMG interference in real time.
where x2(n) is the latest datum of the ABP signal, x2 (n-c) is the c-th sampling datum before x2(n), and y2(n) is the output of the integer coefficient low-pass filter.
The frequency response of the integer coefficient low-pass filter is
The frequency response is illustrated in Figure 4. The filter with a stop band frequency of 62.5 Hz can effectively suppress the EMG interference, and it has linear phase in the passband.
2.2.2 The extraction of pulse rate variability
Since the cardiac cycle corresponds to the pulsation period, one heartbeat produces one pulse wave. An ABP signal consisting of a series of pulse waves (the red curve) is displayed in Figure 5. The start and end points are two pulse troughs (the solid green dots) corresponding to a complete pulse wave, respectively, where the end point of one pulse wave is the start point of the next pulse wave. In general, the PRV can be calculated by computing the first-order difference between the start points and end points, that is, pulse-to-pulse intervals (PPIs). However, it is not easy to detect troughs due to the small amplitude of the waves corresponding to the pulse troughs, while the waves corresponding to the pulse peaks (the blue hollow cycle) have notable characteristics and are easy to detect. Therefore, in this study, the frequency domain extraction method based on sliding window iterative discrete Fourier transform (SWIDFT) is used to detect the wave peaks (Chou et al., 2014), which can be corrected using a manual calibration method if there are incorrect or missing sampling points. Two adjacent pulse peaks are utilized as the boundary to divide the PRV signal, which is calculated by the time interval between two adjacent peaks, and the equation is as follows
where t is the sampling time of the ABP signal, and fs is the sampling frequency of the ABP signal.
2.2.3 Pulse rate variability feature extracted
So far, the main methods for analyzing physiological signals include time domain analysis, frequency domain analysis, and nonlinear domain analysis, from which some features are extracted to describe changes in heartbeat rhythm for the diagnosis of cardiovascular diseases (Sluyter et al., 2019; Mandal et al., 2021). In this study, 19 features were extracted from the PRV signal based on the description of heart rhythm changes.
2.2.3.1 Feature extraction based on time domain analysis
The PRV signal is quantified in the time domain, and some useful information is extracted from the PRV signal by the statistical analysis method to analyze the temporal variation among the PRV signal and obtain the abnormalities and stability of the cardiovascular system. We extracted seven indexes in the time domain, which are calculated as follows.
1) Mean: the average of the PRV signal, and the equation is
where S(i) is the ith datum of the PRV signal, and n is the length of the PRV signal.
2) Std: the standard deviation of the PRV signal, which can reflect the dispersion of the Mean and the datum of the PRV signal. The equation is
3) RMSD: the root mean square of PRV signal’s difference, which can reflect the degree of rapid change in the PRV signal. The equation is
4) nRMSD: the normalized RMSD, and the equation is
5) PNN40: the percentage of difference in time intervals between adjacent sampling points of a PRV signal greater than 40 ms. The higher the value, the higher the nervous system tension. The equation is
where NN40 is the number of time intervals between two adjacent sampling points in a PRV signal that exceed 40 ms, and TotalNN is the number of sampling points intervals of a PRV signal.
6) PNN70: the percentage of difference in time intervals between adjacent sampling points of a PRV signal greater than 70 ms. The equation is
where NN70 is the number of time intervals between two adjacent sampling points in a PRV signal that exceed 70 ms.
7) Mid: the median of the PRV signal, which represents a value in the PRV signal distribution that can divide the PRV signal into two groups. For a sequence of PRV signal from small to large, when n is an odd number, the equation is
When n is an even number, the equation is
8) IQR: the interquartile range of the PRV signal, which describes the dispersion of the PRV signal. The equation is
where S75 is the third quartile, and S25 is the first quartile.
9) RMSD_APM: the root mean square of amplitude’s (APM) difference, which can reflect the degree of rapid change in APM. The calculation method is the same as Equation 12.
2.2.3.2 Feature extraction based on frequency domain analysis
The power spectrum is calculated using the autoregressive (AR) model for the PRV signal, from which the features are extracted according to the frequency range to reflect the stability of cardiovascular activity within the human body and to obtain information about the variability of the cardiovascular system (Fallet et al., 2019).
1) LF_HF: the ratio of low frequency (LF) to high frequency (HF), which can reflect a balanced state of sympathetic and parasympathetic tone.
where the range of LF is 0.04–0.15, and the range of HF is 0.15–0.4.
2.2.3.3 Feature extraction based on nonlinear domain analysis
The methods of nonlinear domain analysis are Poincaré plot (Nordin et al., 2019) and entropy (Rohila and Sharma, 2019), where the Poincaré plot can be approximated as an ellipse with the horizontal axis of a single time interval of the PRV signal and the vertical axis of time interval of two adjacent PRV signals, which can be utilized to reflect the variation of different PRV signals. The following are some relevant features of the calculation.
1) Sd1:Sd2: the ratio of the long half-axis (Sd1) to the short half-axis (Sd2) of the ellipse. The equation is
where Sd1 is defined as
and Sd2 is defined as
2) Se: the area of the ellipse is
3) TPR_PR: the turning point ratio of the PRV signal, which can measure the randomness of the PRV signal. The equation is
where Num is used to count the number of turning point.
4) ShE: the Shannon entropy of the PRV signal, and the equation is
where P(i) is the probability of the i-th datum in the PRV signal.
5) SamE_PR: the sample of the PRV signal, and the calculation is as follows:
where Bm(r) is the average probability of the PRV signal when the embedding dimension is m, and Bm+1(r) is the average probability of the PRV signal when the embedding dimension is m+1. Here, m is equal to 2.
6) CSampEn: the coefficients of sample entropy, and the calculation is as follows (Eerikäinen et al., 2018)
where r is the tolerance and is equal to 0.25 here, and S is the PRV signal in the buffer.
7) PE_PR: the permutation entropy of the PRV signal, and the equation is
where P(i) is the probability of occurrence of mode i.
8) SamE_APM: the sample entropy of APM, and the calculation method is the same as that of Equation 26.
9) TRP_APM: the turning point ratio of APM, and the calculation method is the same as that of Equation 24.
2.2.4 Feature dimensionality reduction
In this study, a method of RF is used to measure the importance of the feature parameters and to reduce the feature dimensionality with less information loss (Qi, 2012), which will make the subsequent recognition of life-threatening arrhythmias more efficient without overfitting due to too many features. The method of RF in the “neural network toolbox” of MATLAB 2020b is exploited to calculate the importance of feature parameters for feature dimensionality reduction. The related functions are as follows:
RF_model = classRF_train (p_train, t_train, ntree, mtree, extra_options);
Feature_measure = RF_model.importance.
The function “classRF_train” is engaged to train the RF model. The input parameters p_train and t_train are the features and labels of the training set, respectively. The input parameter ntree is the number of trees; here, it is 100. The input parameter mtree is the number of predictors used for segmentation at each node; here, it is a rounding down for the number of features, that is, 4. The input parameter extra_options is used to control the RF model.
The function “RF_model.importance” allows to calculate feature weights using accuracy and Gini index. The accuracy and Gini index reflect the importance of the features, and the larger the value of accuracy and Gini index, the more important the feature is.
2.2.5 Life-threatening arrhythmia classification
In this study, supervised learning methods, which include BPNN, ELM, and DT, are engaged to design classifiers to identify four life-threatening malignant arrhythmias. The BPNN and DT classifiers are built, trained, and tested with the “neural network toolbox” in MATLAB 2020b. In addition, the classification performance is analyzed using Kappa coefficients, accuracy, and time consumption.
2.2.5.1 BPNN classifier
A BPNN classifier consists of an input layer, one or more hidden layers, and an output layer. After entering the training set into the input layer, the training set is calculated by weights and thresholds in the hidden layer, and the result is transported to the output layer to calculate a prediction value. If the error between the predicted value and the expected value is too large, the error is passed to the input layer and calculated again until the predicted value and the expected value meet the requirements (Hamdani et al., 2022). The BPNN classifier is composed of the following three functions:
Net = feedforwardnet (option);
Net_BP = train (Net, p_train, t_train);
Error_sim_BP = sim (Net_BP, p_test).
The function “feedforwardnet ( )” is utilized to build the BPNN classifier. The option is the number of nodes in every layer of the BPNN; here, the number of nodes in one input layer, two hidden layers, and one output layer is 5, 15, 15, and 1, respectively. In addition, the number of training sessions, the minimum error of the training target, and the learning rate are set to 3,000, 0.001, and 0.1, respectively. The training function and the transfer function of the second hidden layer use “BFGS Quasi-Newton” and “sigmoid,” respectively. The parameter Net is the design result of the classifier.
The function “train ( )” is exploited to train the BPNN classifier. The input parameters p_train and t_train are the features and labels of the training set, respectively. The output parameter Net_BP is the predicted value of the BPNN after training.
The function “sim ( )” is engaged to test the BPNN classifier. The feature of the test set p_test is compared with the predicted value until the training parameters are satisfied, and the classification result Error_sim_BP is obtained.
2.2.5.2 ELM classifier
The ELM classifier has the same structure as the BPNN classifier, and they both belong to the feed-forward neural network, while the hidden layer of ELM classifier is one. The weights and thresholds of the BPNN classifier are constantly changing, while the ELM classifier generate the unchanged weights and thresholds initially, which will save a lot of time compared to the training of BPNN classifier (Wang et al., 2021).
The key points in the building and training ELM classifier are the calculation of the connection weights (IW) between the hidden layer and the input layer, the thresholds (B) of the hidden layer neurons and the connection weights (LW) between the hidden layer and the output layer. Here, the number of nodes in the input layer, hidden layer, and output layer are 12, 300, and 5, respectively. In addition, IW and B are randomly generated by function “rand ( )” in MATLAB 2020b, where IW = rand (300,15) * 2–1, B = rand (300,1), and LW is calculated with the help of the function “pinv ( ).” The predicted value Y is computed by performing the classification using the sinusoidal transfer function based on parameters IW, B, and LW, and Y is equal to the inverse matrix of the inverse matrix of the output in the hidden layer (H) by LW; then, the maximum value of all the features of Y is used as the label, marked as 1, the others are 0, and the new predicted value is output.
2.2.5.3 DT classifier
The DT has a top-down structure, growing down from the root to the nodes in a certain order to make a decision, and getting results at the leaves (Charbuty and Abdulazeez, 2021). The two functions of the DT classifier are as follows:
Ctree = fitctree (p_train, t_train);
T_sim_tree = predict (Ctree, p_test).
The function “fitctee ( )” is exploited to build and test the DT classifier, and the output parameter Ctree is the trained decision tree. The function “predict” is utilized to test the trained decision tree. Here, the feature space of the training and testing sets for the input parameters p_train, t_train, and p_test is different from that of BPNN and the ELM classifiers, which should be the number of samples ×feature properties.
3 Results
3.1 The preprocessing results
After de-noising the ABP signal using an integer coefficient notch filter and an integer coefficient low-pass filter, a clean ABP signal is obtained. Figure 6 displays the ABP before and after filtering for an ET patient. The AC interference and baseline drift are presented in Figure 6A, and the red box of Figure 6A is enlarged to Figure 6B in order to clearly observe these noises. It can be observed that the AC interference and the baseline drift have been wiped out in Figure 6C, the EMG interference has been eliminated in Figure 6D based on Figure 6C, and it can be clearly observed that the burr is eliminated in Figure 6C.
FIGURE 6. ABP signal before and after filtering for an ET patient. (A) ABP signal before filtering. (B) Enlargement of the red box (A). (C) De-nosing the ABP signal by an integer coefficient notch filter. (D) De-nosing the ABP signal obtained (C) by an integer coefficient low-pass filter.
3.2 Pulse rate variability extraction results
The results of Peaks in subjects extracted from the ABP signal of different groups by the methods of SWIDFT and manual calibration are illustrated in Figure 7. Also, it can be observed that the method is highly accurate and robustly stable, which can be engaged effectively for PRV calculations.
FIGURE 7. Peaks of ABP signal extracted. (A) Peak detection of a healthy young subject. (B) Peak detection of a healthy old subject. (C) Peak detection of an EB subject. (D) Peak detection of an ET subject. (E) Peak detection of a VT subject. (F) Peak detection of a VF subject.
The PRV results of subjects extracted based on the Peaks detection are illustrated in Figure 8, from which it is obvious that the amplitudes of those PRV signals are different. The PRV signals’ average of healthy young, healthy old, EB, ET, VT, and VF are 75.902 beat per minute (bpm), 60.282 beat per minute (bpm), 44.462 bpm, 137.598 bpm, 112.760 bpm, and 84.714 bpm, respectively. The average heartbeat of EB is the lowest, while that of ET is the highest. The average heartbeat of VF is higher than that of the EB subjects and lower than that of the VT subjects.
FIGURE 8. PRV extracted. (A) PRV of a healthy young subject. (B) PRV of a healthy old subject. (C) PRV of an EB subject. (D) PRV of an ET subject. (E) PRV of a VT subject. (F) PRV of a VF subject.
3.3 Feature extraction results
A total of 19 features (defined in Section 2.2.3) were extracted from the PRV signal, and the statistical results of the features are presented as “mean ± standard deviation” in Table 2. In total, we extracted the 143853 PRV signal from the ABP signal, for which we calculated a feature vector of 19 × 139719. Among them, the feature vectors for healthy, EB, ET, VT, and VF subjects are 19 × 93516, 19 × 6475, 19 × 16124, 19 × 22083, and 19 × 1521, respectively.
3.4 Feature dimensionality reduction results
To reduce the complexity of the algorithm without affecting the accuracy as much as possible, the features extracted from the PRV signal need to be dimensionalized by the method of RF.
The magnitude of the features calculated with RF is displayed in Figure 9, the mean decrease of accuracy is given in Figure 9A, and the mean decrease of Gini index is given in Figure 9B. For each feature, the trend of the Gini index and accuracy is essentially the same, which ensures the correctness of feature importance on both sides. The feature values of the accuracy and Gini index are illustrated in the third and fourth columns of Table 3. Table 3 displays the result of feature dimensionality reduction with RF, and the statistical results of healthy and four life-threatening arrhythmia patients are shown in column 2. According to Figure 8 and the feature values of accuracy and Gini index, 15 feature parameters are selected, which contains most of the information about the PRV signal. In addition, the results of feature selection (h) shows in the last column of Table 3, where h = 1 is the feature accepted and h = 0 is the feature rejected. Therefore, in this study, 15 features can be exploited to detect life-threatening arrhythmias, and the feature vector becomes 15 × 139719.
3.5 Classification results
In this study, kappa coefficients (Islam et al., 2018) and accuracy (Sabut et al., 2021) were exploited to calculate the average performance of supervised learning to recognize the life-threatening arrhythmia results. The accuracy is calculated as follows
where the parameters “TN,” “FP,” “TP,” and “FN” are true negative, false positive, true positive, and false negative of the classification result, respectively.
The kappa coefficient (kappa) is calculated as follows
Moreover, the kappa(i) is utilized to evaluate the average performance of the classification results for the healthy and the four life-threatening arrhythmia subjects, and the i is the label of five types of subjects.
where “p1” is the rate of correct classification, “p2” is the rate of incorrect classification, “Qtt” is the sum of elements on the diagonal of the column matrix, “r” is the number of features, “M” is the number of classes, and “Qt+” and “Q+t” are the sum of elements on tth row and column, respectively. Here, “r” and “M” are equal to 15 and 5, respectively. The kappa∈[-1,1], and the closer the value to 1, the better the classification result.
In Table 3, 15 features were selected by RF to form the feature vector. Therefore, the size of the feature vector becomes 15 × 139,719. We randomly selected 80 and 20 percent of the features as the training set and the test set, respectively. The features of the training and test sets were randomly changed 100 times to minimize the influence of input data differences, and the procedure was run 100 times in order to verify the classification accuracy of BPNN, ELM, and DT. The results of classification performance are displayed as “mean ± standard deviation” in Table 3, and 1, 2, 3, 4, and 5 are the labels for healthy, EB, ET, VT, and VF subjects, respectively.
As demonstrated in Table 4, the average performance of the classifier was verified with accuracy and kappa coefficient, whose result of BPNN is 94.85 ± 1.33% and 89.95 ± 2.62%, that of ELM is 95.05 ± 0.14% and 90.28 ± 0.28%, and that of DT is 98.76 ± 0.08% and 97.59 ± 0.15%. Therefore, the DT classifier has the best average performance in identifying those four life-threatening arrhythmias. In addition, the time consumption of BPNN is 100.58 ± 26.49 s, that of ELM is 8.63 ± 0.22 s, and that of DT is 1.12 ± 0.09 s. In brief, the performance of the DT classifier is optimal in the detection of the four arrhythmias. For identifying these life-threatening arrhythmias with the DT classifier, healthy subjects have the highest average performance with kappa (1) of 99.94 ± 0.05%, and VF patients have the lowest average performance with kappa (5) of 77.87 ± 2.39%. In addition, the average performance to detect EB, ET, and VF are all over 95.00%. With regard to time consumption, the DT and ELM classifiers take significantly less time than the BPNN classifier, which is because the BPNN classifier needs to constantly adjust the weights and thresholds.
4 Discussion
In this study, we propose a method to recognize four life-threatening arrhythmias based on the PRV signal calculated from the ABP signal of 2015 “PhysioNet/CinC” and “Fantasia” databases. A total of 19 features were extracted, and 15 of them were selected after feature dimensionality reduction to train and test the classifier. It can be illustrated that the DT classifier has the best average performance with accuracy and kappa of 97.59 ± 0.15% and 99.94 ± 0.05% in Table 4, respectively.
Figure 10 presents the ABP and PRV signals in different types of patients, where the green line is the standard of whether the disease is present or not, and the EB (1), EB (2), ET (1), VT (1), and VF (1) are the signals of those four life-threatening burst periods. In general, the sudden segment signals EB (1), EB (2), ET (1), VT (1), and VF (1) in patients are used for the recognition of life-threatening arrhythmias. However, since the “transient” of life-threatening arrhythmias can paralyze the patient and the signal can change rapidly and return to normal values, it is more important to confirm for the patient before the burst, which can alert the patients and send them to the hospital in time. The method we used detects not only the burst segment signal but also the normal segment signal before and after the burst, that is, the complete PRV signal in Figure 10, which is effective in identifying episodes of life-threatening arrhythmias.
FIGURE 10. ABP and PRV signals of patients. (A) ABP and PRV signals of a patient with EB. (B) ABP and PRV signals of a patient with ET. (C) ABP and PRV signals of a patient with VT. (D) ABP and PRV signals of a patient with VF.
So far, some researchers have studied the recognition of life-threatening arrhythmias. For example, Lee, K. et al. utilized feature parameters RMSD and ShE to identify AF (Lee et al., 2017), Eerikäinen, L.M. et al. used feature parameters PNN40, PNN70, ShE, RMSD, nRMSD, SampEn, and CSampEn to detect AF (Eerikäinen et al., 2018). Although these methods detect other cardiovascular diseases rather than those four life-threatening arrhythmias described in this study, they can provide ideas for our study. Therefore, the recognition of life-threatening arrhythmias is performed by the method used in this work for the extracted features of these researchers.
The average performance results of training and testing the DT classifier with the features extracted by Lee, K. et al. and Eerikäinen, L.M. et al. are displayed in Table 5. For the features extracted by Lee, K. et al., the performance of the DT classifier gives an accuracy of 83.32 ± 0.22% and kappa of 65.88 ± 0.42%, and the best average performance is ET with the kappa of 68.81 ± 0.92%. For the features extracted by Eerikäinen, L.M. et al., the performance of the DT classifier gives an accuracy of 95.27 ± 0.16% and kappa of 90.72 ± 0.32%, and the best average performance is healthy with the kappa of 95.90 ± 0.37%. In addition, the average performance of identifying those four life-threatening arrhythmias using the features utilized in Eerikäinen, L.M. et al. is better than that of Lee, K. et al., and the difference between them for accuracy and kappa is 11.95% and 24.84%, respectively, which is due to the fact that more features are engaged by Eerikäinen, L.M. et al. However, the accuracy and kappa values of Eerikäinen et al. are 3.49% and 6.87% lower than those of the method we used, which proves that the more features there are, the more comprehensive the information contained, and the higher the classification performance. However, it is not better to use more features if these features are correlated; it will cause a dimensional disaster which will affect the training of the model, reduce the average performance of the classification, and be more time consuming.
5 Conclusion
In this study, a method is presented to identify four types of life-threatening arrhythmia identification based on the PRV signal. First, the noise of ABP signals is eliminated during preprocessing to de-noise the EMG interference, AC interference, and baseline drift. Then, PRV signals are extracted, and 15 features are obtained and downscaled from the PRV signal to form a feature vector. Finally, the BPNN, ELM, and DT classifiers are trained and tested based on the feature vector. The results show that DT has the best average performance with an accuracy of over 98.50% and a kappa of over 97.50%, which is better than some previous studies. Therefore, the method we used can effectively detect EB, ET, VT, and VF and has a potential for monitoring at home. In subsequent studies, the detection of motion artifacts will be added to the preprocessing part to improve the signal availability, and some algorithms such as feature extraction will be optimized. In the future, the DT model based on PRV signals is expected to be used for the recognition of other life-threatening arrhythmias.
Data availability statement
The datasets for this study can be found at the https://www.physionet.org/content/fantasia/1.0.0/ and https://www.physionet.org/content/challenge-2015/1.0.0/.
Author contributions
LC and YC provided ideas, designed the study, and designed the algorithm. LC, JL, and SG reviewed relevant literature and organized data. All authors have read and approved the final manuscript.
Funding
This work was supported by the National Natural Science Foundation of China (grant 61901062 and 62003057).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2022.1008111/full#supplementary-material
References
Alinejad G. M., Rasoulinezhad S., Shamsollahi M. B. (2019). “Prediction of life-threatening heart arrhythmias using obstructive sleep apnoea characteristics,” in Proceeding of the 2019 27th Iranian Conference on Electrical Engineering (ICEE), Yazd, Iran, April 2019 (IEEE), 1761–1764.
Charbuty B., Abdulazeez A. (2021). Classification based on decision tree algorithm for machine learning. Appl. Sci. Technol. Trends 2, 20–28. doi:10.38094/jastt20165
Chorin E., Hochstadt A., Schwartz A. L., Matz G., Viskin S., Rosso R. (2021). Continuous heart rate monitoring for automatic detection of life-threatening arrhythmias with novel bio-sensing technology. Front. Cardiovasc. Med. 748, 707621. doi:10.3389/fcvm.2021.707621
Chou Y. X., Gu J., Liu J. C., Gu Y., Lin J. (2019). Bradycardia and tachycardia detection using a synthesis-by-analysis modeling approach of pulsatile signal. IEEE Access 7, 131256–131269. doi:10.1109/ACCESS.2019.2940921
Chou Y., Zhang A., Wang P., Gu J. (2014). Pulse rate variability estimation method based on sliding window iterative DFT and hilbert transform. J. Med. Biol. Eng. 34, 347–355. doi:10.5405/JMBE.1684
Chou Y., Zhang A., Gu J., Liu J., Gu Y. (2020). A recognition method for extreme bradycardia by arterial blood pressure signal modeling with curve fitting. Physiol. Meas. 41, 074002. doi:10.1088/1361-6579/ab998d
Clifford G. D., Silva I., Moody B., Li Q., Kella D., Shahin A., et al. (2015). “The PhysioNet/computing in cardiology challenge 2015: Reducing false arrhythmia alarms in the ICU,” in 2015 computing in Cardiology conference. (Nice, France: CinC), 273–276. doi:10.1109/CIC.2015.7408639
Deaconu A., Deaconu S., Radu A., Dorobantu M. (2021). Pharmacological treatment of tachyarrhythmias in acute myocardial infarction-a review. Romanian J. Cardiol. 31, 311–317. doi:10.47803/rjc.2021.31.2.311
Du H., Feng L., Xu Y., Zhan E., Xu W. (2021). Clinical influencing factors of acute myocardial infarction based on improved machine learning. J. Healthc. Eng. 2021, 5569039. doi:10.1155/2021/5569039
Eerikäinen L. M., Bonomi A. G., Schipper F., Dekker L. R., Vullings R., de Morree H. M., et al. (2018). Comparison between electrocardiogram-and photoplethysmogram-derived features for atrial fibrillation detection in free-living conditions. Physiol. Meas. 39 (8), 084001. doi:10.1088/1361-6579/aad2c0
Fallet S., Lemay M., Renevey P., Leupi C., Pruvot E., Vesin J. M. (2019). Can one detect atrial fibrillation using a wrist-type photoplethysmographic device? Med. Biol. Eng. Comput. 57, 477–487. doi:10.1007/s11517-018-1886-0
Goldberger A. L., Amaral L., Glass L., Hausdorff J. M., Ivanov P. C., Mark R. G., et al. (2000). Physiobank, physiotoolkit, and physionet: Components of a new research resource for complex physiologic signals. Circulation 101, e215–e220. doi:10.1161/01.cir.101.23.e215
Hamdani H., Arifin Z., Septiarini A. (2022). Expert system of dengue disease using artificial neural network classifier. J. Inform. 10, 59–66. doi:10.30595/juita.v10i1.12476
Hejjel L., Béres S. (2021). Comment on ‘pulse rate variability in cardiovascular health: A review on its applications and relationship with heart rate variability. Physiol. Meas. 42, 018001. doi:10.1088/1361-6579/abd332
Islam M. S., Khreich W., Hamou-Lhadj A. (2018). Anomaly detection techniques based on kappa-pruned ensembles. IEEE Trans. Reliab. 67, 212–229. doi:10.1109/TR.2017.2787138
Iyengar N., Peng C. K., Morin R., Goldberger A. L., Lipsitz L. A. (1996). Age-related alterations in the fractal scaling of cardiac interbeat interval dynamics. Am. J. Physiol. 271, R1078–R1084. doi:10.1152/ajpregu.1996.271.4.R1078
Ishaque S., Khan N., Krishnan S. (2021). Trends in heart-rate variability signal analysisFrontiers in Digital Health3, 13doi:10.3389/fdgth.2021.639444
Jahmunah V., Ng E. Y. K., San T. R., Acharya U. R. (2021). Automated detection of coronary artery disease, myocardial infarction and congestive heart failure using GaborCNN model with ECG signals. Comput. Biol. Med. 134, 104457. doi:10.1016/j.compbiomed.2021.104457
Jan H. Y., Chen M. F., Fu T. C., Lin W. C., Tsai C. L., Lin K. P. (2019). Evaluation of coherence between ECG and PPG derived parameters on heart rate variability and respiration in healthy volunteers with/without controlled breathing. J. Med. Biol. Eng. 39, 783–795. doi:10.1007/s40846-019-00468-9
Lee K., Choi H. O., Min S. D., Lee J., Gupta B. B., Nam Y. (2017). A comparative evaluation of atrial fibrillation detection methods in Koreans based on optical recordings using a smartphone. IEEE Access 5, 11437–11443. doi:10.1109/ACCESS.2017.2700488
Mandal S., Mondal P., Roy A. H. (2021). Detection of Ventricular Arrhythmia by using Heart rate variability signal and ECG beat image. Biomed. Signal Process. Control 68, 102692. doi:10.1016/j.bspc.2021.102692
Mejía-Mejía E., May J. M., Torres R., Kyriacou P. A. (2020). Pulse rate variability in cardiovascular health: A review on its applications and relationship with heart rate variability. Physiol. Meas. 41, 07TR01. doi:10.1088/1361-6579/ab998c
Mejía-Mejía E., May J. M., Elgendi M., Kyriacou P. A. (2021). Differential effects of the blood pressure state on pulse rate variability and heart rate variability in critically ill patients. NPJ Digit. Med. 4, 82. doi:10.1038/s41746-021-00447-y
Mejía-Mejía E., May J. M., Kyriacou P. A. (2022). Effects of using different algorithms and fiducial points for the detection of interbeat intervals, and different sampling rates on the assessment of Pulse Rate Variability from Photoplethysmography. Comput. Methods Programs Biomed. 218, 106724. doi:10.1016/j.cmpb.2022.106724
Mitchell K. J., Schwarzwald C. C. (2021). Heart rate variability analysis in horses for the diagnosis of arrhythmias. Vet. J. 268, 105590. doi:10.1016/j.tvjl.2020.105590
Nordin A. D., Hairston W. D., Ferris D. P. (2019). Faster gait speeds reduce alpha and beta EEG spectral power from human sensorimotor cortex. IEEE Trans. Biomed. Eng. 67, 842–853. doi:10.1109/TBME.2019.2921766
Paliakaitė B., Petrėnas A., Sološenko A., Marozas V. (2019). “Photoplethysmogram modeling of extreme bradycardia and ventricular tachycardia,” in Mediterranean conference on medical and biological engineering and computing (Cham: Springer), 1165–1174.
Qi Y. (2012). Random forest for bioinformatics. Springer, Boston: Ensemble machine learning, 307–323. doi:10.1007/978-1-4419-9326-7_11
Roberts R., Fair J. (2021). Genetics, its role in preventing the pandemic of coronary artery disease. Clin. Cardiol. 44, 771–779. doi:10.1002/clc.23627
Rohila A., Sharma A. (2019). Phase entropy: A new complexity measure for heart rate variability. Physiol. Meas. 40, 105006. doi:10.1088/1361-6579/ab499e
Sabut S., Pandey O., Mishra B. S. P., Mohanty M. (2021). Detection of ventricular arrhythmia using hybrid time–frequency-based features and deep neural network. Phys. Eng. Sci. Med. 44, 135–145. doi:10.1007/s13246-020-00964-2
Saul J. P., Valenza G. (2021). Heart rate variability and the dawn of complex physiological signal analysis: Methodological and clinical perspectives. Philos. Trans. A Math. Phys. Eng. Sci. 379, 20200255. doi:10.1098/rsta.2020.0255
Shuvo S. B., Ali S. N., Swapnil S. I., Al-Rakhami M. S., Gumaei A. (2021). CardioXNet: A novel lightweight deep learning framework for cardiovascular disease classification using heart sound recordings. IEEE Access 9, 36955–36967. doi:10.1109/ACCESS.2021.3063129
Sluyter J. D., Camargo C. A., Lowe A., Scragg R. K. R. (2019). Pulse rate variability predicts atrial fibrillation and cerebrovascular events in a large, population-based cohort. Int. J. Cardiol. 275, 83–88. doi:10.1016/j.ijcard.2018.10.026
Wang J., Lu S., Wang S. H., Zhang Y. D. (2021). A review on extreme learning machine. Multimed. Tools Appl. 22, 1–50. doi:10.1007/s11042-021-11007-7
Keywords: pulse rate variability, arterial blood pressure, cardiovascular diseases, life-threatening arrhythmias, decision tree, intelligent recognition
Citation: Chou L, Liu J, Gong S and Chou Y (2022) A life-threatening arrhythmia detection method based on pulse rate variability analysis and decision tree. Front. Physiol. 13:1008111. doi: 10.3389/fphys.2022.1008111
Received: 31 July 2022; Accepted: 23 September 2022;
Published: 14 October 2022.
Edited by:
Jichao Zhao, The University of Auckland, New ZealandReviewed by:
Lihui Zheng, Chinese Academy of Medical Sciences and Peking Union Medical College, ChinaAdrian Luca, Centre Hospitalier Universitaire Vaudois (CHUV), Switzerland
Copyright © 2022 Chou, Liu, Gong and Chou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yongxin Chou, Y3NsZ2Nob3V5eEBjc2xnLmVkdS5jbg==