- 1Computer, Electrical, and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Makkah, Saudi Arabia
- 2Electrical Engineering Department, Alfaisal University, Riyadh, Saudi Arabia
- 3National Institute for Research in Digital Science and Technology INRIA, Saclay, France
Carotid-to-femoral pulse wave velocity (cf-PWV) is considered a critical index to evaluate arterial stiffness. For this reason, estimating Carotid-to-femoral pulse wave velocity (cf-PWV) is essential for diagnosing and analyzing different cardiovascular diseases. Despite its broader adoption in the clinical routine, the measurement process of carotid-to-femoral pulse wave velocity is considered a demanding task for clinicians and patients making it prone to inaccuracies and errors in the estimation. A smart non-invasive, and peripheral measurement of carotid-to-femoral pulse wave velocity could overcome the challenges of the classical assessment process and improve the quality of patient care. This paper proposes a novel methodology for the carotid-to-femoral pulse wave velocity estimation based on the use of the spectrogram representation from single non-invasive peripheral pulse wave signals [photoplethysmography (PPG) or blood pressure (BP)]. This methodology was tested using three feature extraction methods based on the semi-classical signal analysis (SCSA) method, the Law’s mask for texture energy extraction, and the central statistical moments. Finally, each feature method was fed into different machine learning models for the carotid-to-femoral pulse wave velocity estimation. The proposed methodology obtained an $R2\geq0.90$ for all the peripheral signals for the noise-free case using the MLP model, and for the different noise levels added to the original signal, the SCSA-based features with the MLP model presented an $R2\geq0.91$ for all the peripheral signals at the level of noise. These results provide evidence of the capacity of spectrogram representation for efficiently assessing the carotid-to-femoral pulse wave velocity estimation using different feature methods. Future work will be done toward testing the proposed methodology for in-vivo signals.
1 Introduction
Carotid-to-femoral pulse wave velocity (cf-PWV) is considered a critical index to evaluate arterial stiffness. For this reason, estimating Carotid-to-femoral pulse wave velocity (cf-PWV) is essential for diagnosing and analyzing different cardiovascular diseases. Despite its broader adoption in the clinical routine, the measurement process of cf-PWV is considered a demanding task for clinicians and patients making it prone to inaccuracies and errors in the estimation. A smart non-invasive, and peripheral measurement of cf-PWV could overcome the challenges of the classical assessment process and improve the quality of patient care. This paper proposes a novel methodology for the cf-PWV estimation based on the use of the spectrogram representation from single non-invasive peripheral pulse wave signals [photoplethysmography (PPG) or blood pressure (BP)]. This methodology was tested using three feature extraction methods based on the semi-classical signal analysis (SCSA) method, the Law’s mask for texture energy extraction, and the central statistical moments. Finally, each feature method was fed into different machine learning models for the cf-PWV estimation. The results obtained for each feature method provide evidence of the capacity of spectrogram representation combined with machine learning models as an intelligent tool for efficiently assessing the cf-PWV estimation. Cardiovascular diseases (CVDs) are the leading cause of mortality worldwide, with 17.9 million deaths in 2019, representing 32% of all global deaths, (Mensah et al., 2019). Patients at risk of evolving CVDs are assessed by evaluating different bio-markers ranging from age and sex to arterial stiffness (AS), (Gade et al., 2021). Arterial stiffness is considered one of the highest risk markers and has attracted much attention in clinical and experimental studies, (van Sloten et al., 2014). Arterial stiffness depicts the rigidity of the arterial vessels, positively associated with arterial pulse pressure, which can significantly affect the heart and vascular physiology.
Over the last decades, a myriad of techniques for evaluating AS have been explored and validated, some of which are more widely functional nowadays in clinical practice than others, for example, the cardio-ankle vascular index that reflects the stiffness from the ascending aorta to the ankle arteries (Matsushita et al., 2019), the pulse pressure defined as the difference between the diastolic and systolic pressure (Mackenzie et al., 2002) assessing arterial stiffness. It is usually evaluated by dividing the distance traveled by the pulse wave between two arterial sites divided by the time taken to travel the distance (path length between the two sites). When the two arterial sites are the carotid and femoral sites, then we refer to the Carotid to femoral pulse wave velocity (cf-PWV) and usually provide information on the central arterial stiffness. The feasibility of cf-PWV in evaluating vascular stiffness has been validated through a strong correlation with major parameters and conditions such as hypertension severity levels, vascular aging, and atherosclerosis (Blacher et al., 1999; Shokawa et al., 2005; Mattace-Raso et al., 2006; Willum Hansen et al., 2006; Choi et al., 2007; Kim and Kim, 2019). Despite the crucial role of cf-PWV, there is no reliable method for estimating the cf-PWV. Most methods that exist in the literature have limitations. For instance, they rely on experienced personnel to realize the correct measurement, consisting in acquiring the carotid and femoral pressure waveform and measuring the traveling distance, as highlighted in (Matsushita et al., 2019). The measurement can therefore be subject to errors and inaccuracies, in particular, when evaluating the path length between the carotid and femoral sites (Tavallali et al., 2018). Further discussions on the measurement modalities and the main advantages and limitations of cf-PWV measurements can be read in (Rajzer et al., 2008).
Recent papers have investigated the use of Artificial Intelligence (AI) in estimating pulse wave velocity where non-invasive available measurements are used. AI-based approaches present the advantage of estimating the cf-PWV from non-invasive measurements which can be incorporated into the clinical routine without involving any complex protocol or experienced personnel. For instance, a non-calibrated carotid tonometry pressure waveform has been combined with a clinical routine variable to feed a machine learning model with Intrinsic Frequency features (Tavallali et al., 2015). Recently, a multi-layers perceptron-based cf-PWV estimation using fiducial points-based features extracted from the photoplethysmogram (PPG) signal and its first, second, and third derivatives, has been proposed. Another investigation by Weiwei et al., in (Jin et al., 2021), has proposed two machine learning pipelines, namely the Gaussian process regression and Recurrent Neural Network for the cf-PWV estimation from the radial blood pressure waveform. The two proposed machine learning pipelines used key features generated from the timing and magnitude of the fiducial points and the heart rate. More recently, in 2022, (Garcia et al., 2022) used a Multiple Linear Regression model to study the feasibility of the Semi-Classical Signal Analysis (SCSA)-based features extracted from Blood Pressure (BP) and PPG signals extracted from peripheral locations. In this study, feature extraction from a two-dimensional signal representation of the BP and PPG signals improved the estimation accuracy and robustness compared to the original one-dimensional signals’ results. Finally, (Li et al., 2022) proposed the cf-PWV prediction based on the XGBoost algorithm using wrist photoplethysmogram (wPPG) signals acquired from wearable devices. Despite the promising results, AI-based algorithms are not yet reliable and require improvements in terms of accuracy but also in terms of the universality of the algorithms, which refers to the fact that they can perform well for data that have not been considered in the training of the machine learning model.
In this paper, our objective is to contribute to improving AI-based algorithms for the estimation of the cf-PWV by proposing the use of spectrograms of pulse wave signals instead of one-dimensional signals. We believe that using the spectrogram, which provides both temporal and frequency dimensions of the signal, will help in improving the accuracy of measuring cf-PWV and therefore would help in including the cf-PWV measure in the clinical routine practice without the need for an expert. The use of spectrogram representation on PPG signals has been studied in the past proving good performance over different applications. In 2020, (Donida Labati et al., 2021) used a SVM model with features extracted from the PPG spectrogram for biometric recognition. Another use of PPG spectrogram representation is presented by (Siam et al., 2021) where they use the spectrogram as an input image for Blood Pressure estimation using Siamese networks and Convolutional neural networks (CNN).
This work used the spectrogram representation from peripheral signals for cf-PWV estimation using three feature methods. The first feature type was based on the Semi-Classical Signal Analysis (SCSA) method that relies on the Schrodinger operator’s spectral problem. The second type was based on the Law’s mask filters that compute the energy texture of an image, and the third was based on the central moments that give a statistical description of the image. Finally, these features were fed individually to different machine learning models to obtain the final estimation. Figure 1 illustrates the proposed estimation pipeline of the cf-PWV.
FIGURE 1. Schematic illustration of the estimation pipeline of the carotid-to-femoral pulse wave velocity based on pulse wave images. PPG indicates the photoplethysmography signals, BP the blood pressure signals and ML indicates machine learning models.
2 Materials and methods
2.1 Materials
2.1.1 Dataset
Due to the absence of real hemodynamic data to validate and test the proposed approach, in this paper, we used an in silico hemodynamic public database1. A pre-validated one-dimensional model has been used for generating the database of simulated pulse wave signals at different arterial locations. This model generates the signals based on different cardiovascular properties such as age, heart rate, blood density, and arterial diameter among other cardiovascular parameters (Charlton et al., 2019). These signals are often used to evaluate various pre-clinical assessment studies and hemodynamic analyzer algorithms, such as assessing pulse wave velocity. The database emulates pulse wave signals of one cardiac cycle of length from 4,374 virtual healthy adults with different ages between 25 and 75 in 10-year increments (six age groups) and heart rate between 66 bpm and 86 bpm, using a sampling frequency of 500 Hz for each signal. Each group has 729 virtual subjects’ pulse waves with distinct cardiac and arterial parameters like arterial stiffness and heart rate within normal ranges. In this study, PPG and BP waveforms at the level of the brachial, radial, and digital arteries were used to create the spectrograms and estimate the cf-PWV.
2.2 Method
The proposed methodology is summarized in Figure 1. A spectrogram is created using pulse wave signals from the in silico data. Then, features are extracted from three different methods and are fed individually into the learning approach stage. The best features were selected to train the model and estimate the cf-PWV value.
2.2.1 Spectrogram generation
The spectrogram is a time-frequency representation used to analyze the change of frequency with respect to time from a given input signal. In this paper, spectrograms are generated using the function spectrogram from MATLAB. It is well known that the selection of parameters involved in the creation of the spectrogram will define the quality of the representation, as shown in (Jablonski and Dziedziech, 2022). For this reason, windows’ parameters, spectrogram’s shape, and overlapping percentage were finely tuned.
Regarding the window type, it has been shown that Hamming and Kaiser’s windows are good options for generating spectrograms from pulse wave signals such as PPG (Zong and Jafari, 2015; Esgalhado et al., 2021). Overlapping percentage values of 0, 60, and 95 for Hamming windows and 0, 61, and 70 for Kaiser, were selected based on the values reported in (Trethewey, 2000) and (Heinzel et al., 2002). However, for the Kaiser window, it is necessary to define an extra parameter called α which changes depending on the overlapping percentage (Heinzel et al., 2002). Values of 0.5, 3, and 5 were used for the 0, 61, and 70 overlapping percentages, respectively. Additionally, the shape selection was made, considering the requirement from the 2D-SCSA feature extraction method to use square images (Kaisserli and Laleg-Kirati, 2014). For this reason, a squared spectrogram of sizes 250, 166, 100, 50, and 20 was used.
Finally, to select the final values, the spectrogram quality coefficients used in (Jablonski and Dziedziech, 2022) were calculated for each combination of parameters, as follows,
where t and f represent the time (rows) and frequency (columns) of the spectrogram, and T and F are the number of time and frequency points respectively. σ represents the standard derivation, μ is the mean. Higher values in these metrics represent a better capacity of the spectrogram to represent the variability of a given signal.
Finally, the combination of parameters with the higher values for the metrics was obtained using the Hamming window, with 0% of overlapping and an image size of 250 × 250 pixels. More details on the parameters tuning and obtained results for all the combinations can be found in the Supplementary Material.
2.2.2 Semi-classical signal analysis method
2.2.2.1 Definition
The semi-classical signal analysis (SCSA) method has been proposed in (Laleg-Kirati et al., 2013) for pulse-shaped signal reconstruction, denoising, and characterization, where the signal is decomposed into a set of signal-dependent adaptive squared eigenfunctions of the Schrödinger operator. The SCSA method has been successfully used for features extraction of blood pressure (BP) and PPG signals by (Laleg-Kirati et al., 2013; Li and Laleg-Kirati, 2021a; Garcia et al., 2022), showing the feasibility of this method to provide useful information on the shape of the input signal which helps to detect morphology changes in the signal. This method has been extended to image representation (Kaisserli and Laleg-Kirati, 2014), denoising (Chahid et al., 2017; Chahid et al., 2018), and feature extraction (Garcia et al., 2022).
Definition 1. Let I (x, y) be a positive real valued square matrix, the image representation I2h of I (x, y) using the 2D-SCSA is defined as follows:
where
and
where Γ is the Gamma function.
2.2.2.2 Numerical computation
The 2D-SCSA requires the computation of eigenvalues and eigenfunctions from a 2D operator, leading to a complex and time-consuming process. To reduce the computational burden, a separation of variables approach has been proposed in (Kaisserli and Laleg-Kirati, 2014) where the standard 1D-SCSA is used for each row and each column; the results are then combined using the following formula: (Kaisserli and Laleg-Kirati, 2014):
where βi,k,h, k = 1, …, Kh and ρj,m,h, m = 1, …, Mh are the eigenvalues for each row and each column respectively, with
2.2.2.3 Parameters selection
The semi-classical parameter h and the parameter γ play crucial roles in the SCSA representation as described in (Laleg-Kirati et al., 2013; Kaisserli and Laleg-Kirati, 2014). When the h value tends to 0, the SCSA reconstruction converges to the original image producing the best result. However, it has been noticed that the number of eigenvalues is restricted by the number of samples (Piliouras, 2020). Even knowing that the SCSA representation improves when h tends to 0, this value cannot be very small as it also depends on the number of samples. In addition, selecting the parameter γ given a specific h is also important since it affects the intensity values of the reconstructed images. It has been found that for small h values, γ tends to increase; in contrast, when h increases, γ tends to decrease.
An appropriate h interval has been proposed in (Piliouras, 2020) for 1D-signals, where a minimum value for h based on the sampling theorem is introduced. This value had been successfully used in (Piliouras, 2020; Li et al., 2021), providing good accuracy for signal representation. This minimum value is defined as:
where Vmax is the maximum value of the input signal and Ts is the sampling period of the images.
In this paper, we propose to extend the idea of using the hmin to image representation. hmin is computed for all the rows (hrmin) and columns (hcmin) in the image, as is shown in Figure 2. However, given that each pixel is affected by the h value taken from columns and rows, the mean between these two values was computed to obtain the h value (hmmin) for each pixel in the image.
FIGURE 2. Computation of the 2D-SCSA’s design parameters h. hrmin and hcmin represent the hmin values extracted from every row and every column of the spectrogram respectively.
To obtain a single h value for the entire image
The value of the γ parameter was selected by a sensitivity analysis. Different values of γ have been tested to maximize the structural similarity index measure (SSIM) and the Peak Signal to Noise ratio (PSNR) between the original spectrogram and the reconstructed one using 2D-SCSA. More information about the obtained results from the sensitivity analysis is presented in the Supplementary Material.
2.2.3 Features extraction
In this study, three different feature methods were computed. Obtaining 36 SCSA-based features, 102 Energy-based features, and 6 Statistic based features for the PPG and BP spectrograms for the Radial, Digital, and Brachial locations. Each feature was fed separately into the feature selection method to finally be combined with the different machine learning algorithms to estimate the cf-PWV values. The features used in this study are shown in Table 1, and with the relevant advantages and disadvantages of the three types of features.
2.2.3.1 SCSA-based features
2D-SCSA features were considered to be the eigenvalues computed from each of the three following eigenvalues matrices obtained after applying the 2D-SCSA on the spectrogram:
1. Matrix composed by the extracted eigenvalues from the rows of the spectrogram.
2. Matrix composed of the extracted eigenvalues from the columns of the spectrogram.
3. Matrix composed by the sum of the matrices above.
The first features calculated were the three first invariants proposed by (Laleg-Kirati et al., 2013), consisting of some momentum of the negative eigenvalues. In this work, we compute the invariants for all three cases above: row eigenvalues, column eigenvalues, and combined eigenvalues:
where
These invariant parameters were used by (Laleg-Kirati et al., 2010; Li and Laleg-Kirati, 2021b; Garcia et al., 2022) for signal processing features extraction and in (Garcia et al., 2022) for image processing features extraction. This shows the feasibility of these invariants to obtain relevant information from pulse wave signals such as BP and PPG.
Furthermore, based on the first three eigenvalues that approximate the general profile of the image (Laleg-Kirati et al., 2013; Li and Laleg-Kirati, 2021a), the mean value of the three first eigenvalues and κ for each matrix were calculated as follows:
We also considered other features as described in (Li et al., 2021) and which consists of the ratio between the first κ of the eigenvalues matrix and
The mean number of eigenvalues Mh obtained in each eigenvalue matrices was used as a feature since this value gives valuable information on signal shape (Li et al., 2021), helping to identify changes in the morphology. Finally, the mean and standard deviation of the κ were used as descriptors of the pixel distribution of the eigenvalues matrices.
2.2.3.2 Energy-based features
Laws’ mask features are standard image processing based features used to measure the “Texture energy” of a group of pixels in an image. This method has been used in the past for feature extraction for biomedical images (Rachidi et al., 2008), speech recognition using spectrograms (Wang, 2014), texture classification (Dash and Jena, 2017), and as a method of segmentation based on the texture presented on the image (Kvyetnyy et al., 2017).
The principle of this method is to estimate the texture features using a set of texture energy transformations (Laws, 1980). This transformation detects the variation within a fixed-size window using different convolution masks that compute the energy of the image. This group of masks is invariant to changes in luminance, contrast, and rotation that allows the detection of textures under different conditions (Laws, 1980; Stockman and Shapiro, 2001).
All the convolution masks used for the energy texture estimation came from the following set of one-dimensional (1-D) kernels of five or three pixels:
• Kernels with length 3
• kernels with length 5
where L (Level) detects the average grey level, E (Edge) extracts edge features, S (Spot) extracts spots, W (Wave) extracts wave features, and R (Ripple) extracts ripples in the image (Laws, 1980). The convolution masks used for the feature extraction were generated by convoluting any vertical one-dimensional vector with a horizontal vector to generate the following 3 × 3 and 5 × 5 filters:
• 3 × 3 filters
• 5 × 5 filters
The images obtained after convolution between each mask and the images should be normalized to make the descriptors contrast-independent. The normalization was made based on the implementation made by (Miroslav and Rodojevi´c, 2007), where all the images were normalized using the image min-max normalization as follows
After the normalization, each outputs Imask were converted to a texture energy image (Emask) by using a moving non-linear window average of absolutes (Rachidi et al., 2008)
Finally, the mean (Eq. 17), standard deviation (Eq. 18), and entropy (Eq. 19) to each of the texture energy images obtained was computed to obtain a measurement of the global energy texture for each mask (Rachidi et al., 2008).
2.2.3.3 Central moment features
Statistical central moments are a set of features used to describe the spread and shape of the pixel’s distribution in an image (Grubbström and Tang, 2006) and are computed as
where μk represents the kth central moment about the mean μ of the spectrogram image I.
These features have been used for different applications where spectrograms are involved such as event classification and inference detection (Dennis et al., 2011; Mulimani and Koolagudi, 2018; Oh and Kim, 2019).
In this work, we extracted a set of features inspired by features used for audio event classification in (Mulimani and Koolagudi, 2018) derived from the second, third, and fourth central moment, described as follows:
where
2.3 Feature selection
The feature selection is a technique used to reduce the number of features by eliminating the irrelevant, redundant, and noisy features to improve the model performance (Kumar and Minz, 2014). In this study, we use the Maximum Relevance—Minimum Redundancy (MRMR) algorithm, which is a feature selection method that chooses
where ρ(f, s) is the Pearson correlation, F(Y, f) is the F-statistic and Y is the target variable to estimate, S the set of selected features,
Finally, the number of features
2.4 Machine learning models
To create the training and testing dataset for supervised machine learning models, the dataset was split into two different groups where the 70% of the total dataset was used for the training set, and the 30% left was used for the testing set. Finally, each of the three different features types were fed into the following machine learning methods: Random forest regression (RF), Gradient Boost Regressor (GBR), multilayer perceptron (MLP), Multiple Linear regression (MLR), and Suppor Vector Regression (SVR).
2.4.1 Model training
A common practice in machine learning to increase the performance of the models is to standardize the features to have mean 0 and variance 1 using the z-score defined as follows,
where
2.4.2 Model evaluation
To evaluate the performance of the models, we used the R-squared (R2) value and the root mean square error (RMSE) between the actual value and the predicted by the model (Bahloul et al., 2021; Garcia et al., 2022).
where μ is a function that evaluates the mean of cf − PWVreal over N subjects.
2.5 Noise addition
To test the performance of the proposed methodology against noisy data, a high-frequency Gaussian white noise was added to the pulse wave signals to simulate the electrical noise found during the recording (Ban and Kwon, 2016). The typical cause for this type of noise is radio, TV, cellular, and distant lightning (Kularatna et al., 2019). The intensity of the noise was defined using the signal-to-noise ratio (SNR) defined as follows:
where Ps and Pn correspond to the power of the signal and Gaussian white noise, respectively (Bahloul et al., 2021; Garcia et al., 2022). The selected noise intensity for the BP signals is 20, 10, and 5 dB based on the values used by (Jin et al., 2021). For the PPG signals, we define the values as 65, 45, and 30 dB based on the values reported by (Maxim integrated, 2017) (Elsamnah et al., 2019).
3 Results
3.1 Noise-free case
This project proposed a novel methodology based on spectrogram representation of the signals to estimate the cf-PWV. Table 3 shows the result for the PPG spectrograms where the MLP and SVR models obtained the best results with a R2 = 0.90 or higher and a RMSE = 0.71 or lower for the three features types, producing the best performance values of R2 = 0.99 and RMSE = 0.16 for the SVR with energy features applied to the brachial location. In contrast, the MLR model shows the worst performance of R2 = 0.73 and RMSE = 1.09 using statistical features extracted from the Brachial location. However, it is important to notice that in the case of the SCSA and energy features, the MLR models presented R2 = 0.90 for higher and RMSE = 0.66 or lower, obtaining the best results of R2 = 0.95 and RMSE = 0.47 for the energy features extracted from the Radial location. The feature type with the better overall performance in the estimation was the energy feature presenting a mean of R2 = 0.97 and RMSE = 0.32 for all the models in the three different locations. Similarly, the result obtained for the BP spectrograms showed the best performance for the SVR and MLP models with at least a R2 = 0.97 and a RMSE = 0.36 for all the features with a maximum difference of 0.02 for the R2 and 0.19 between the features. On the other hand, the worst results were presented for the MLR models with a lower value of R2 = 0.77 and RMSE = 1.00 for the Brachial location using the statistical features. As in the PPG spectrogram, the energy features presented the best overall result with a mean value of R2 = 0.97 and RMSE = 0.36 for all the models in the three different locations. Finally, it is important to notice that all the features obtained a performance of R2 between 0.90 and 0.99 for the BP and PPG spectrograms in the different locations.
These results show the capacity of the spectrogram as signal representation for cf-PWV estimation using noise-free PPG and BP signals from the Radial, Brachial, and Digital locations. It is important to notice that BP shows a better performance with all three different features compared with the results obtained for the PPG where the energy-based features performed better than the other features for the three locations proposed. It is important to notice that the MLR model with SCSA and energy features obtained values of R2 ≥ 0.90 showing a great capacity to obtain a linear relationship between the features extracted from the spectrogram and the cf-PWV. This is a great advantage since these linear models can allow an easier implementation of the model in real-life applications.
3.2 Noisy case
Table 4 shows the results obtained for the different levels of noise. The highest results obtained for each of the nose levels were R2 = 0.98 and RMSE = 0.33 for the SNR = 65 using the MLP models with the energy features applied to the Brachial location. In addition, SCSA features applied in the brachial location presented the best result for the SNR = 45 with values R2 = 0.96 and RMSE = 0.43 using the SVR model, and values of R2 = 0.92 and RMSE = 0.60 for the SNR = 30 using the MLP and SVR models. In contrast, the worst results were obtained in all the cases by the MLP models using statistic features with values of R2 = 0.76 and RMSE = 1.01 for the Radial location with SNR = 0.65, R2 = 0.68 and RMSE = 1.17 for Digital and brachial location with SNR = 0.45, and R2 = 0.65 and RMSE = 1.23 for Digital and brachial location with SNR = 0.30. For the BP spectrogram, The best values obtained were applied for Radial locations with values of R2 = 0.98 and RMSE = 0.26 for the SNR = 20 using the GBR model with the energy features, R2 = 0.98 and RMSE = 0.32 for the SNR = 10 using the SVR model with the SCSA features, and R2 = 0.97 and RMSE = 0.38 for the SNR = 5 using the SVR model with the SCSA features. In contrast, similar to the PPG spectrograms, the lower results were obtained for the MLR models using the statistical features obtaining values of R2 = 0.70 and RMSE = 1.14 for SNR = 20, R2 = 0.68 and RMSE = 1.18 for SNR = 10, and R2 = 0.69 and RMSE = 1.16 for SNR = 5.
These results show that even with different levels of white gaussian noise added to the original signals, the proposed methodology obtained R2 ≥ 0.90 for the PPG signals with the different levels of noise, using the MLP and SVR models for the different locations. In contrast, similar to the case of noise-free, the BP signals obtained better results, presenting an R2 ≥ 0.90 using the MLP model for the three different features in the proposed locations. Furthermore, in the case of BP signals, the MLR model obtained accurate results for the cf-PWV.
4 Discussion
This study investigated a novel methodology to estimate the cf-PWV based on the application of the spectrogram representation of single PPG or BP signals extracted from a peripheral location. The use of the spectrogram representation for the analysis of biomedical signals such as PPG had been studied before as input for data-driven approaches like the classification of peripheral diseases by (Allen et al., 2021), or biometric recognition, (Donida Labati et al., 2021). For this reason, in this project, the use of the spectrogram from BP or PPG signals to estimate the cf-PWV values is investigated as a novel methodology to take advantage of the frequency and temporal information encoded in the spectrogram matrix.
In this project, three feature types based on the Schrodinger spectrum, the image’s energy texture, and the image’s statistical distribution were combined with different machine learning algorithms to estimate the carotid-to-femoral pulse wave velocity (cf-PWV). The results for the noisy-free signals presented a mean value of R2 = 0.92 and RMSE = 0.54 for the PPG spectrograms and R2 = 0.95 and RMSE = 0.44 for the BP spectrograms for the noisy-free signals extracted from the Radial, Digital and Brachial location, showing the spectrogram’s capacity to encode valuable information that can be extracted to estimate the cf-PWV presenting. The energy-based features using Law’s masks presented the best performance for the PPG and BP signals with values of R2 = 0.99 and RMSE = 0.16 for the SVR applied to PPG spectrogram the Brachial location and R2 = 0.99 and RMSE = 0.17 for the SVR applied to BP spectrogram the Brachial location. Nevertheless, it is important to recall that all three different feature types obtained one or more models with at least a R2 = 0.90 and RMSE = 0.64 for all the locations. These results show the feasibility of the different types of features to extract valuable information from a spectrogram created using noisy-free signals to estimate the cf-PWV values.
In contrast, for the noisy cases, the PPG spectrograms presented values of R2 = 0.92 and RMSE = 0.56 for the SNR = 65, R2 = 0.88 and RMSE = 0.70 for the SNR = 45, and R2 = 0.82 and RMSE = 0.84 for the SNR = 30, showing a decrease in the performance for the SNR = 45 and SNR = 30 cases of 0.04 and 0.10 for the R2, and 0.16 and 0.30 for the RMSE. It is important to notice that the SCSA features presented the best overall performance for the noisy cases, obtaining the best mean results value of R2 = 0.94 and RMSE = 0.50 for MLP and SVR models applied for the different noisy PPG spectrograms and R2 = 0.97 and RMSE = 0.34 for the SVR model applied for the noisy BP spectrograms, obtaining in a value of R2 = 0.90 or higher for each of the noise level presented in this study. However, it is important to notice that the SCSA method presented a high computational complexity to compute the features for each combination of signals (PPG or BP) and location (Radial, Digital, Brachial). For this reason, it is essential to develop future works to reduce this complexity to extend this method for real applications where time and computational cost play an essential role.
Previous studies have been using machine learning or deep learning models to estimate the cf-PWV based on PPG or BP signals (Tavallali et al., 2015; Jin et al., 2021; Li et al., 2022). However, a direct comparison between our work and many of the previous studies cannot be made given that these studies use real data for the estimation, in contrast with the in silico data used in this study. Nevertheless, the study made by (Jin et al., 2021) used the same Blood Pressure in silico signals from the Radial location for the noisy case. In this study, the authors proposed the use of an LSTM deep-learning model to estimate the cf-PWV, obtaining an R2 ≥ 0.98 and a RMSE ≤ 0.24. In contrast, the proposed method obtained a similar performance of an R2 ≥ 0.97 and a RMSE ≤ 0.38.
It is important to notice that the model parameters used for the generation of the in silico pulse wave signals were changed with age, allowing the investigation of the effects of aging in the estimation of cf-PWV. Previous studies had demonstrated that there could exist a decrease in the performance of the estimation for high PWV values associated with the sensitivity to variations in the transit time during the cf-PWV estimation (Li et al., 2022; Jin et al., 2021). This same behavior was noticed in this project for some of the models where there is an increase of the error estimation for higher values of cf-PWV (usually higher than 9
Even if these results are promising, it is crucial to consider the different limitations presented in this project. The principal limitation is the use of in silico data rather than real data collected from a specific human population. Nevertheless, the in silico data allows us to achieve an initial validation of the proposed methodology, whose results will permit us to proceed with the use of real data. Another limitation of this project is the spectrogram representation made using one-cycle signals; given that the real PPG and BP signals present multiple cycles, this will change the spectrogram image obtained, and this could make the features computed in this work may not work for the multi-cycle representation. However, future work will be done toward solving this limitation to validate the proposed methodology to obtain a more realistic analysis of the feasibility of the spectrogram to estimate the cf-PWV.
5 Conclusion
This paper investigates a new methodology to estimate cf-PWV based on the spectrogram representation obtained from BP or PPG signals taken from peripheral signals using machine learning models. The proposed approach incorporates three different types of features to probe the feasibility of the spectrogram to accurately estimate the cf-PWV. The results prove that the three different methods could obtain good performance, where the energy features showed the best performance for all the models without noise and the SCSA presented the best results against the noise levels proposed in the study. In the future, further validation of the proposed methodology in real human signals needs to be conducted to overcome the limitation of using in silico data with one cardiac cycle. This proposed method may be implemented for personal healthcare applications upon successful clinical validation. Also, it can open the door for future investigations of new machine learning methods and feature extraction techniques to improve the estimation of cf-PWV based on spectrogram representation. The code is available at https://github.com/EMANG-KAUST/Spectrogram_AS_Frontiers. We welcome developments to the existing code or contributions of new algorithms for inclusion in future versions of the arterial stiffness prediction platform.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Author contributions
The authors confirm their contribution to the paper as follows: study conception and design: MB, JV, and T-ML-K; data collection: MB and JV; simulations and coding: JV, analysis and interpretation of results: MB, JV, and T-ML-K; draft manuscript preparation: MB, JV, and T-ML-K. All authors reviewed the results and approved the final version of the manuscript.
Funding
This work has been supported by the King Abdullah University of Science and Technology (KAUST), Saudi Arabia, Base Research Fund (BAS/1/1627- 01-01) to Taous Meriem Laleg and KAUST.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2023.1100570/full#supplementary-material
Footnotes
1https://peterhcharlton.github.io/pwdb/index.html
References
Allen, J., Liu, H., Iqbal, S., Zheng, D., and Stansby, G. (2021). Deep learning-based photoplethysmography classification for peripheral arterial disease detection: A proof-of-concept study. Physiol. Meas. 42, 054002. doi:10.1088/1361-6579/abf9f3
Bahloul, M. A., Chahid, A., and Laleg-Kirati, T.-M. (2021). “A multilayer perceptron-based carotid-to-femoral pulse wave velocity estimation using ppg signal,” in 2021 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI IEEE), 1–6.
Ban, D., and Kwon, S. (2016). “Movement noise cancellation in ppg signals,” in 2016 IEEE International Conference on Consumer Electronics (ICCE) (Institute of Electrical and Electronics Engineers Inc), 47–48. doi:10.1109/ICCE.2016.7430517
Blacher, J., Guerin, A. P., Pannier, B., Marchais, S. J., Safar, M. E., and London, G. M. (1999). Impact of aortic stiffness on survival in end-stage renal disease. Circulation 99, 2434–2439. doi:10.1161/01.cir.99.18.2434
Chahid, A., Serrai, H., Achten, E., and Laleg-Kirati, T.-M. (2018). “A new roi-based performance evaluation method for image denoising using the squared eigenfunctions of the Schrödinger operator,” in 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 5579–5582. doi:10.1109/EMBC.2018.8513615
Chahid, A., Serrai, H., Achten, E., and Laleg-Kirati, T.-M. (2017). “Adaptive method for mri enhancement using squared eigenfunctions of the Schrödinger operator,” in 2017 IEEE Biomedical Circuits and Systems Conference (BioCAS), 1–4. doi:10.1109/BIOCAS.2017.8325107
Charlton, P. H., Harana, J. M., Vennin, S., Li, Y., Chowienczyk, P. J., and Alastruey, J. (2019). Modeling arterial pulse waves in healthy aging: A database for in silico evaluation of hemodynamics and pulse wave indexes. Am. J. Physiology - Heart Circulatory Physiology 317, H1062–H1085. doi:10.1152/ajpheart.00218.2019
Choi, C. U., Park, E. B., Suh, S. Y., Kim, J. W., Kim, E. J., Rha, S.-W., et al. (2007). Impact of aortic stiffness on cardiovascular disease in patients with chest pain: Assessment with direct intra-arterial measurement. Am. J. Hypertens. 20, 1163–1169. doi:10.1016/j.amjhyper.2007.07.004
Dash, S., and Jena, U. R. (2017). Multi-resolution laws’ masks based texture classification. J. Appl. Res. Technol. 15, 571–582. doi:10.1016/j.jart.2017.07.005
Dennis, J., Tran, H. D., and Li, H. (2011). Spectrogram image feature for sound event classification in mismatched conditions. IEEE Signal Process. Lett. 18, 130–133. doi:10.1109/LSP.2010.2100380
Donida Labati, R., Piuri, V., Rundo, F., Scotti, F., and Spampinato, C. (2021). “Biometric recognition of ppg cardiac signals using transformed spectrogram images,” in Pattern recognition. ICPR international workshops and challenges. Editors A. Del Bimbo, R. Cucchiara, S. Sclaroff, G. M. Farinella, T. Mei, M. Bertiniet al. (Cham: Springer International Publishing), 244–257.
Elgeldawi, E., Sayed, A., Galal, A. R., and Zaki, A. M. (2021). Hyperparameter tuning for machine learning algorithms used for Arabic sentiment analysis. Informatics 8, 79. doi:10.3390/informatics8040079
Elsamnah, F., Bilgaiyan, A., Affiq, M., Shim, C. H., Ishidai, H., and Hattori, R. (2019). Comparative design study for power reduction in organic optoelectronic pulse meter sensor. Biosensors 9, 48. doi:10.3390/bios9020048
Esgalhado, F., Fernandes, B., Vassilenko, V., Batista, A., and Russo, S. (2021). The application of deep learning algorithms for ppg signal processing and classification. Computers 10, 158. doi:10.3390/computers10120158
Gade, J.-L., Thore, C.-J., Sonesson, B., and Stålhand, J. (2021). In vivo parameter identification in arteries considering multiple levels of smooth muscle activity. Biomechanics Model. Mechanobiol. 1–13, 1547–1559. doi:10.1007/s10237-021-01462-4
Garcia, J. M. V., Bahloul, M. A., and Laleg-Kirati, T.-M. (2022). “A multiple linear regression model for carotid-to-femoral pulse wave velocity estimation based on schrodinger spectrum characterization,” in 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 143–147. doi:10.1109/EMBC48229.2022.9871031
Grubbström, R. W., and Tang, O. (2006). The moments and central moments of a compound distribution. Eur. J. Operational Res. 170, 106–119. doi:10.1016/j.ejor.2004.06.012
Heinzel, G., Rüdiger, A., and Schilling, R. (2002). Spectrum and spectral density estimation by the discrete fourier transform (dft), including a comprehensive list of window functions and some new flat-top windows.
Jablonski, A., and Dziedziech, K. (2022). Intelligent spectrogram – A tool for analysis of complex non-stationary signals. Mech. Syst. Signal Process. 167, 108554. doi:10.1016/j.ymssp.2021.108554
Jin, W., Chowienczyk, P., and Alastruey, J. (2021). Estimating pulse wave velocity from the radial pressure wave using machine learning algorithms. Plos one 16, e0245026. doi:10.1371/journal.pone.0245026
Kaisserli, Z., and Laleg-Kirati, T.-M. (2014). Image representation and denoising using squared eigenfunctions of schrodinger operator. arXiv preprint arXiv:1409.3720.
Kim, H.-L., and Kim, S.-H. (2019). Pulse wave velocity in atherosclerosis. Front. Cardiovasc. Med. 6, 41. doi:10.3389/fcvm.2019.00041
Kularatna, N., Ross, A. S., Fernando, J., and James, S. (2019). Background to surge protection. Elsevier. doi:10.1016/b978-0-12-811664-7.00001-x
Kumar, V., and Minz, S. (2014). Feature selection: A literature review. SmartCR 4, 211–229. doi:10.6029/smartcr.2014.03.007
Kvyetnyy, R., Sofina, O., Olesenko, A., Komada, P., Sikora, J., Kalizhanova, A., et al. (2017). “Method of image texture segmentation using laws’ energy measures,” in In Photonics Applications in Astronomy, Communications, Industry, and High Energy Physics Experiments 2017 (SPIE), 10445, 1784–1792.
Laleg-Kirati, T. M., Crépeau, E., and Sorine, M. (2013). Semi-classical signal analysis. Math. Control, Signals, Syst. 25, 37–61. doi:10.1007/s00498-012-0091-1
Laleg-Kirati, T. M., Médigue, C., Papelier, Y., Cottin, F., and Van De Louw, A. (2010). Validation of a semi-classical signal analysis method for stroke volume variation assessment: A comparison with the PiCCO technique. Ann. Biomed. Eng. 38, 3618–3629. doi:10.1007/s10439-010-0118-z
Li, P., and Laleg-Kirati, T. M. (2021a). Central blood pressure estimation from distal PPG measurement using semiclassical signal analysis features. IEEE Access 9, 44963–44973. doi:10.1109/ACCESS.2021.3065576
Li, P., and Laleg-Kirati, T. (2021b). Signal denoising based on the Schrödinger operator’s eigenspectrum and a curvature constraint. IET Signal Process. 15, 195–206. doi:10.1049/sil2.12023
Li, P., Piliouras, E., Poghosyan, V., AlHameed, M., and Laleg-Kirati, T.-M. (2021). “Automatic detection of epileptiform eeg discharges based on the semi-classical signal analysis (scsa) method,” in 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC IEEE), 928–931.
Li, Y., Xu, Y., Ma, Z., Ye, Y., Gao, L., and Sun, Y. (2022). An xgboost-based model for assessment of aortic stiffness from wrist photoplethysmogram. Comput. Methods Programs Biomed. 226, 107128. doi:10.1016/j.cmpb.2022.107128
Mackenzie, I., Wilkinson, I., and Cockcroft, J. (2002). Assessment of arterial stiffness in clinical practice. QJM Int. J. Med. 95, 67–74. doi:10.1093/qjmed/95.2.67
Matsushita, K., Ding, N., Kim, E. D., Budoff, M., Chirinos, J. A., Fernhall, B., et al. (2019). Cardio-ankle vascular index and cardiovascular disease: Systematic review and meta-analysis of prospective and cross-sectional studies. J. Clin. Hypertens. 21, 16–24. doi:10.1111/jch.13425
Mattace-Raso, F. U., van der Cammen, T. J., Hofman, A., van Popele, N. M., Bos, M. L., Schalekamp, M. A., et al. (2006). Arterial stiffness and risk of coronary heart disease and stroke: The rotterdam study. Circulation 113, 657–663. doi:10.1161/CIRCULATIONAHA.105.555235
Mensah, G. A., Roth, G. A., and Fuster, V. (2019). The global burden of cardiovascular diseases and risk factors: 2020 and beyond.
Miroslav, G. L., and Rodojevic, R. (2007). Texture segmentation: Co-Occurrence matrix and laws’ texture masks methods.
Mulimani, M., and Koolagudi, S. G. (2018). “Acoustic event classification using spectrogram features,” in TENCON 2018 - 2018 IEEE Region 10 Conference, 1460–1464. doi:10.1109/TENCON.2018.8650444
Oh, M., and Kim, Y.-H. (2019). Statistical approach to spectrogram analysis for radio-frequency interference detection and mitigation in an l-band microwave radiometer. Sensors 19, 306. doi:10.3390/s19020306
Piliouras, E. (2020). Contributions to the semi-classical signal analysis method: The arterial stiffness assessment case study. doi:10.25781/KAUST-NVCZ0
Rachidi, M., Chappard, C., Marchadier, A., Gadois, C., Lespessailles, E., and Benhamou, C. (2008). “Application of laws’ masks to bone texture analysis: An innovative image analysis tool in osteoporosis,” in 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 1191–1194. doi:10.1109/ISBI.2008.4541215
Rajzer, M. W., Wojciechowska, W., Klocek, M., Palka, I., Brzozowska-Kiszka, M., and Kawecka-Jaszcz, K. (2008). Comparison of aortic pulse wave velocity measured by three techniques: Complior, sphygmocor and arteriograph. J. Hypertens. 26, 2001–2007. doi:10.1097/HJH.0b013e32830a4a25
Shokawa, T., Imazu, M., Yamamoto, H., Toyofuku, M., Tasaki, N., Okimoto, T., et al. (2005). Pulse wave velocity predicts cardiovascular mortality findings from the Hawaii-los angeles-hiroshima study. Circulation J. 69, 259–264. doi:10.1253/circj.69.259
Siam, A. I., Sedik, A., El-Shafai, W., Elazm, A. A., El-Bahnasawy, N. A., El Banby, G. M., et al. (2021). Biosignal classification for human identification based on convolutional neural networks. Int. J. Commun. Syst. 34. doi:10.1002/dac.4685
Stockman, G., and Shapiro, L. G. (2001). Computer vision. Upper Saddle River, NJ, USA: Prentice Hall PTR.
Tavallali, P., Hou, T. Y., Rinderknecht, D. G., and Pahlevan, N. M. (2015). On the convergence and accuracy of the cardiovascular intrinsic frequency method, 2. Royal Society open science, 150475.
Tavallali, P., Razavi, M., and Pahlevan, N. M. (2018). Artificial intelligence estimation of carotid-femoral pulse wave velocity using carotid waveform. Sci. miscs 8 (1). doi:10.1038/s41598-018-19457-0
Trethewey, M. W. (2000). Window and overlap processing effects on power estimates from spectra. Mech. Syst. Signal Process. 14, 267–278. doi:10.1006/mssp.1999.1274
van Sloten, T. T., Schram, M. T., van den Hurk, K., Dekker, J. M., Nijpels, G., Henry, R. M., et al. (2014). Local stiffness of the carotid and femoral artery is associated with incident cardiovascular events and all-cause mortality: The hoorn study. J. Am. Coll. Cardiol. 63, 1739–1747. doi:10.1016/j.jacc.2013.12.041
Wang, K.-C. (2014). The feature extraction based on texture image information for emotion sensing in speech. Sensors 14, 16692–16714. doi:10.3390/s140916692
Willum Hansen, T., Staessen, J. A., Torp-Pedersen, C., Rasmussen, S., Thijs, L., Ibsen, H., et al. (2006). Prognostic value of aortic pulse wave velocity as index of arterial stiffness in the general population. Circulation 113, 664–670. doi:10.1161/CIRCULATIONAHA.105.579342
Zhao, Z., Anand, R., and Wang, M. (2019). “Maximum relevance and minimum redundancy feature selection methods for a marketing machine learning platform,” in 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA), 442–452. doi:10.1109/DSAA.2019.00059
Keywords: pulse wave velocity, spectrogram, PPG, distal blood pressure, machine learning (ML), image processing, semi-classical signal analysis
Citation: Vargas JM, Bahloul MA and Laleg-Kirati T-M (2023) A learning-based image processing approach for pulse wave velocity estimation using spectrogram from peripheral pulse wave signals: An in silico study. Front. Physiol. 14:1100570. doi: 10.3389/fphys.2023.1100570
Received: 16 November 2022; Accepted: 31 January 2023;
Published: 03 March 2023.
Edited by:
Abdul Wahab, Nazarbayev University, KazakhstanReviewed by:
Haipeng Liu, Coventry University, United KingdomPranjali Gajbhiye, Nirvesh Enterprises Private Limited, India
Copyright © 2023 Vargas, Bahloul and Laleg-Kirati. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Taous-Meriem Laleg-Kirati, taousmeriem.laleg@kaust.edu.sa