- 1Department of Mechanical Engineering, University College London, London, United Kingdom
- 2Barts Heart Centre, Barts Health NHS Trust, London, United Kingdom
- 3Centre for Cardiovascular Medicine and Devices, William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
- 4Medis Medical Imaging Systems BV, Leiden, Netherlands
- 5NIHR Barts Biomedical Research Centre, Queen Mary University of London, London, United Kingdom
- 6Satsuma Lab, Centre for Medical Image Computing, University College London, London, United Kingdom
- 7UCL Respiratory, University College London, London, United Kingdom
There is increasing evidence that coronary artery wall shear stress (WSS) measurement provides useful prognostic information that allows prediction of adverse cardiovascular events. Computational Fluid Dynamics (CFD) has been extensively used in research to measure vessel physiology and examine the role of the local haemodynamic forces on the evolution of atherosclerosis. Nonetheless, CFD modelling remains computationally expensive and time-consuming, making its direct use in clinical practice inconvenient. A number of studies have investigated the use of deep learning (DL) approaches for fast WSS prediction. However, in these reports, patient data were limited and most of them used synthetic data generation methods for developing the training set. In this paper, we implement 2 approaches for synthetic data generation and combine their output with real patient data in order to train a DL model with a U-net architecture for prediction of WSS in the coronary arteries. The model achieved 6.03% Normalised Mean Absolute Error (NMAE) with inference taking only 0.35 s; making this solution time-efficient and clinically relevant.
1 Introduction
Coronary Artery Disease (CAD) is the leading cause of death globally and is associated with approximately 9 million deaths worldwide (Khan et al., 2020). Local hemodynamic forces distribution and in particular wall shear stress (WSS) seems to play a pivotal role in the initiation of the atherosclerotic process and its evolution. Measurement of WSS can predict vulnerable plaques and adverse cardiovascular events (Stone et al., 2018). Blood vessels with large diameters and low flow are exposed to low WSS, whereas vessels with a small lumen and high flow are exposed to high WSS (Cecchi et al., 2011). As in vivo experimental measurement of WSS is impractical, computational “measurement” using computational fluid dynamics (CFD) has been broadly used to assess coronary physiology. Whilst the efficiency of CFD modelling has improved over the last decades, these models are still computationally expensive and time-consuming, limiting their direct use in clinical practice (Ferdian et al., 2022). To address this challenge, several studies have investigated the use of deep learning (DL) approaches for fast WSS prediction.
DL has commonly been used in the medical imaging domain for fast data analysis. An example in the field of flow modelling is in image-based estimation of boundary conditions for blood flow problems, assisted by DL (Arzani et al., 2022). DL has also been used for biomedical problems in various stages of numerical modelling; some of which include automatic generation and quality assessment of meshes (Zhang et al., 2020).
Moreover, DL has also been used to predict numerical simulation results. For instance, Gharleghi et al. (2022) applied a DL solution to predict time-varying WSS in the left main coronary bifurcation while Jordanski et al. (2018) used DL methods and in particular multivariate linear regression, multilayer perceptron and Gaussian conditional random fields in order to predict WSS distribution at the carotid bifurcation. Furthermore, Suk et al. (2022) utilised mesh convolutional neural networks in order to predict WSS in a 3D coronary artery with and without bifurcation using synthetically generated coronary arteries with stenosis.
These models are mostly inspired by fully convolutional networks (FCN) in combination with autoencoders (encoder-decoder models) that were initially developed for image segmentation (Ronneberger et al., 2015). The architectures that have become well known are U-net and V-net, with U-net being the most adopted (Shen et al., 2017). Similar to autoencoders, a U-net architecture consists of one part that contracts to capture global context, followed by a second part to expand and therefore enable localisation. In the study of Gharleghi the application of U-net for DL- WSS prediction et al. enabled WSS estimation of coronary left main stem bifurcation with a normalised mean absolute error (Gharleghi et al., 2020) of 10.38% (with 0.56% std.), based on 3,429 training data sets including patient-specific (127) and synthetic data (3,302).
However, DL-based prediction of WSS in stenosed coronary arteries using patient-specific geometries has not widely been studied, despite the fact that these are potentially highly clinically-relevant analyses. An earlier study used multi-layer perceptrons, multivariate linear regression, and convolutional neural networks to generate WSS values from 2,000 patient-based but idealized coronary artery geometry (Su et al., 2020). In this paper, for the first time, we apply a U-net-based DL prediction method to extract WSS in real patients’ stenosed coronary arteries. It is known that the training of a U-net, similar to other deep learning models, relies on a large dataset. As such, the implementation of data augmentation methods in advance can be used in order to learn effectively from very few annotated data samples. As patient data for training is limited in this study, we used simple methods to generate synthetic data for training and evaluate method’s efficacy.
2 Materials and methods
2.1 Patient data
2.1.1 Data source and patient characteristics
Stenosed coronary artery geometry and centreline data derived from X-ray angiograms and CFD simulation results were obtained from our previous study (Tufaro et al., 2021). Fifty vessels from the study cohort were randomly extracted for the present analysis and their geometries are displayed in Supplementary Figure S7. The original study included patients that had a coronary angiogram for clinical purposes between January 2012 and June 2017 from three cardiac centres in the United Kingdom: Barts Heart Centre (London), Essex Cardiothoracic Centre (Basildon) and Royal Free Hospital (London). The dataset consisted of patients who underwent a coronary angiography and had at least one intermediate atherosclerotic lesion with a fractional flow reserve (FFR) of 0.81–0.85. Exclusion criteria included an ambiguous culprit lesion in the context of an acute coronary syndrome presentation, lesions at the edge of a stent (
2.1.2 Computational fluid dynamics data
CFD simulations were conducted based on the 3D vessel geometries reconstructed from 3D QCA. The corresponding pressure and WSS were calculated over the lumen-wall interface of each patient, using the patient-specific inflow condition, itself estimated from the velocity of the contrast agent (derived from the length of the model, the time required for the contrast to fill the vessel, and the cine frame rate) and assuming steady state. Details of CFD modelling has been presented elsewhere (Tufaro et al., 2021). The anatomical and haemodynamic features of the vessels are summarised in Table 1.
The anatomical and CFD data were extracted over the lumen wall surface, resampled on a rectangular grid that has 36 data points circumferentially and segments at 1.5 mm increments longitudinally (i.e., 24–77 segments). This was conducted using a custom-made MATLAB code. The local anatomical feature map of each patient can be used as input in training and prediction of a model, whilst the CFD-derived variables (pressure and WSS) are the outputs. This paper focuses on the prediction of WSS only, because pressure drop prediction has not only been conducted more thoroughly using ML (Farajtabar et al., 2021; Fossan et al., 2021; Pajaziti et al., 2023), but can also be calculated reliably using reduced-order 0D models (Schrauwen et al., 2014).
We chose to use 7 input features from the original CFD data, as outlined in Table 2. Centreline-based features were mapped to the rectangular grid on the lumen border, i.e., points on the lumen border on one cross-section have the same centreline-based features. Going forward, we calculate additional morphological features and standardised the data format as described in the Feature Engineering section.
Table 2. Patient data features considered in DL model. Centreline-based features are mapped to corresponding locations on the lumen border.
2.1.3 Feature engineering
In addition to the features available as part of the original dataset, we calculated a variety of geometrical features that may improve the predictive power of the model. These features include the polar coordinates of the wall, centreline curvature, distance along the centreline, tangential and inner curvature vectors of the centreline. Each new feature is calculated as follows:
The final set of input includes the original 7 features with 10 engineered features, 17 features in total.
2.1.4 Data normalisation and imputation
As the cardiac anatomy varies between individuals, the various coronary arteries all of which have different contours and shapes were analysed together. As their Cartesian coordinates would not be expected to mapped onto each other, a common feature space had to be created. To make the data consistent, the proximal end of each vessel centreline was first shifted onto the origin (0, 0, 0). The vessels were then rotated to align the global centreline vector (i.e., the vector connecting the first and last point of the centreline) with the global Z-axis. This was applied onto its coordinates such that all geometries ultimately align in the same plane.
Furthermore, the number of slices for each patient ranged from 24 to 77. The training data needs to conform to a consistent shape for all patients. Accommodating the lowest common denominator would lead to a truncation and therefore loss of critical patient data. We therefore interpolate the patient data to the maximum number of slices (77 slices). Nonetheless, a U-net shaped architecture with pooling requires an even number for this dimension, thus we use 76 slices per patient. The increments at which the coronaries are sliced comprises in the range of 0.24–0.66 mm.
The structure of the data that will be used for the DL model becomes an array in the shape (
Figure 1. Data structure showing the data scanning direction: Starting point at
2.2 Synthetic data generation
Deep learning models typically need many samples for training. This is because a model will need to learn the weights for numerous parameters and uses gradient-based optimization to converge to a global optimum. When the model attempts to fit data patterns, it may learn random noise if a dataset is small. This causes what is known as overfitting and has as a consequence the inability of the model to generalise well. We have conducted a practical experiment to show that as the training data size increases, model loss decreases (refer to Supplementary Figure S9).The need for larger realistic datasets, and particularly those that are more private/secure (obfuscate real patient data), have driven studies into the generation of synthetic data. It is important that the synthetic data follows the underlying distribution of the real data and captures correlations between features in order to be plausible. As yet, there are no public datasets of arterial CFD that can be used for training. Since in our study we have only 50 patients in scope, it is essential to augment the training set by generating synthetic data.
In machine learning, and particularly in the case of images, there are many simple approaches that can be utilised to create a larger sample. These methods are referred to as data augmentation and involve transforming an image via shifting (horizontally or vertically), flipping (horizontally or vertically), rotating (clockwise or anti-clockwise), zooming in or out, and changing the brightness (Chlap et al., 2021). These methods can create more variation in the data and artificially expand the dataset.
The use of flipping as an approach of data augmentation is commonly seen in medical image processing (Cossio, 2023; Perez and Wang, 2017). Nishio et al. applied random 10° rotations, translations, and horizontal flipping to chest X-ray radiography (CXR) images in order to improve the accuracy of their CNN model for COVID-19 diagnosis (Nishio et al., 2020). Schmid et al. (2023) propose the use of statistical shape and intensity models (SSIM) to generate augmented CT images of hip bones including flipping, shifting and rotation. Although the reversal of inlet and outlet may not be physiologically representative, we examined the ratio of inlet and outlet radii, which is 0.86
Inspired by these approaches, we modified the geometry of the coronary artery and running the CFD analysis in ANSYS. In that way, we were able to create realistic synthetic data by making 3 modifications to the original patient data. After implementing the following modifications listed, 550 synthetic patients were created that were used for training.
2.3 Deep learning model and training
In order to train the model, we split the synthetic dataset generated. Of the total 600 patient dataset (50 real + 550 synthetic), 80% was used for training (40 real + 440 synthetic generated from those) and 20% was used as a test set (10 real patients). The validation set, still used as part of the training process for hyperparameter tuning, is automatically created once the training begins and comprises 20% of the training set (96 real/synthetic data mixture). It is important to note that the train/test split is completed at the real patient level, such that only synthetic data associated with the 40 real patients (440 patients) can be used for training. This is done in order to prevent leakage of information from the training set into the test set. In other words, the model has not seen any information similar to the test patients, not even their synthetic data. Once all data creation and pre-processing was complete, the neural network architecture was designed.
We adopted and modified the neural network architecture used in Gharleghi et al. (2020) by removing the concatenation with global features such as bifurcation angle, which is not relevant for our use case. We also added a spatial dropout layer in order to reduce overfitting of the model. The final neural network can be categorised as a U-Net and had an architecture shown in Figure 2. This network uses average pooling to scale the input data down to half of its resolution. This is done twice, leading to a quarter representation of the original dataset. The data is then passed through 2 convolutional layers where a 3 × 3 convolution was applied, followed by up-sampling and concatenation with the higher resolution data. Adam optimizer was then used in addition to a 10% spatial dropout as a regularization technique. The activation function selected was ReLU such that the output is constrained to a positive value.
As the loss function, we selected the mean squared error (MSE) although other similar studies in the literature, including Gharleghi et al. (2020) and Jordanski et al. (2018), typically adopt the mean absolute error (MAE). The MSE was calculated as per Eq. 3, where
In the case of our patient dataset, we are investigating patients with stenosis where WSS is significantly elevated. Our choice of MSE is to better predict the peak WSS, as it incurs a higher penalty when mismatched. To highlight the importance of this, we also trained the model using MAE for comparison. The model was trained for 1,000 epochs. The computational resource utilised was an NVIDIA T4 Tensore Core GPU with 52GB RAM. Training time is approximately 40 min on average. Increasing infrastructure resources can lead to faster model training and inference.
2.4 Model evaluation and investigation
The deep learning model was assessed for WSS prediction accuracy. We implement k-fold cross-validation with 5 folds, where the DL-based prediction of WSS for the 10 patient data left aside in Section 2.3 are compared against their original CFD-based WSS, over 5 repeated training
DL models are typically considered as a black box models, and the associations between particular inputs and outputs have not been formally investigated. However, as our work also aims to understand the mechanistic insight behind WSS profiles, explainability (xAI) techniques were used to quantitatively assess feature importance on a trained model. Many techniques exist that have been used in DL models such as Shapley values (Lundberg et al., 2017), however as this approach is computationally expensive, and the model internals are available, we opted for the integrated gradients approach.
Integrated Gradients is a common technique for explaining differentiable models such as neural networks. It is based on two key properties: implementation invariance and sensitivity. It works by creating a straight path between a reference point (typically zeros) and the inputs to the model. By partitioning and interpolating the path, the model can compute predictions at the different partition points. The gradient information at the points of this path are calculated, making it computationally efficient (Holzinger et al., 2022). The intuition is that the gradient in the points where a model’s predictions have flattened out are zero and therefore do not contain information that contributes to the explanation. With this process, the significance of each input feature in the prediction of output (i.e., WSS) can be estimated. In order to apply this method on our deep learning model, we utilise the Innvestigate library1 (Alber et al., 2018).
3 Results
3.1 Overall model assessment
The results of the MSE on the test set are shown in Table 3. The mean absolute error (MAE) is also displayed to show how different the absolute value of the prediction is from the gold-standard, CFD-based WSS for the same patients. In the literature, the normalised mean absolute error (NMAE) is typically reported, thus we include this metric in order to provide comparison with the other state-of-the-art methods. The NMAE can be calculated by either dividing the MAE by the mean or by the difference between the maximum and minimum WSS. For this paper, we divided by max-min, where the difference in WSS is 33.33 Pa.
Table 3. Global model error evaluated for 10 test patients. The two groups of error values are based on different loss functions (MSE and MAE) used in training, and the error is also assessed based on both MSE and MAE. Summary statistics are provided for training patients for comparison (
As previously mentioned, the model training with our data took on average 40 min. This means that it took less than 2.5 s per epoch; not impractically long in the current form, yet it can be accelerated even further in the future with additional infrastructure resources. Model training results are presented in the Supplementary Table S4. Prediction time was approximately 35 milliseconds, which is many orders of magnitude faster than the CFD processing time, which ranges from 20 min to approximately 3 h in our cases. Furthermore, the model was able to predict the WSS on the test set to a NMAE of 6.03% with a standard deviation of 0.47% when normalised by the difference between the maximum and the minimum WSS.
In order to show correlation between the CFD-based WSS and the DL model prediction, a Bland-Altman plot was generated for the test set and shown in Figure 3. Here, the minimum, maximum, and mean predominant WSS in every 3-mm vascular segments were used. Briefly, predominant WSS is defined as a moving average of WSS within a window around a point of interest, which has a size of
Figure 3. Bland-Altman plot for min (left), mean (centre) and max (right) predominant WSS in 3-mm segments along the vessels.
3.2 Patient-level WSS prediction
By scanning along the slices of data in the direction depicted by Figure 1, we can plot the WSS profile for a particular patient. Figure 4 illustrates the WSS profile for a patient in more detail, the DL-predicted WSS values are plotted over the wall surface from proximal (left) to distal (right) and across the circumference of each cross-section.
Figure 4. WSS profile for one patient with predictions from MAE (red) vs. MSE (green) trained models.
The plotted data display the output of the model trained with MAE as the loss (red) and the MSE as the loss (green), along with the ground truth CFD-based WSS that are displayed in blue. It is clear that the model trained by MSE predicts WSS that fits better to the ground truth. Although the MAE result had a low overall error, it did not capture the local variations of WSS as much as the MSE-based training, especially in the areas of stenosis. This justifies the choice of MSE loss for model training.
Among the 10 test cases, we performed a qualitative assessment using 3D WSS surface plot in order to illustrate in more detail the patient-level WSS prediction results. Two of those vessels with the most accurate and inaccurate WSS predictions are presented in Figure 5, and similar plots for the full 10 test cases are presented in the Supplementary Figure S10. It is apparent that in both cases the WSS patterns are generally captured by the DL model, but there are differences after looking the data in detail, e.g., in the distal region of Patient 7 where the vessel is more tortuous. This trend is observed in the other models shown in the Supplementary Material. Patient 7 appears to be the most curved/tortuous and having multiple stenosis sites, which could have given the difficulty in the prediction of WSS by the DL model. Additionally, we observed that the prediction by the DL model underestimates WSS in the stenosis region and tends to overestimate otherwise.
Figure 5. Example 3D maps of predicted WSS, in comparison with original CFD-based WSS: Patient 3 with small prediction error (left) and Patient 7 with larger error (right). WSS maps of all 10 patients are shown in Supplementary Figure S10.
3.3 Feature importance
Figure 6 shows the value of the 17 features based on the integrated gradients estimations. The integrated gradients are calculated per each data point on the vascular wall surface and give an evaluation on how the input features contribute to the prediction. In order to conduct an overall assessment of the model, the samples need to be aggregated; the mean feature importance across all points along a vessel surface, and the mean across all test patients are summed to have the final results. Our findings indicate that the radius and inlet velocity are the most important features while the circumferential coordinate
Figure 6. Feature importance for deep learning model inputs, aggregated over 10 test patients’ data. Positive and negative scores represent positive and negative correlations, similarly to statistical correlation coefficients.
The top six important features were then selected (velocity, radius, theta, curvature, tangential vector in Y direction and normal vector in Z direction) and the model was retrained with MSE as the loss function. This resulted in an MSE of 8.89
4 Discussion
In this study, we conducted DL-based WSS prediction of stenosed arteries based on CFD-based WSS calculations of 50 patients’ and 550 synthetic vessels. The results showed, that despite the relatively limited number of training data, DL-based WSS prediction is feasible in stenosed patient-specific geometries. WSS prediction of diseased coronary arteries is a challenging problem as the range of WSS that needs to be predicted is much larger than that of a vessel without a stenosis including bifurcations. Dolan et al. (2012) reported that the WSS of the arterial system ranges from 1 to 7 Pa while in straight arteries, the time-averaged WSS physiological range is between 1.5 and 2.5 Pa. In the areas of bifurcation, WSS is raised because of flow impingement on bifurcation carina and range from 11 to 34 Pa (Lindekleiv et al., 2010). However, in the areas of stenosis this can even be
4.1 Synthetic data
As mentioned earlier, training of DL models tend to require large data sets. For example, Cho et al. (2016) investigated the impact of increasing the number of training samples of axial Computed Tomography (CT) images for classification into six anatomical classes. The results showed that an increase in the number of samples from 5 to 200 improved model accuracy from 8% to 95% (Cho et al., 2016). Similar findings have been also reported in other studies that highlighted the relationship between a larger data set and improved model performance, not only for machine learning but especially for deep learning (Sarker, 2021; Halevy et al., 2009; Zhang et al., 2019).
Gharleghi et al. (2020) generate a synthetic dataset out of 127 patients by modifying the bifurcation angle geometry and, as a result, obtain 3,302 synthetic patients. In our paper, to augment our dataset, we took a simple approach by flipping the geometry in combination with changing the input velocity. This results in 550 synthetic patients that were then used to train a DL model that can predict WSS with an accuracy that is similar to previous reports. An advantage of the proposed method is that it does not require creation of vessel geometry and computational mesh generation. These are time-consuming processes requiring up to 3 h to estimate the WSS patterns of a single new synthetic “model.”
The training dataset size in this study, is deemed effective in terms of the trade-off between accuracy and training time, based on our preliminary investigation using idealised curved tubes with a stenosis (Supplementary Material). However, it is likely that a larger training datasets would provide more accurate estimation of WSS patterns as real coronary anatomy varies between patients and vessels and this critically determines WSS. To mitigate this, acquisition of real patient data is ideal, but considering the largest number of patients reported in a CFD analysis is approximately 500 (Stone et al., 2012), synthetic data will still be necessary. Additionally, patients’ data are often associated with confidentiality concerns, which may prevent the dataset from being shared with the research community. Thus, future studies should try to combine the inclusion of larger clinical datasets and the creation of synthetic data from these data.
A common method for synthetic data generation of geometries in biomedical engineering is statistical shape modelling (SSM). Three-dimensional (3D) SSM facilitates the study of shape variability and allow the creation of new geometries with a wide range of variability (Alemany et al., 2019). Such statistically varied models offer an opportunity for experiments by exploring how changes to the shape geometry impact other factors (Sarker, 2021), which has been utilised in combination with CFD (Lamecker and Zachow, 2016). For example, Bruse et al. (2017) show that SSMs can aid to improve cardiac device development by modelling hemodynamic and geometric boundary conditions in cardiac anatomy. These models have been shown to improve both the efficiency robustness and value of synthetic patient data (King et al., 2019).
4.2 Model results
As previously noted, the MSE was used as a loss function as it facilitates a better fitting of the model prediction to WSS peaks due to the larger penalty in these areas. This is ideal for our vessels as there are sharp increases of WSS in the areas of a tight stenosis, reaching up to 70 Pa, whereas normal WSS range is reported to be 1–7 Pa (Malek, 1999). The WSS prediction error from the model trained on MAE turned out to be 2.10
Bland-Altman plots were generated for the minimum, maximum and mean predominant WSS to assess the difference between the CFD-based and DL-predicted estimations (refer to Figure 2). The limits of agreement range from −3.59 to 3.37 Pa. These ranges may have implications when WSS is low, but in the case of prediction of WSS in stenotic vessels where WSS in our dataset reached over 70 Pa, this result shows that the DL performed well and may have a value for real time computation of the WSS. The heterogeneous distribution of the WSS data in the test set underscores the challenge of the prediction task. Of note at the lesion site, which is the most clinically relevant segment, the range of difference between DL and reference standard was −4.04–3.96 Pa and this is comparable to that reported by Tufaro et al. (2022) (−4.1–5.7 Pa) who compared the estimations of two CFD-based approaches (one conventional CFD analysis performed using the ANSYS software a CFD analysis performed by a dedicated software CAAS Workstation WWS) in models reconstructed by two different software (Medis vs. Pie Medical).
An example of patient-level WSS prediction in Figure 4 shows that the error values are comparable for both the MAE trained and MSE trained models (2.85 Pa vs. 2.55 Pa respectively) however the WSS profile of the MSE approached better the reference standard in areas of stenosis. Nonetheless, in order to identify the areas that may be more difficult for prediction, a difference 3D plot is generated in addition to 3D visualization for DL-predicted and CFD-based coronaries (refer to Figure 5). Although the DL prediction is able to capture the overall WSS distribution, it can be seen that in diseased segments, the prediction is typically lower than the CFD-based results and the error is higher. Nevertheless, our prediction accuracy is similar to other existing models that range from 1.6% to 10.1% NMAE (normalised by the maximum) (Su et al., 2020; Suk et al., 2022), despite the fact that we included distinct stenoses.
4.3 Feature importance
Explainability is one of the current key advancements in machine learning models, moving forward from the use of black-box models (Belle and Papantonis, 2020). It is important to explain why a model has made a particular prediction before applying this in clinical practice. In typical supervised learning, e.g., linear regression models, model coefficients associating the input features to prediction output can be used to explain the importance of a specific input feature. However, this is more challenging with DL approaches, as they are typically black-boxes. In this paper, we introduced engineered features based on domain knowledge in order to improve model prediction. Feature engineering plays a key role in model prediction (Heaton, 2016). Previous works have demonstrated a relationship between key features and WSS, such as curvature and velocity (van Oijen, 2005).
Our results indeed indicate that velocity carries significant importance, following the vessel radius. This makes a mechanistic sense since WSS on a tube wall is theoretically determined by the flow rate and radius assuming Newtonian fluid and parabolic velocity profile
4.4 Limitations
This study was an initial attempt to predict WSS in patient-specific stenotic vessel geometries, hence it has limitations. First, the number of patient-specific vessels was limited to 50, of which only 40 were used for training. This also limited the range of stenosis degree and inflow velocity as presented in Table 1. Although the results indicated comparable predictive capability to the literature, this can be improved by taking advantage of the full set of patients
Future work is also needed to improve model performance and optimise DL architecture. The U-net approach used in this study is a well established approach, but it has been developed in 2015 (Ronneberger et al., 2015) and since then several other DL approaches have been introduced in cardiovascular research such as physics-informed neural networks (PINNs) (Raissi et al., 2019) and graph-neural networks (GNNs) (Scarselli et al., 2009). These DL architectures may enhance model performance.
Improvements to data augmentation in terms of quality and quantity can also be introduced. For instance, additional features may be engineered such as torsion in order to enrich the data set and, as a result, the data quality. Moreover, a finer level of granularity such as more slices per patient may be beneficial for achieving more realistic results, in addition to generating a larger cohort of synthetic data. Finally, a further deep dive into explainability should be considered, in particular in the areas of stenosis compared to the rest of the artery.
5 Conclusion
This paper demonstrated for the first time that DL-based prediction of WSS is feasible and has overall high performance that is comparable with previously-reported studies based on idealised stenotic vessels. The model used for prediction is inspired by a U-net architecture and achieves state-of-the-art performance at 6.03% NMAE. Training time is under 2.5 s per epoch and inference is at the order of milliseconds, making this a fast solution and an attractive alterative to current CFD analysis. Furthermore, we demonstrate the impact of utilising the MSE rather than the MAE as a loss function for training. Finally, model performance is explained via ranked feature importance calculated using the integrated gradients method. Although the model currently provides inaccurate predictions for some patients and may not yet be applicable for clinical application, it appears that it has the potential to replace the CFD-based WSS computation in clinical practice, as it is computationally inexpensive and able to operate in real time.
Data availability statement
The datasets for this article are not publicly available due to concerns regarding participant/patient anonymity. Requests to access the datasets should be directed to the corresponding author.
Ethics statement
All patient identifiable fields were removed prior analysis. Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and institutional requirements.
Author contributions
SA: Formal Analysis, Investigation, Methodology, Software, Writing–original draft, Writing–review and editing. VT: Data Curation, Investigation, Writing–review and editing. MT: Data Curation, Writing–review and editing. PK: Software, Writing–review and editing. AM: Resources, Writing–review and editing. AB: Resources, Writing–review and editing. JJ: Supervision, Writing–review and editing. CB: Conceptualization, Resources, Writing–review and editing. RT: Conceptualization, Methodology, Supervision, Resources, Writing–original draft, Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was funded in part by the Wellcome Trust (209553/Z/17/Z). For the purpose of open access, the author has applied a CC-BY public copyright licence to any author accepted manuscript version arising from this submission.
Conflict of interest
Author PK was employed by company Medis Medical Imaging Systems BV.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbioe.2024.1360330/full#supplementary-material
Footnotes
1https://github.com/albermax/innvestigate
References
Alber, M., Lapuschkin, S., Seegerer, P., Hägele, M., Schütt, K. T., Montavon, G., et al. (2018). Innvestigate neural networks!. J. Mach. Learn. Res. 20, 1–8.
Alemany, S., Uriel, J., Ballester, A., and Parrilla, E. (2019). “Chapter 32 - three-dimensional body shape modeling and posturography,” in DHM and posturography. Editors S. Scataglini, and G. Paul (Cambridge, MA: Academic Press), 441–457. doi:10.1016/B978-0-12-816713-7.00032-5
Arzani, A., Wang, J., Sacks, M., and Shadden, S. (2022). Machine learning for cardiovascular biomechanics modeling: challenges and beyond. Ann. Biomed. Eng. 50, 615–627. doi:10.1007/s10439-022-02967-4
Belle, V., and Papantonis, I. (2020). Principles and practice of explainable machine learning. Front. Big. Data 4, 688969. doi:10.3389/fdata.2021.688969
Bruse, J. L., Giusti, G., Baker, C., Cervi, E., Hsia, T. Y., Taylor, A. M., et al. (2017). Statistical shape modeling for cavopulmonary assist device development: variability of vascular graft geometry and implications for hemodynamics. J. Med. Dev. 11, 021011. doi:10.1115/1.4035865
Cecchi, E., Giglioli, C., Valente, S., Lazzeri, C., Gensini, G. F., Abbate, R., et al. (2011). Role of hemodynamic shear stress in cardiovascular disease. Atherosclerosis 214, 249–256. doi:10.1016/j.atherosclerosis.2010.09.008
Chlap, P., Min, H., Vandenberg, N., Dowling, J., Holloway, L., and Haworth, A. (2021). A review of medical image data augmentation techniques for deep learning applications. J. Med. Imaging Radiat. Oncol. 65, 545–563. doi:10.1111/1754-9485.13261
Cho, J., Lee, K., Shin, E., Choy, G., and Do, S. (2016) How much data is needed to train a medical image deep learning system to achieve necessary high accuracy?. arXiv.
Cossio, M. (2023). Augmenting medical imaging: a comprehensive catalogue of 65 techniques for enhanced data analysis. arXiv.
Dolan, J. M., Kolega, J., and Meng, H. (2012). High wall shear stress and spatial gradients in vascular pathology: a review. Ann. Biomed. Eng. 41, 1411–1427. doi:10.1007/s10439-012-0695-0
Farajtabar, M., Biglarian, M., and Miansari, M. (2021) Machine-learning identification of hemodynamics in coronary arteries in the presence of stenosis. arXiv.
Ferdian, E., Dubowitz, D. J., Mauger, C. A., Wang, A., and Young, A. A. (2022). Wssnet: aortic wall shear stress estimation using deep learning on 4d flow MRI. Front. Cardiovasc. Med. 8, 769927. doi:10.3389/fcvm.2021.769927
Fossan, F. E., Müller, L. O., Sturdy, J., Bråten, A. T., Jørgensen, A., Wiseth, R., et al. (2021). Machine learning augmented reduced-order models for ffr-prediction. Comput. Methods Appl. Mech. Eng. 384, 113892. doi:10.1016/j.cma.2021.113892
Gharleghi, R., Samarasinghe, G., Sowmya, A., and Beier, S. (2020). “Deep learning for time averaged wall shear stress prediction in left main coronary bifurcations,” in 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI), Lowa City, IA, April 3–7, 2020, 1–4.
Gharleghi, R., Sowmya, A., and Beier, S. (2022). Transient wall shear stress estimation in coronary bifurcations using convolutional neural networks. Comput. Methods Programs Biomed. 225, 107013. doi:10.1016/j.cmpb.2022.107013
Halevy, A., Norvig, P., and Pereira, F. (2009). The unreasonable effectiveness of data. IEEE Intell. Syst. 24, 8–12. doi:10.1109/mis.2009.36
Heaton, J. (2016). An empirical analysis of feature engineering for predictive modeling. SoutheastCon 2016, 1–6.
Holzinger, A., Saranti, A., Molnar, C., Biecek, P., and Samek, W. (2022). Explainable ai methods - a brief overview. Cham, Switzerland: Springer International Publishing, 13–38. doi:10.1007/978-3-031-04083-2_2
Hoogendoorn, A., Kok, A. M., Hartman, E. M. J., de Nisco, G., Casadonte, L., Chiastra, C., et al. (2019). Multidirectional wall shear stress promotes advanced coronary plaque development: comparing five shear stress metrics. Cardiovasc. Res. 116, 1136–1146. doi:10.1093/cvr/cvz212
Jordanski, M., Radovic, M., Milosevic, Z., Filipovic, N., and Obradovic, Z. (2018). Machine learning approach for predicting wall shear distribution for abdominal aortic aneurysm and carotid bifurcation models. IEEE J. Biomed. Health Inf. 22, 537–544. doi:10.1109/JBHI.2016.2639818
Khan, M., Hashim, M., Mustafa, H., Baniyas, M., Suwaidi, S., AlKatheeri, R., et al. (2020). Global epidemiology of ischemic heart disease: results from the global burden of disease study. Cureus 12, e9349. doi:10.7759/cureus.9349
King, A. P., and Eckersley, R. J. (2019). “Chapter 10 - statistical shape models,” in Statistics for biomedical engineers and scientists. Editors A. P. King, and R. J. Eckersley (Cambridge, MA: Academic Press), 217–228. doi:10.1016/B978-0-08-102939-8.00019-0
Kok, A. M., Molony, D., Timmins, L. H., Ko, Y. A., Boersma, E., Eshtehardi, P., et al. (2019). The influence of multidirectional shear stress on plaque progression and composition changes in human coronary arteries. EuroIntervention 15, 692–699. doi:10.4244/EIJ-D-18-00529
Lamecker, H., and Zachow, S. (2016). Statistical shape modeling of musculoskeletal structures and its applications. Cham, Switzerland: Springer International Publishing, 1–23. doi:10.1007/978-3-319-23482-3_1
Lindekleiv, H. M., Valen-Sendstad, K., Morgan, M. K., Mardal, K. A., Faulder, K., Magnus, J. H., et al. (2010). Sex differences in intracranial arterial bifurcations. Gend. Med. 7, 149–155. doi:10.1016/j.genm.2010.03.003
Lundberg, S. M., and Lee, S. I. (2017). “A unified approach to interpreting model predictions,” in Advances in neural information processing systems Volume 30. Editors I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, and S. Vishwanathan (Glasgow, Scotland: Curran Associates, Inc).
Malek, A. (1999). Hemodynamic shear stress and its role in atherosclerosis. JAMA 282, 2035. doi:10.1001/jama.282.21.2035
Marcus, J. T., Smeenk, H. G., Kuijer, J. P. A., van der Geest, R. J., Heethaar, R. M., and van Rossum, A. (1999). Flow profiles in the left anterior descending and the right coronary artery assessed by mr velocity quantification: effects of through-plane and in-plane motion of the heart. J. Comput. assisted Tomogr. 23 (4), 567–576. doi:10.1097/00004728-199907000-00017
Morbiducci, U., Mazzi, V., Domanin, M., De Nisco, G., Vergara, C., Steinman, D., et al. (2020). Wall shear stress topological skeleton independently predicts long-term restenosis after carotid bifurcation endarterectomy. Ann. Biomed. Eng. 48, 2936–2949. doi:10.1007/s10439-020-02607-9
Nishio, M., Noguchi, S., Matsuo, H., and Murakami, T. (2020). Automatic classification between covid-19 pneumonia, non-covid-19 pneumonia, and the healthy on chest x-ray image: combination of data augmentation methods. Sci. Rep. 10, 17532. doi:10.1038/s41598-020-74539-2
Pajaziti, E., Montalt-Tordera, J., Capelli, C., Sivera, R., Sauvage, E., Quail, M., et al. (2023). Shape-driven deep neural networks for fast acquisition of aortic 3d pressure and velocity flow fields. PLOS Comput. Biol. 19, e1011055. doi:10.1371/journal.pcbi.1011055
Perez, L., and Wang, J. (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv.
Raissi, M., Perdikaris, P., and Karniadakis, G. (2019). Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 378, 686–707. doi:10.1016/j.jcp.2018.10.045
Ronneberger, O., Fischer, P., and Brox, T. (2015). “U-net: convolutional networks for biomedical image segmentation,” in Medical image computing and computer-assisted intervention – miccai 2015. Miccai 2015. Cham, Switzerland: Springer. doi:10.48550/ARXIV.1505.04597
Sarker, I. (2021). Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions. SN Comput. Sci. 2 (6), 420. doi:10.1007/s42979-021-00815-1
Scarselli, F., Gori, M., Tsoi, A. C., Hagenbuchner, M., and Monfardini, G. (2009). The graph neural network model. IEEE Trans. Neural Netw. 20, 61–80. doi:10.1109/TNN.2008.2005605
Schmid, J., Assassi, L., and Chênes, C. (2023). A novel image augmentation based on statistical shape and intensity models: application to the segmentation of hip bones from ct images. Eur. Radiol. Exp. 7, 39. doi:10.1186/s41747-023-00357-6
Schrauwen, J., Wentzel, J., van der Steen, A., and Gijsen, F. (2014). Geometry-based pressure drop prediction in mildly diseased human coronary arteries. J. Biomech. 47, 1810–1815. doi:10.1016/j.jbiomech.2014.03.028
Shen, D., Wu, G., and Suk, H. I. (2017). Deep learning in medical image analysis. Annu. Rev. Biomed. Eng. 19, 221–248. doi:10.1146/annurev-bioeng-071516-044442
Stone, P., Saito, S., Takahashi, S., Makita, Y., Nakamura, S., Kawasaki, T., et al. (2012). Prediction of progression of coronary artery disease and clinical outcomes using vascular profiling of endothelial shear stress and arterial plaque characteristics the prediction study. Circulation 126, 172–181. doi:10.1161/CIRCULATIONAHA.112.096438
Stone, P. H., Maehara, A., Coskun, A. U., Maynard, C. C., Zaromytidou, M., Siasos, G., et al. (2018). Role of low endothelial shear stress and plaque characteristics in the prediction of nonculprit major adverse cardiac events. JACC Cardiovasc. Imaging 11, 462–471. doi:10.1016/j.jcmg.2017.01.031
Su, B., Zhang, J. M., Zou, H., Ghista, D., Le, T. T., and Chin, C. (2020). Generating wall shear stress for coronary artery in real-time using neural networks: feasibility and initial results based on idealized models. Comput. Biol. Med. 126, 104038. doi:10.1016/j.compbiomed.2020.104038
Suk, J., de Haan, P., Lippe, P., Brune, C., and Wolterink, J. M. (2022). Mesh convolutional neural networks for wall shear stress estimation in 3d artery models. Berlin, Germany: Springer International Publishing, 93–102. doi:10.1007/978-3-030-93722-5_11
Teng, Z., Canton, G., Yuan, C., Ferguson, M., Yang, C., Huang, X., et al. (2010). 3d critical plaque wall stress is a better predictor of carotid plaque rupture sites than flow shear stress: an in vivo mri-based 3d fsi study. J. Biomech. Eng. 132 (3), 031007. doi:10.1115/1.4001028
Torii, R., Wood, N. B., Hadjiloizou, N., Dowsey, A. W., Wright, A. R., Hughes, A. D., et al. (2009). Stress phase angle depicts differences in coronary artery hemodynamics due to changes in flow and geometry after percutaneous coronary intervention. Am. J. Physiology-Heart Circul. Phys. 296, H765–H776. doi:10.1152/ajpheart.01166.2007
Tufaro, V., Safi, H., Torii, R., Koo, B. K., Kitslaar, P., Ramasamy, A., et al. (2021). Wall shear stress estimated by 3d-qca can predict cardiovascular events in lesions with borderline negative fractional flow reserve. Atherosclerosis 322, 24–30. doi:10.1016/j.atherosclerosis.2021.02.018
Tufaro, V., Torii, R., Erdogan, E., Kitslaar, P., Koo, B. K., Rakhit, R., et al. (2022). An automated software for real-time quantification of wall shear stress distribution in quantitative coronary angiography data. Int. J. Cardiol. 357, 14–19. doi:10.1016/j.ijcard.2022.03.022
Zafar, H., Sharif, F., and Leahy, M. J. (2014). Measurement of the blood flow rate and velocity in coronary artery stenosis using intracoronary frequency domain optical coherence tomography: validation against fractional flow reserve. IJC Heart & Vasc. 5, 68–71. doi:10.1016/j.ijcha.2014.10.004
Zhang, L., Wang, X., Yang, D., Sanford, T., Harmon, S., Turkbey, B., et al. (2019) When unseen domain generalization is unnecessary? Rethinking data augmentation. arXiv.
Keywords: deep learning, coronary artery, stenosis, computational fluid dynamics, synthetic data
Citation: Alamir SH, Tufaro V, Trilli M, Kitslaar P, Mathur A, Baumbach A, Jacob J, Bourantas CV and Torii R (2024) Rapid prediction of wall shear stress in stenosed coronary arteries based on deep learning. Front. Bioeng. Biotechnol. 12:1360330. doi: 10.3389/fbioe.2024.1360330
Received: 22 December 2023; Accepted: 12 July 2024;
Published: 12 August 2024.
Edited by:
Nima Maftoon, University of Waterloo, CanadaReviewed by:
Juhyun Lee, University of Texas at Arlington, United StatesAmirhossein Arzani, The University of Utah, United States
Copyright © 2024 Alamir, Tufaro, Trilli, Kitslaar, Mathur, Baumbach, Jacob, Bourantas and Torii. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ryo Torii, ci50b3JpaUB1Y2wuYWMudWs=