Carotid atherosclerotic plaque segmentation in multi-weighted MRI using a two-stage neural network: advantages of training with high-resolution imaging and histology

Li, Ran; Zheng, Jie; Zayed, Mohamed A.; Saffitz, Jeffrey E.; Woodard, Pamela K.; Jha, Abhinav K.

doi:10.3389/fcvm.2023.1127653

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 24 May 2023

Sec. Cardiovascular Imaging

Volume 10 - 2023 | https://doi.org/10.3389/fcvm.2023.1127653

This article is part of the Research Topic: Intravascular and Non-invasive Imaging of Inflammatory Conditions in Atherosclerosis

Ran Li¹,²

Jie Zheng¹,²

Mohamed A. Zayed³

Jeffrey E. Saffitz⁴

Pamela K. Woodard¹,²*

Abhinav K. Jha¹,²*

¹Mallinckrodt Institute of Radiology, Washington University in St. Louis, St. Louis, MO, United States
²Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO, United States
³Department of Surgery, Washington University School of Medicine in St. Louis, St. Louis, MO, United States
⁴Department of Pathology, Beth Israel Deaconess Medical Center, Boston, MA, United States

Introduction: A reliable and automated method to segment and classify carotid artery atherosclerotic plaque components is needed to efficiently analyze multi-weighted magnetic resonance (MR) images to allow their integration into patient risk assessment for ischemic stroke. Certain plaque components such as lipid-rich necrotic core (LRNC) with hemorrhage suggest a greater likelihood of plaque rupture and stroke event. Assessment for presence and extent of LRNC could assist in directing treatment with impact upon patient outcomes.

Methods: To address the need to accurately determine the presence and extent of plaque components on carotid plaque MRI, we proposed a two-stage deep-learning-based approach that consists of a convolutional neural network (CNN) followed by a Bayesian neural network (BNN). The rationale for the two-stage network approach is to account for the class imbalance of vessel wall and background by providing an attention mask to the BNN. A unique feature of the network training was the use of ground truth defined by both high-resolution ex vivo MRI data and histopathology. More specifically, standard-resolution 1.5 T in vivo MR image sets with corresponding high-resolution 3.0 T ex vivo MR image sets and histopathology image sets were used to define ground-truth segmentations. Of these, data from 7 patients were used for training and data from the remaining 2 patients were used for testing the proposed method. Next, to evaluate the generalizability of the method, we tested the method with an additional standard-resolution 3.0 T in vivo data set of 23 patients obtained from a different scanner.

Results: Our results show that the proposed method yielded accurate segmentation of carotid atherosclerotic plaque and outperforms not only manual segmentation by trained readers, who did not have access to the ex vivo or histopathology data, but also three state-of-the-art deep-learning-based segmentation methods. Further, the proposed approach outperformed a strategy where the ground truth was generated without access to the high resolution ex vivo MRI and histopathology. The accurate performance of this method was also observed in the additional 23-patient dataset from a different scanner.

Conclusion: In conclusion, the proposed method provides a mechanism to perform accurate segmentation of the carotid atherosclerotic plaque in multi-weighted MRI. Further, our study shows the advantages of using high-resolution imaging and histology to define ground truth for training deep-learning-based segmentation methods.

1. Introduction

Atherosclerosis is the most common cause of death in the United States and throughout the world (1). Identification of atherosclerotic plaque composition including high risk features such as lipid rich necrotic core (LRNC) with hemorrhage has the potential to allow for event risk assessment and may allow better selection of patients for intervention (2, 3). High-resolution multi-weighted magnetic resonance imaging (MRI) has emerged as an effective tool for visualization and characterization of atherosclerotic plaque composition (4, 5). The signal characteristics of major plaque components across MR sequences of various (T1, T2, proton density) weighting have been well established with respect to histology (6, 7). Five different atherosclerotic plaque components have been identified based on signal intensities of multi-weighted MR images (Table 1). However, the manual segmentation and classification of plaque components, which currently depends on offline processing, requires a time consuming comparison of plaque signal characteristics across at least four sets of differently contrast-weighted MR images. This is labor-intensive, and therefore costly, and has the potential to delay the delivery of medical care. To address these issues, several automated segmentation algorithms based on multi-weighted MR images have been developed (8–14). These are typically supervised segmentation methods that use a data subset as a training set on which segmentation is performed manually. While these methods perform voxel-wise segmentation using image properties such as absolute value of intensities, intensity gradients, and wall distances, most are highly dependent on manually provided reference values. These values are error-prone due to two reasons: (1) inter and intra-reader variability; and (2) training set image quality which, because of its in vivo acquisition, often has limited resolution, and suffers from motion and noise-related artifacts. 
To accurately segment various plaque components, histopathology is the preferred gold standard for defining the ground truth. However, direct comparison of histopathology and in vivo MR images with relatively low spatial resolution is intractable due to the differences and inconsistencies in image characteristics, including shrinkage of fixed tissue and differences in orientation and slice thickness.

Table 1. Criteria of tissue segmentation. The symbols describe the signal intensity relative to adjacent muscle.

In this study, we developed a two-stage neural-network-based method for carotid vessel wall and plaque component segmentation by utilizing both high-resolution ex vivo MR images and histopathology in the same set of patients as ground truth. Use of high-resolution ex vivo MR images helps bridge the gap between standard-resolution in vivo MR images and histopathology images and potentially achieves a more accurate definition of the ground truth. A 9-patient standard-resolution 1.5 T in vivo MRI data set with corresponding high-resolution 3.0 T ex vivo MRI data and histopathology images was used to define ground-truth segmentations on the standard-resolution images. Data from 7 of these patients were used to train the proposed two-stage network, while data from the remaining 2 patients were used for testing. The first part of the two-stage network was a convolutional neural network (CNN) for inner and outer vessel wall segmentation. The second part was a Bayesian deep neural network (BNN) that allowed for input of aggregated multi-weighted MR image data. The goal of the BNN was to achieve pixel-level segmentation of plaque components. We hypothesize that this two-stage neural network can account for the class imbalance of vessel wall and background and has the potential to outperform both manual segmentation and state-of-the-art single-stage segmentation methods. To evaluate the generalizability of the method to an external dataset, we tested the method on a separate data set of 3.0 T in vivo-only MR images from 23 patients.

2. Method

2.1. Data acquisition

A total of 9 patients (6 males and 3 females) who were scheduled for carotid endarterectomy surgery were scanned in vivo within one week prior to surgery on a 1.5 T Sonata MR Scanner (Siemens Medical Solutions, Malvern, PA) using bilateral dedicated 4-element carotid surface coils (Machnet, Netherlands). The MR protocol acquired spin-lattice relaxation time (T1) weighted, spin-spin transverse relaxation time (T2) weighted, proton-density weighted, and time of flight (TOF) images. At approximately 2 h after surgery, the dissected carotid plaque tissue was placed in phosphate-buffered saline (PBS) solution and then scanned ex vivo on a 3 T Siemens Allegra MR scanner using a similar but higher-resolution multi-weighted MR protocol (Table 2a). Note that we also included TOF images in the ex vivo acquisition to keep the contrast (gradient-echo contrast) consistent with that used in vivo. A 3.5-cm diameter volume coil (Nova Medical, Inc, Wilmington, MA) was used as a transmitter and receiver (15). After the ex vivo MRI examination, the tissue was fixed and stained with hematoxylin and eosin (H&E) and Masson’s trichrome stains. The paraffin-embedded tissue blocks were cut every 1 mm, in an orientation chosen to approximate the orientation of the in vivo and ex vivo MRI slices. The whole dataset included a total of 84 sets of in vivo MR images, ex vivo MR images, and corresponding pathological sections. We used the ex vivo images and pathological sections to establish ground truth.

Table 2. MR imaging parameters.

As per best practices to evaluate deep-learning-based methods (16) and to evaluate the generalizability of our method to variation in scanners, we also evaluated our method on an additional external dataset acquired with a 3 T PET-MRI system. More specifically, in vivo multi-contrast MR images from 23 patients (12 males and 11 females) scanned on a 3.0 T Siemens PET-MRI system (Siemens mMR, Siemens Healthineers, Malvern, PA) were obtained (Table 2b). A pair of 4-element surface coils (Siemens Healthineers, Malvern, PA) was placed around the neck area for signal reception. A total of 445 in vivo MR slices with distinguishable carotid anatomy were selected for further analysis.

2.2. Data preprocessing and approach to define ground truth

The acquired images were corrected for coil sensitivity using a contrast-limited adaptive histogram equalization algorithm (17). In addition, to alleviate the issue of low signal-to-noise ratio, a block-matching and 3D filtering algorithm (18) was used to decrease noise prior to segmentation. Images from different MR sequences were co-registered based on the distance to the bifurcation. Following these steps, to generate ground-truth data from the data sets of 9 patients for training the network, three atherosclerotic plaque components, namely LRNC with older hemorrhage (late subacute or late chronic hemorrhage >1 week), calcification, and fibrous tissue, were segmented. A reader with over five years’ experience in MR imaging generated the ground truth. Intensity-based criteria (Table 1) were used for tissue classification (7) to perform preliminary segmentation of these plaque components. First, the lumen and outer boundary of the vessel wall were manually identified. To minimize the impact of noise and improve the consistency of manual segmentation, the adjacent sternocleidomastoid muscle was used as a reference to quantitatively define the threshold and signal intensity criterion. The preliminary segmentation was then manually validated with the assistance of ex vivo images and histopathology to establish the ground truth.
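As a rough illustration of muscle-referenced intensity classification, the sketch below normalizes a slice by the mean signal of a muscle region and labels each pixel as hypo-, iso-, or hyperintense. The function name `relative_intensity_class`, the boolean muscle mask input, and the 20% tolerance band are assumptions made for illustration only; the actual criteria are those of Table 1 as applied in the authors' custom tool.

```python
import numpy as np

def relative_intensity_class(image, muscle_mask, tolerance=0.2):
    """Label pixels relative to mean muscle signal: -1 hypo, 0 iso, +1 hyper.

    `tolerance` is a hypothetical band around the muscle mean,
    not a value taken from the paper.
    """
    image = np.asarray(image, dtype=float)
    muscle_mean = image[np.asarray(muscle_mask, dtype=bool)].mean()
    ratio = image / muscle_mean  # signal relative to the reference muscle
    labels = np.zeros(image.shape, dtype=int)
    labels[ratio < 1.0 - tolerance] = -1
    labels[ratio > 1.0 + tolerance] = 1
    return labels

# Toy 2x2 slice: the top row plays the role of the muscle reference region.
image = np.array([[100.0, 100.0], [50.0, 150.0]])
muscle_mask = np.array([[True, True], [False, False]])
labels = relative_intensity_class(image, muscle_mask)
```

Reselecting the reference muscle, as described in the ground-truth procedure, would correspond to passing a different `muscle_mask` and recomputing the labels.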

The segmentation procedure used to generate ground truth is delineated in Figure 1. With a slice thickness of only 1 mm in the ex vivo images, we could directly compare the segmented ex vivo images to the histopathology for a clearer definition of the generated ground truth. We used the same intensity-based criteria as used on the in vivo MR images to segment the ex vivo images. The second row of Figure 1 shows the histopathological sections with segmentation of their corresponding ex vivo MR images. To validate the in vivo segmentation, the trained reader ensured that the locations, sizes, and shapes of plaque components in segmented in vivo images were as close as possible to those in the segmented ex vivo images. If the in vivo segmentation had an apparent difference from the ex vivo segmentation and pathological sections, the reference muscle was reselected until the difference was eliminated. All ground-truth generating steps were performed using a custom-designed tool developed with MATLAB. Of the 84 2D slices acquired from 9 patients, a subset of 70 slices from 7 patients was augmented using flipping and rotation to yield a dataset of 420 slices. This was used as the training set. Once trained, the method was tested with 14 slices from the other 2 patients, where again, the ground truth was defined with the assistance of ex vivo MR imaging data and histopathology.
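The expansion from 70 to 420 slices corresponds to six variants per slice. A minimal sketch of one way to obtain this with flips and rotations is given below; the particular set of transforms (identity, two flips, three 90-degree rotations) and the function name `augment_slice` are assumptions, since the exact transforms are not listed in the text.

```python
import numpy as np

def augment_slice(image):
    """Return six variants of a 2D slice: identity, two flips, three rotations.

    The choice of transforms is an assumption that reproduces a 6x expansion.
    """
    return [
        image,
        np.fliplr(image),       # left-right flip
        np.flipud(image),       # up-down flip
        np.rot90(image, k=1),   # 90 degrees counter-clockwise
        np.rot90(image, k=2),   # 180 degrees
        np.rot90(image, k=3),   # 270 degrees counter-clockwise
    ]

# Random arrays stand in for the 70 training slices.
rng = np.random.default_rng(0)
slices = [rng.random((8, 8)) for _ in range(70)]
augmented = [variant for s in slices for variant in augment_slice(s)]
```

In practice the same transform would have to be applied to each slice's label map so that images and ground truth stay aligned.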

Figure 1. The strategy to generate ground truth segmentation shown on a representative slice.

Further, as mentioned above, the method was also tested with an additional test dataset of 445 MR slices from 23 patients obtained from a 3.0 T MR scanner. Ground truth segmentation of the additional test dataset was obtained using manual annotation performed by an experienced observer with the same custom designed tool as described above.

2.3. Proposed segmentation method

The proposed method consists of two networks, namely a CNN followed by a BNN, referred to as Stage I and Stage II, respectively. T1-weighted (T1W) images were input to the CNN, which segmented the contours of the lumen and outer artery wall. The output of the CNN was grouped with the 4-channel aggregated MR images and input to the BNN, which then provided segmentation of plaque components. The details of the two networks are provided in the Supplementary Appendix.
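To make the data flow between the two stages concrete, the sketch below assembles a Stage II input from a Stage I vessel wall mask and four co-registered contrast images. Treating the grouping as channel-wise stacking is an assumption, as are the function name `build_stage2_input` and the channel order; the network architectures themselves are described in the Supplementary Appendix and are not reproduced here.

```python
import numpy as np

def build_stage2_input(wall_mask, t1w, t2w, pdw, tof):
    """Stack the Stage I wall mask with four co-registered contrast images.

    Returns an array of shape (5, H, W): one mask channel plus four image
    channels. Channel-wise stacking is an illustrative assumption.
    """
    channels = [np.asarray(wall_mask, dtype=float), t1w, t2w, pdw, tof]
    return np.stack(channels, axis=0)

# Toy 4x4 inputs; the mask marks a small ring-like wall region.
height, width = 4, 4
wall_mask = np.zeros((height, width), dtype=bool)
wall_mask[1:3, 1:3] = True
contrasts = [np.full((height, width), float(i)) for i in range(1, 5)]
stage2_input = build_stage2_input(wall_mask, *contrasts)
```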

2.3.1. Training

The CNN and BNN were trained separately. We randomly selected 80% of the data out of the whole dataset as the training set. The CNN was trained with T1-weighted images only, and the network hyperparameters were optimized on the training set via five-fold cross validation. Subsequently, the BNN was trained with the same training set, but comprising all multi-weighted MR images and vessel wall masks. The hyperparameters of the BNN were optimized with the same method as those of the CNN. The loss functions of the CNN and BNN were combinations of cross-entropy loss, Dice loss, and K-L divergence loss, denoted by Loss_CE, Loss_Dice, and Loss_KLD, respectively. These loss functions are given by:

\begin{aligned} \mathrm{Loss}_{CE} &= -\left[ y_{true} \log (y_{pred}) + (1 - y_{true}) \log (1 - y_{pred}) \right] \\ \mathrm{Loss}_{Dice} &= 1 - \frac{2 \sum_{pixel} y_{true} y_{pred}}{\sum_{pixel} y_{true}^{2} + \sum_{pixel} y_{pred}^{2}} \\ \mathrm{Loss}_{KLD} &= \sum_{pixel} y_{true} \log \left( \frac{y_{true}}{y_{pred}} \right) \end{aligned}

The mixed loss function of CNN, denoted by Loss_CNN, is given by:

\mathrm{Loss}_{CNN} = \frac{1}{2} (\mathrm{Loss}_{CE} + \mathrm{Loss}_{Dice}) .

The purpose of the mixed loss function of CNN was to handle the class imbalance caused by the vessel wall, which often occupies a considerably smaller volume relative to the background (19).
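A minimal NumPy sketch of the mixed CNN loss is given below for a binary wall-versus-background map. It uses the conventional negative sign for cross-entropy, averages the cross-entropy over pixels, and adds a small constant `EPS` for numerical stability; the averaging, the constant, and the function names are assumptions rather than details from the paper.

```python
import numpy as np

EPS = 1e-7  # numerical guard; not part of the published formulas

def cross_entropy_loss(y_true, y_pred):
    """Binary cross-entropy, averaged over pixels (averaging is an assumption)."""
    y_pred = np.clip(y_pred, EPS, 1.0 - EPS)
    per_pixel = y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred)
    return float(-np.mean(per_pixel))

def dice_loss(y_true, y_pred):
    """One minus the soft Dice overlap with squared terms in the denominator."""
    intersection = np.sum(y_true * y_pred)
    denominator = np.sum(y_true ** 2) + np.sum(y_pred ** 2)
    return float(1.0 - 2.0 * intersection / (denominator + EPS))

def cnn_loss(y_true, y_pred):
    """Equal-weight mix of cross-entropy and Dice losses."""
    return 0.5 * (cross_entropy_loss(y_true, y_pred) + dice_loss(y_true, y_pred))

y = np.array([1.0, 0.0, 1.0, 0.0])
loss_perfect = cnn_loss(y, y)        # prediction equals the target
loss_inverted = cnn_loss(y, 1.0 - y) # prediction is the exact opposite
```

Because the Dice term depends on overlap rather than on the number of background pixels, it keeps contributing a useful gradient when the vessel wall occupies only a small fraction of the image.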

The mixed loss function of BNN, denoted by Loss_BNN, is given by:

\mathrm{Loss}_{BNN} = \frac{1}{2} (\mathrm{Loss}_{CE} + \mathrm{Loss}_{KLD}) .

The Loss_KLD was added to the Loss_CE for the posterior distribution approximation of the BNN (20).
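The mixed BNN loss can be sketched in the same way. The code below follows the pixel-wise K-L divergence as written in the formula above, with clipping by a small constant `EPS` so that pixels with y_true = 0 contribute zero; the clipping, the pixel-averaged cross-entropy, and the function names are assumptions for illustration.

```python
import numpy as np

EPS = 1e-7  # numerical guard; not part of the published formulas

def cross_entropy_loss(y_true, y_pred):
    """Binary cross-entropy, averaged over pixels (averaging is an assumption)."""
    y_pred = np.clip(y_pred, EPS, 1.0 - EPS)
    per_pixel = y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred)
    return float(-np.mean(per_pixel))

def kld_loss(y_true, y_pred):
    """Pixel-wise sum of y_true * log(y_true / y_pred)."""
    y_true_safe = np.clip(y_true, EPS, 1.0)  # avoids log(0) where y_true is 0
    y_pred_safe = np.clip(y_pred, EPS, 1.0)
    return float(np.sum(y_true * np.log(y_true_safe / y_pred_safe)))

def bnn_loss(y_true, y_pred):
    """Equal-weight mix of cross-entropy and K-L divergence losses."""
    return 0.5 * (cross_entropy_loss(y_true, y_pred) + kld_loss(y_true, y_pred))

y_true = np.array([1.0, 0.0])
y_pred = np.array([0.5, 0.5])
loss_value = bnn_loss(y_true, y_pred)
```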

2.4. Data analysis

2.4.1. Figures of merit (FOM) for evaluation

In all the experiments, to evaluate the performance, three FOMs were employed: Dice similarity coefficient (DSC), precision, and sensitivity, given by

\begin{aligned} DSC = \frac{2 T P}{F N + F P + 2 T P} \\ Sensitivity = \frac{T P}{F N + T P} \\ Precision = \frac{T P}{T P + F P} \end{aligned}

where TP, TN, FP, and FN denote true positives, true negatives, false positives, and false negatives, respectively, in the prediction of labels based on normalized signal intensities on multi-weighted MR images. The mean values and 95% confidence intervals (CIs) of these FOMs were computed.
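The three figures of merit can be computed directly from a pair of binary masks for one tissue type, as in the sketch below. The function name `segmentation_metrics` and the convention of returning 1.0 when a denominator is zero are assumptions; the paper does not state how empty masks were handled.

```python
import numpy as np

def segmentation_metrics(truth, prediction):
    """Return DSC, sensitivity, and precision for two binary masks."""
    truth = np.asarray(truth, dtype=bool)
    prediction = np.asarray(prediction, dtype=bool)
    tp = int(np.sum(truth & prediction))    # labeled in both
    fp = int(np.sum(~truth & prediction))   # labeled only in the prediction
    fn = int(np.sum(truth & ~prediction))   # labeled only in the ground truth
    # Returning 1.0 for empty denominators is an illustrative convention.
    dsc = 2 * tp / (2 * tp + fp + fn) if (tp + fp + fn) else 1.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 1.0
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    return {"dsc": dsc, "sensitivity": sensitivity, "precision": precision}

truth = np.array([1, 1, 1, 0, 0])
prediction = np.array([1, 1, 0, 1, 0])
metrics = segmentation_metrics(truth, prediction)
```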

2.4.2. Comparison with other deep learning methods

We compared the performance of the proposed method with three state-of-the-art deep learning methods, namely a U-Net (21), ResNet-101 (22), and DeepLabv3 (23). These methods were chosen as they are widely used in medical image segmentation. All compared methods were trained with ground truth obtained with the assistance of the high-resolution images and optimized via five-fold cross validation.

We also compared the performance of our method to the performance of trained readers. Two trained readers, each with two years of experience in MR imaging, referred to as Observer I and Observer II, were asked to segment the 14-slice test data using the same custom-designed tool described above. These observers were not provided the ex vivo data or the histopathologic sections for these 14 slices. Each observer was asked to segment the regions of interest twice to reduce the effect of intra-observer variability. We then compared the performance of each observer with the performance of our proposed method.

In all the comparison studies, statistical significance was assessed via a paired sample t-test, with a p-value <0.05 leading to the inference of a statistically significant difference.
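For reference, the paired-sample t statistic compares per-slice scores of two methods on the same slices. The sketch below computes the statistic and its degrees of freedom with the standard library only; the function name and the DSC values are made up for illustration and are not results from this study.

```python
import math
import statistics

def paired_t_statistic(scores_a, scores_b):
    """Return (t, degrees_of_freedom) for paired samples of equal length."""
    differences = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(differences)
    mean_diff = statistics.mean(differences)
    sd_diff = statistics.stdev(differences)  # sample standard deviation (n - 1)
    t = mean_diff / (sd_diff / math.sqrt(n))
    return t, n - 1

# Made-up per-slice DSC values for two methods on the same five slices.
method_a = [0.80, 0.75, 0.78, 0.82, 0.77]
method_b = [0.70, 0.72, 0.69, 0.75, 0.71]
t_stat, dof = paired_t_statistic(method_a, method_b)
```

The two-sided p-value follows from the t distribution with `dof` degrees of freedom; if SciPy is available, `scipy.stats.ttest_rel(method_a, method_b)` returns both the statistic and the p-value.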

2.4.3. Impact of using histology and ex vivo images to define ground truth

To investigate the impact of our high-resolution assisted ground-truth generation procedure on segmentation performance, we trained the proposed method using a strategy where the ground truth was generated without referring to the high-resolution images. In this strategy, one observer was asked to manually annotate all 70 slices of the training set three times to eliminate intra-observer variability. The two-stage neural network was then trained with 3 sets of manually annotated ground truth separately. The performance using this strategy was then evaluated on the test set (14 slices) and compared with the strategy that used high-resolution ex vivo MR images and histopathology to define the ground truth.

2.4.4. Sensitivity to variations in training data

We assessed the sensitivity of the plaque segmentation to variations in training data. First, we randomly separated the BNN training data into two subsets. The BNN was trained and optimized on these two subsets individually. This process yielded two versions of the proposed method, each trained with a different dataset. We then evaluated both versions using the 14-slice test data set, resulting in two sets of segmentation. The similarity of these two segmentation sets was quantified using DSC values, with a high value indicating less sensitivity to variations in the training data. For comparison, we also evaluated the sensitivity of the standard U-Net method to this variation in the training data.
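The agreement between the two trained versions can be quantified per plaque component by computing the DSC between their label maps, as sketched below. The integer class encoding (1 = LRNC with older hemorrhage, 2 = calcification, 3 = fibrous tissue, 0 = background) and the function name `per_class_dsc` are hypothetical choices for illustration.

```python
import numpy as np

def per_class_dsc(labels_a, labels_b, class_ids):
    """DSC between two label maps, computed separately for each class id."""
    labels_a = np.asarray(labels_a)
    labels_b = np.asarray(labels_b)
    scores = {}
    for class_id in class_ids:
        mask_a = labels_a == class_id
        mask_b = labels_b == class_id
        total = int(mask_a.sum() + mask_b.sum())
        overlap = int(np.sum(mask_a & mask_b))
        # Treat a class absent from both maps as perfect agreement (assumption).
        scores[class_id] = 2.0 * overlap / total if total else 1.0
    return scores

# Toy outputs of the two model versions on the same slice.
version_1 = np.array([[1, 1, 2], [3, 3, 0]])
version_2 = np.array([[1, 1, 2], [3, 0, 0]])
agreement = per_class_dsc(version_1, version_2, class_ids=[1, 2, 3])
```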

2.4.5. Studying the efficacy of using two networks in the proposed method

To study the impact of using two networks in our approach, we compared our method with an approach that used only the BNN (i.e., did not contain Stage I). The proposed two-stage method and the BNN-only approach were trained separately on the same training set and optimized via five-fold cross validation.

2.4.6. Evaluation with dataset from different scanner

To evaluate the generalizability of our method to variations in scanners, the proposed method was tested using a test set consisting of the 445 3.0 T MR slices described above. An experienced reader with over four years of experience in MR imaging manually annotated the 445 multi-weighted MR images using customized software. Next, the trained reader reviewed each preliminary tissue-type label and manually corrected inaccurate labels. We also compared the performance of the proposed method on the additional test set with that of the standard U-Net, DeepLabv3, and ResNet-101.

3. Results

3.1. Performance in segmenting plaque components

The performance of the proposed method in segmenting each tissue type is shown in Table 3. The proposed method outperformed (p < 0.05) all other methods on DSCs of all tissue types, yielding DSCs of 0.78 [95% confidence interval (CI): 0.75, 0.8], 0.62 (95% CI: 0.6, 0.65), and 0.74 (95% CI: 0.73, 0.76) for LRNC with older hemorrhage, calcification, and fibrous tissue, respectively. Two representative results of plaque segmentation are shown in Figure 2.

Figure 2. Two representative examples of multi-weighted MR images (acquired at 1.5 T) in the top two rows. The bottom two rows show the segmented LRNC with older hemorrhage (shown in red) and calcification (shown in blue) using the proposed method and three other deep learning-based segmentation algorithms, as compared to the ground truth. We observe that visually, the segmentation predicted by the proposed method is close to the ground truth.

Table 3. Performance in segmenting plaque components (95% confidence interval) in 2/9 patients scanned at 1.5 T MRI.

It was observed that the proposed method generally outperformed Observer I and Observer II over a range of tissue types and FOMs (Table 3), yielding 10%–28% higher DSCs in segmenting LRNC with older hemorrhage, calcification, and fibrous tissue, except that Observer II yielded better sensitivity for segmenting calcification. This demonstrates the higher accuracy of our model in comparison to human observers who did not have access to the ex vivo or histopathology data. In comparison to those of the BNN trained with manually annotated ground truth, the proposed method obtained significantly better DSCs with 11% and 16% improvement of calcification and fibrous tissue (Table 3).

3.2. Sensitivity to variations in training data

Table 4 shows the DSC between the segmentations yielded by the proposed method when the method was trained with two different training datasets. We observe that the DSC between the segmentations obtained with the two training datasets was greater than 0.8 for all three plaque components with the proposed method. Further, the corresponding DSC values obtained with the U-Net-based method were typically lower than those of the proposed method. This provides evidence that the proposed method is relatively insensitive to variations in the training data and more robust than U-Net.

Table 4. Sensitivity to variations in training data.

3.3. Studying the efficacy of using two networks in the proposed method

The comparison of the proposed method vs. just using a BNN is shown in Table 5. The proposed method significantly outperformed just using a BNN, yielding 15%, 40%, and 19% improvement of DSCs corresponding to LRNC with older hemorrhage, calcification, and fibrous tissue, respectively.

Table 5. Comparison of the proposed approach with just using a BNN.

3.4. Evaluation with dataset from a different scanner

The performance of the proposed method in the dataset from a 3 T MR scanner (23 patients) is shown in Table 6. The proposed method outperformed Standard U-Net, DeepLab v3, and ResNet-101 methods. In addition, two representative results are shown in Figure 3. In these results, we observe that the proposed method provided segmentation results similar to manual segmentation for both LRNC with older hemorrhage and calcification. We do note that manual labeling classified more tissue as LRNC with older hemorrhage in comparison to our proposed method. However, overall, when assessed upon the same 23-patient MRI data set, our proposed method was more accurate compared to the other three deep-learning-based methods.

Figure 3. Examples of additional test of the proposed method on the data acquired from the 3 T scanner. The segmented LRNC with older hemorrhage is shown in red, and the calcification in blue, using both the manual annotation and the proposed method. We see that the manual annotations are close to the output obtained with the proposed method.

Table 6. Evaluation with additional test dataset (23 patients scanned on a 3 T MRI).

4. Discussion

In this manuscript, based on the hypothesis that a two-stage neural network will account for the class imbalance of vessel wall and background, we implemented a two-stage neural network model with a CNN followed by a BNN to segment carotid atherosclerotic plaque components on multi-weighted MR images. Our major findings are: (1) the ground truth defined with the assistance of histopathology and ex vivo MR images improves segmentation performance of deep learning-based method; (2) the performance of the proposed method is superior to other state-of-the-art deep learning methods and manual segmentation by trained readers in segmenting atherosclerotic plaque components on multi-weighted MR images.

Several supervised algorithms for segmenting in vivo carotid plaque components in multi-weighted MRI have been developed to facilitate accurate assessment of plaque composition. However, these methods are dependent on manually annotated ground truth through visual comparison between relatively low resolution in vivo MRI data and external high resolution histopathological images. This approach may introduce segmentation errors due to the differences in resolution and orientation of MRI and histopathological images. To improve the quality of ground truth, we proposed a strategy where ground truth was generated with the assistance of high-resolution ex vivo MR images that were obtained from both non-fixed carotid specimens within 2–3 h after the endarterectomy and histopathology from the then later fixed specimens. As shown in Table 3, this strategy yields superior performance (p < 0.05) in the segmentation of LRNC with older hemorrhage, calcification and fibrous tissue compared to an approach that uses only the manually labeled ground truth for training. The method also provided superior performance (p < 0.05) compared to trained readers who were not provided the high-resolution images. Furthermore, the proposed method outperformed (p < 0.05) all compared deep-learning methods. This improved performance of our proposed method shows the advantages of using ground truth obtained with high-resolution imaging and histology.

We observe in Table 3 that, among all three plaque components, the proposed method yielded the best performance in segmenting LRNC with older hemorrhage. This may be attributed to the relatively large number of LRNC with older hemorrhage samples in our data set, providing an abundance of training samples. Moreover, the relatively high contrast of LRNC with older hemorrhage on all 4 contrast-weighted images also contributes to this performance. Fibrous tissue is as common as LRNC with older hemorrhage. However, the lower contrast of fibrous tissue contributes to a lower segmentation performance compared to LRNC with older hemorrhage. Calcification is hypointense on all four contrast-weighted images, which would be expected to make it easy for our model to segment. However, in our dataset, calcification was less frequently present compared to LRNC and fibrous tissue. Thus, less accurate performance was observed for segmenting calcification.

In Table 4, we observe that the method was relatively insensitive to changes in training data samples. This may be attributed to reliable ground truth in the training data sets. More specifically, it is likely that access to high-resolution data reduces variability in ground-truth generation, and thus makes the method less sensitive to changes in training data. Another reason may be the probabilistic parameters in the BNN, the application of ensemble predictors, and the resistance of the BNN to overfitting (24). This result also has important practical implications since it implies that the method could be trained at different centers and may still yield similar performance. Overall, these results provide evidence of the generalizability of the proposed method to an additional dataset and to variations in training data.

To assess the generalizability of the proposed method to differences in scanners, we performed additional testing on a separate dataset of 23 patients acquired from a third MRI scanner, which was part of a PET-MRI system. Although this scanner was from the same vendor, the use of a different system and imaging parameters helped evaluate the generalizability of our method. In these 23 patients, the proposed method yielded strong performance (DSC = 0.7) in segmenting LRNC with older hemorrhage (Table 6) demonstrating feasibility of its use in clinical practice.

Recently proposed best practices for evaluation of AI algorithms have recommended that the evaluation of an AI algorithm should yield a descriptive claim that quantifies the performance of the AI algorithm (16). We outline the following claim for the proposed algorithm: “A two-stage neural network-based approach for carotid atherosclerotic plaque segmentation in multi-weighted MRI, that was trained with the assistance of high-resolution imaging and histopathology images, outperformed (p < 0.05) state-of-the-art segmentation methods, yielded DSC of 0.78 (95% CI: 0.75, 0.8) in segmenting LRNC with hemorrhage using an independent test set, and outperformed (p < 0.05) a strategy where the method was trained without the assistance of these high-resolution images.”

Our study had several limitations. The study was performed with data from a single center. To assess for generalizability, evaluation of the method on datasets from different institutions is desirable. Next, our sample size for training the method was limited. To address this issue, we performed data augmentation using flipping and rotation, but other approaches, such as using simulation-based studies (25, 26) may be explored. Next, we note that the proposed method classifies each image pixel as belonging to only one region. However, given the access to high-resolution data, the method could be advanced to compute the volume that a given region occupies in each voxel. A Bayesian partial-volume estimation procedure was recently proposed towards achieving this goal in positron emission tomography (27) and single-photon emission computed tomography (28), and can be advanced for this application. A limitation of the evaluation study with the patient data is that manual segmentation was used as ground truth. However, as mentioned earlier, this segmentation, itself, may be erroneous. Finally, the figures of merit used for evaluation included precision, sensitivity, and DSC, but performance of these metrics may not translate to superior clinical performance (29). In carotid plaque imaging, the clinical goal is to assess vulnerability of the plaque (plaque with a high risk to rupture). Thus, preferably, the method should be evaluated based on this task (30). One challenge in performing this type of evaluation is the lack of ground truth, quantitative values of vulnerability, and lack of correlation to potential patient outcome. To address these issues, no-gold-standard evaluation techniques are being developed that evaluate the performance of segmentation methods on quantitative tasks in the absence of ground truth (31, 32). This research is currently under investigation.

5. Conclusion

In conclusion, our proposed deep-learning method trained on ground truth obtained with the assistance of high-resolution ex vivo and histopathology data yielded accurate performance of segmentation of carotid plaque components on MR images, outperformed other state-of-the-art segmentation methods, and yielded superior performance compared to trained readers. Additionally, the two-stage neural network model with CNN and BNN architecture was observed to be relatively insensitive to variations in training data and yielded reliable segmentation over other clinical datasets. These promising results motivate further evaluation of the proposed method using larger patient data sets for the accurate assessment of plaque vulnerability.

Data availability statement

The original contributions presented in the study are publicly available. This data can be found here: https://github.com/rockman151/carotid.

Ethics statement

The studies involving human participants were reviewed and approved by the Washington University Institutional Review Board. The patients/participants provided their written informed consent to participate in this study.

Author contributions

RL was responsible for model design, methodology, formal analysis, and manuscript writing. JS was responsible for reviewing histopathology images and data interpretation. MZ facilitated patient participation in the study and reviewed the manuscript. PW, JZ, and AJ were responsible for experimental design, data acquisition, supervision/oversight, funding resources, and reviewing the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported in part by NIH grants R01 HL132600, R01 HL150891, R01 EB031051, R01 HL159803 and R56 EB028287.

Acknowledgments

The authors would like to thank the Mallinckrodt Institute of Radiology Center for High Performance Computing, and Washington University School of Medicine in St. Louis for providing the GPU clusters.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2023.1127653/full#supplementary-material.

References

1. Virani SS, Alonso A, Aparicio HJ, Benjamin EJ, Bittencourt MS, Callaway CW, et al. Heart disease and stroke statistics—2021 update: a report from the American heart association. Circulation. (2021) 143(8):e254–e743. doi: 10.1161/CIR.0000000000000950

2. Wang TJ. Assessing the role of circulating, genetic, and imaging biomarkers in cardiovascular risk prediction. Circulation. (2011) 123(5):551–65. doi: 10.1161/CIRCULATIONAHA.109.912568

3. Ricotta JJ, AbuRahma A, Ascher E, Eskandari M, Faries P, Lal BK. Updated society for vascular surgery guidelines for management of extracranial carotid disease. J Vasc Surg. (2011) 54(3):e1–e31. doi: 10.1016/j.jvs.2011.07.031

4. Makowski MR, Botnar RM. MR Imaging of the arterial vessel wall: molecular imaging from bench to bedside. Radiology. (2013) 269(1):34–51. doi: 10.1148/radiol.13102336

5. Sanz J, Fayad ZA. Imaging of atherosclerotic cardiovascular disease. Nature. (2008) 451(7181):953–7. doi: 10.1038/nature06803

6. Yuan C, Mitsumori LM, Ferguson MS, Polissar NL, Echelard D, Ortiz G, et al. In vivo accuracy of multispectral magnetic resonance imaging for identifying lipid-rich necrotic cores and intraplaque hemorrhage in advanced human carotid plaques. Circulation. (2001) 104(17):2051–6. doi: 10.1161/hc4201.097839

7. Saam T, Ferguson MS, Yarnykh VL, Takaya N, Xu D, Polissar NL, et al. Quantitative evaluation of carotid plaque composition by in vivo MRI. Arterioscler Thromb Vasc Biol. (2005) 25(1):234–9. doi: 10.1161/01.ATV.0000149867.61851.31

8. Liu F, Xu D, Ferguson MS, Chu B, Saam T, Takaya N, et al. Automated in vivo segmentation of carotid plaque MRI with morphology-enhanced probability maps. Magn Reson Med. (2006) 55(3):659–68. doi: 10.1002/mrm.20814

9. Hofman JMA, Branderhorst WJ, Ten Eikelder HMM, Cappendijk VC, Heeneman S, Kooi ME, et al. Quantification of atherosclerotic plaque components using in vivo MRI and supervised classifiers. Magn Reson Med An Off J Int Soc Magn Reson Med. (2006) 55(4):790–9. doi: 10.1002/mrm.20828

10. Van’t Klooster R, Naggara O, Marsico R, Reiber JHC, Meder J-F, Van Der Geest RJ, et al. Automated versus manual in vivo segmentation of carotid plaque MRI. Am J Neuroradiol. (2012) 33(8):1621–7. doi: 10.3174/ajnr.A3028

11. Dong Y, Pan Y, Zhao X, Li R, Yuan C, Xu W. Identifying carotid plaque composition in MRI with convolutional neural networks. 2017 IEEE International Conference on Smart Computing (SMARTCOMP). Hong Kong, China. (2017) pp. 1–8. doi: 10.1109/SMARTCOMP.2017.7947015

12. Mukhoti J, Gal Y. Evaluating bayesian deep learning methods for semantic segmentation. arXiv Prepr arXiv181112709. (2018). doi: 10.48550/arXiv.1811.12709

13. Guan Q, Du B, Teng Z, Gillard J, Chen S. Bayes clustering and structural support vector machines for segmentation of carotid artery plaques in multicontrast MRI. Comput Math Methods Med. (2012) 2012:549102. doi: 10.1155/2012/549102. Retraction in: Computational And Mathematical Methods In Medicine. Comput Math Methods Med. (2014) 2014:836280. PMID: 23365619; PMCID: PMC3536030.

14. Zhang Q, Qiao H, Dou J, Sui B, Zhao X, Chen Z, et al. Plaque components segmentation in carotid artery on simultaneous non-contrast angiography and intraplaque hemorrhage imaging using machine learning. Magn Reson Imaging. (2019) 60:93–100. doi: 10.1016/j.mri.2019.04.001

15. Zheng J, El Naqa I, Rowold FE, Pilgram TK, Woodard PK, Saffitz JE, et al. Quantitative assessment of coronary artery plaque vulnerability by high-resolution magnetic resonance imaging and computational biomechanics: a pilot study ex vivo. Magn Reson Med. (2005) 54(6):1360–8. doi: 10.1002/mrm.20724

16. Jha AK, Bradshaw TJ, Buvat I, Hatt M, Prabhat KC, Liu C, et al. Nuclear medicine and artificial intelligence: best practices for evaluation (the RELAINCE guidelines). J Nucl Med. (2022) 63(9):1288–99. doi: 10.2967/jnumed.121.263239

17. Reza AM. Realization of the contrast limited adaptive histogram equalization (CLAHE) for real-time image enhancement. J VLSI Signal Process Syst Signal Image Video Technol. (2004) 38(1):35–44. doi: 10.1023/B:VLSI.0000028532.53893.82

18. Dabov K, Foi A, Katkovnik V, Egiazarian K. Image denoising with block-matching and 3D filtering. Image processing: algorithms and systems, neural networks, and machine learning. (2006). p. 606414. doi: 10.1117/12.643267

19. Yeung M, Sala E, Schönlieb C-B, Rundo L. Unified focal loss: generalising dice and cross entropy-based losses to handle class imbalanced medical image segmentation. Comput Med Imaging Graph. (2022) 95:102026. doi: 10.1016/j.compmedimag.2021.102026

20. Wen Y, Vicol P, Ba J, Tran D, Grosse R. Flipout: Efficient pseudo-independent weight perturbations on mini-batches. arXiv Prepr arXiv180304386. (2018). doi: 10.48550/arXiv.1803.04386

21. Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation. Lect Notes Comput Sci (Including Subser Lect Notes Artif Intell Lect Notes Bioinformatics). (2015) 9351:234–41. doi: 10.1007/978-3-319-24574-4_28

22. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition. (2016). p. 770–8. doi: 10.48550/arXiv.1512.03385

23. Chen L-C, Papandreou G, Schroff F, Adam H. Rethinking atrous convolution for semantic image segmentation. arXiv Prepr arXiv170605587. (2017). doi: 10.48550/arXiv.1706.05587

24. Burden F, Winkler D. Bayesian Regularization of neural networks. Artif Neural Netw. (2008) 458:25–44. doi: 10.1007/978-1-60327-101-1_3

25. Leung KH, Marashdeh W, Wray R, Ashrafinia S, Pomper MG, Rahmim A, et al. A physics-guided modular deep-learning based automated framework for tumor segmentation in PET. Phys Med Biol. (2020) 65(24):245032. doi: 10.1088/1361-6560/ab8535

26. Liu Z, Laforest R, Mhlanga J, Fraum TJ, Itani M, Dehdashti F, et al. Observer study-based evaluation of a stochastic and physics-based method to generate oncological PET images. In: Medical imaging 2021: image perception, observer performance, and technology assessment. (2021). p. 9–17. doi: 10.48550/arXiv.2102.02975

27. Liu Z, Mhlanga JC, Laforest R, Derenoncourt P-R, Siegel BA, Jha AK. A Bayesian approach to tissue-fraction estimation for oncological PET segmentation. Phys Med Biol. (2021) 66(12):124002. doi: 10.1088/1361-6560/ac01f4

28. Liu Z, Moon HS, Li Z, Laforest R, Perlmutter JS, Norris SA, et al. A tissue-fraction estimation-based segmentation method for quantitative dopamine transporter SPECT. Med Phys. (2022) 49(8):5121–37. doi: 10.1002/mp.15778

29. Liu Z, Mhlanga JC, Siegel BA, Jha AK. Need for objective task-based evaluation of AI-based segmentation methods for quantitative PET. arXiv Prepr arXiv230300640. (2023). doi: 10.1117/12.2647894

30. Jha AK, Kupinski MA, Rodriguez JJ, Stephen RM, Stopeck AT. Task-based evaluation of segmentation algorithms for diffusion-weighted MRI without using a gold standard. Phys Med Biol. (2012) 57(13):4425. doi: 10.1088/0031-9155/57/13/4425

31. Jha AK, Mena E, Caffo BS, Ashrafinia S, Rahmim A, Frey EC, et al. Practical no-gold-standard evaluation framework for quantitative imaging methods: application to lesion segmentation in positron emission tomography. J Med Imaging. (2017) 4(1):11011. doi: 10.1117/1.JMI.4.1.011011

32. Jha AK, Caffo B, Frey EC. A no-gold-standard technique for objective assessment of quantitative nuclear-medicine imaging methods. Phys Med Biol. (2016) 61(7):2780. doi: 10.1088/0031-9155/61/7/2780

Keywords: atherosclerotic plaque, MR images, segmentation, CNN, BNN

Citation: Li R, Zheng J, Zayed MA, Saffitz JE, Woodard PK and Jha AK (2023) Carotid atherosclerotic plaque segmentation in multi-weighted MRI using a two-stage neural network: advantages of training with high-resolution imaging and histology. Front. Cardiovasc. Med. 10:1127653. doi: 10.3389/fcvm.2023.1127653

Received: 19 December 2022; Accepted: 27 April 2023;
Published: 24 May 2023.

Edited by:

Hai-Ling Margaret Cheng, University of Toronto, Canada

Reviewed by:

Daniel Haehn, University of Massachusetts Boston, United States
Gustav Strijkers, Amsterdam University Medical Center, Netherlands

© 2023 Li, Zheng, Zayed, Saffitz, Woodard and Jha. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Pamela K. Woodard, woodardp@wustl.edu; Abhinav K. Jha, a.jha@wustl.edu
