- 1Department of Interventional Neuroradiology, Beijing Tiantan Hospital and Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
- 2School of Biomedical Engineering, Capital Medical University, Beijing, China
- 3Department of Neurointerventional Engineering and Technology, Beijing Engineering Research Center (NO: BG0287), Beijing, China
- 4China National Clinical Research Center for Neurological Diseases, Beijing, China
- 5Department of Neurosurgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
- 6Department of Radiology, Third Medical Center of Chinese PLA General Hospital, Beijing, China
- 7Department of Interventional Neuroradiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
- 8Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
Background and purpose: Anatomical labeling of the cerebral vasculature is a crucial topic in determining the morphological nature and characterizing the vital variations of vessels, yet precise labeling of the intracranial arteries is time-consuming and challenging, given anatomical structural variability and surging imaging data. We present a U-Net-based deep learning (DL) model to automatically label detailed anatomical segments in computed tomography angiography (CTA) for the first time. The trained DL algorithm was further tested on a clinically relevant set for the localization of intracranial aneurysms (IAs).
Methods: 457 examinations with varying degrees of arterial stenosis were used to train, validate, and test the model, aiming to automatically label 42 segments of the intracranial arteries [e.g., 7 segments of the internal carotid artery (ICA)]. Evaluation metrics included Dice similarity coefficient (DSC), mean surface distance (MSD), and Hausdorff distance (HD). Additionally, 96 examinations containing at least one IA were enrolled to assess the model’s potential in enhancing clinicians’ precision in IA localization. A total of 5 clinicians with different experience levels participated as readers in the clinical experiment and identified the precise location of IA without and with algorithm assistance, where there was a washout period of 14 days between two interpretations. The diagnostic accuracy, time, and mean interrater agreement (Fleiss’ Kappa) were calculated to assess the differences in clinical performance of clinicians.
Results: The proposed model exhibited notable labeling performance on 42 segments that included 7 anatomical segments of ICA, with the mean DSC of 0.88, MSD of 0.82 mm and HD of 6.59 mm. Furthermore, the model demonstrated superior labeling performance in healthy subjects compared to patients with stenosis (DSC: 0.91 vs. 0.89, p < 0.05; HD: 4.75 vs. 6.19, p < 0.05). Concurrently, clinicians with model predictions achieved significant improvements when interpreting the precise location of IA. The clinicians’ mean accuracy increased by 0.04 (p = 0.003), mean time to diagnosis reduced by 9.76 s (p < 0.001), and mean interrater agreement (Fleiss’ Kappa) increased by 0.07 (p = 0.029).
Conclusion: Our model stands proficient for labeling intracranial arteries using the largest CTA dataset. Crucially, it demonstrates clinical utility, helping prioritize the patients with high risks and ease clinical workload.
1 Introduction
Cerebrovascular diseases, such as aneurysms, stenosis, and arteriovenous malformations, are leading causes of death and disability (GBD, 2015 Mortality and Causes of Death Collaborators, 2016). The intrinsic characteristics of intracranial arteries enables to aid in understanding the disease pathogenesis that causes the morphology change and dysfunction of specific arterial segment and manifests as related clinical symptoms (Turan et al., 2010; Mackey et al., 2012). Hence, precise anatomical labeling of the intracranial arteries is crucial for physicians to understanding the mechanism, diagnosis, and treatment of cerebrovascular conditions. Although consensus on defining the fine segments of intracranial arteries through imaging and anatomy has been established (Bouthillier, van Loveren, and Keller, 1996; Yavagal and Haussen, 2011; Harrigan and Deveikis, 2018), manual labeling of fine segments of intracranial arteries is time-consuming and prone to inter-and intra-observer variability. Furthermore, this problem is exacerbated due to the lack of experienced radiologists given the increasing imaging data. Previous studies on anatomical labeling of intracranial arteries have been constrained by limited datasets and only performed on magnetic resonance angiography (MRA) images (Dunas et al., 2016; Robben et al., 2016; Dunas et al., 2017). Additionally, computed tomography angiography (CTA) seems to be more encouraging in the assessment of aneurysms, vessel stenosis and patency (Koelemay et al., 2004; Manniesing et al., 2008). Consequently, automated anatomical labeling of cerebral vasculature with detailed segments on CTA is urgent and essential for the diagnosis and treatment schemes of arterial diseases.
Deep learning (DL) has shown significant potential in medical image analysis tasks. Notably, the U-Net framework with symmetric network architecture is widely adopted in the field of medical image segmentation because of its flexibility and achieves remarkable successes (Jin et al., 2020; Klimont et al., 2020; Mubashar et al., 2022). Previous models have focused on the segmentation of the specific artery (e.g., the carotid artery) or the whole 3D cerebral vessels (Groves et al., 2020; Klimont et al., 2020; Bortsova et al., 2021; Guo et al., 2021). Only few studies strive for the automatic segmentation or labeling of intracranial arteries involved with detailed segments (i.e., automatic labelling of fine segments).
So far, this is the first study to develop a powerful U-Net-based DL model for labeling the most detailed segments of intracranial arteries on CTA scans. Additionally, 7 segments of the internal carotid artery (ICA) were successfully labeled for the first time where stenosis and aneurysms frequently occur. Moreover, the trained DL algorithm was applied in a clinical experiment to assess the impact on the precise intracranial aneurysm (IA) localization. The performance of 5 expert raters with different experience levels as to localization accuracy, clinical decision time, and the interrater agreement was analyzed without and with the support of artery classification by the DL algorithm, for a washout period of 14 days between two interpretations.
2 Materials and methods
2.1 Dataset
The local Institutional Review Board (IRB) approved this retrospective study, waiving the requirement for informed consent, in adherence to the principles of the Declaration of Helsinki. Data for model development were selected from head CTA images, with or without arterial stenosis, in the imaging database of our hospital between January 2016 and November 2019. The exclusion criteria included patients with intracranial aneurysms (IAs), arteriovenous malformation, arteriovenous fistula, Moyamoya disease, and poor image quality. Dataset for validation of the proposed model in real-world clinical scenarios, the inclusion criteria included head CTA images with at least one IA and without other arterial diseases, with digital subtraction angiography (DSA) verification from July 2019 to May 2020 in the same hospital. The flowchart of data acquisition, selection, and assignment were depicted in Figure 1.
FIGURE 1. Flowchart of data acquisition, selection, and assignment. CTA, computed tomography angiography; IAs, intracranial aneurysms.
Collectively, we included a total of 457 CTA examinations performed on GE Healthcare scanners for model development, which were randomly divided into training (n = 298), tuning (n = 65), and testing sets (n = 94). Furthermore, 96 CTA examinations with 117 aneurysms were included for the validation. The severity of artery stenosis was graded as follows (Zhang et al., 2015; Fu et al., 2023): mild stenosis (<50%), moderate stenosis (50%–70%), severe stenosis (>70%) and occlusive (100%). In case of multiple stenosis, the most severe stenosis was adopted.
2.2 CTA image acquisition, reconstruction and preprocessing
Standard head CT angiography examinations were acquired on axial section with post-processing reconstruction on sagittal, coronal, maximum intensity projection (MIP) and 3-dimensional volume rendered (3D-VR) views as necessary. All included CTA examinations were acquired on axial section with Discovery CT750 HD (GE Healthcare, Chicago, IL, United States) utilizing a slice thickness of 0.63 mm, a tube voltage of 100 kVp, and the effective tube current ranging between 2 and 3 mAs. It is worth noting that DSA images are auxiliary data serving as a reference for enrollment of IAs and are not fed into the proposed model.
For image preprocessing, normalization, spatial resampling, and extraction of binary vessel masks were performed. Z-score normalization was applied to ensure the uniformity in pixel values across all images, creating a standardized input for the DL model. Additionally, spatial resampling was employed to achieve a consistent resolution of 1 × 1 × 1 mm, promoting uniformity in spatial dimensions throughout the dataset. These preprocessing steps were pipelined to ensure the reproducibility of this study. Finally, we generated 3D binary vessel masks by using a simplified U-Net architecture and the entire 3D binary vessel masks were fed into the DL model. The simplified network took CTA images as input, and the sigmoid activation function transformed the feature maps into probability maps. Subsequently, a threshold of 0.5 was usually applied to distinguish between vessel and non-vessel regions (Ma et al., 2021). Notably, the imaging data of DL development and the clinical experiment had equal scan and preprocessing protocols in order to avoid the introduction of bias.
2.3 CTA image annotations
Manual labeling of 42 arterial segments was used as the reference standard to develop the algorithm and evaluate the performance of the proposed model. Trained annotators labeled 42 arterial segments according to the anatomical segments of cerebral vessels (Bouthillier, van Loveren, and Keller, 1996; Yavagal and Haussen, 2011; Harrigan and Deveikis, 2018): including 7 segments of the ICA (C1-C7), 3 segments of the middle cerebral artery (MCA: M1, M2, M3-4), 4 segments of the anterior cerebral artery (ACA: A1, A2, A3, and A4-5), 3 segments of the posterior cerebral artery (PCA: P1, P2, and P3-4), 2 segments of the intracranial vertebral artery (VA: V3 and V4), as well as the posterior communicating artery (PCoA), anterior communicating artery (ACoA), and the basilar artery (BA).
Furthermore, the IAs were also manually labeled. The labeled results were confirmed by two specialized radiologists with 10, 11 years of experience. Disagreements of the two radiologists were arbitrated by a third specialized neuroradiologist. All the annotation was performed using software (3D Slicer Version 4.10.1; https://www.slicer.org).
2.4 Model training for automated labeling of the intracranial arteries
A 3-dimensional convolutional neural network (CNN) was devised based on the U-Net architecture in this study (Figure 2). Specifically, the network comprised the encoding and decoding paths. Traditional convolution block was replaced by Squeeze-and-Excitation Residual (SE-Res) block, which consists of k cascading 3 × 3×3 convolutional layers of n channels involved with the group normalization (GN) and rectified linear units (ReLU) and k cascading 3 × 3×3 convolutional layers of n channels involved with the GN and SE. For better convergence, each block incorporated a skip connection with a 1 × 1×1 convolutional layer to reduce the number of channels. The encoder and decoder paths were augmented with the SE-Res blocks by means of channel-wise attention mechanisms, stable training as the depth of the network increased and adaptive information extraction of the feature map (Wang et al., 2021). In addition, the number of channels (n) was doubled after max pooling and was halved after transpose convolution. Furthermore, the deep supervision module was utilized in the decoder for faster convergence and better performance of the network via more direct learning process of the hidden layers (Lee et al., 2015). It included extra auxiliary branches at different stages of the decoder, allowing for the extraction of feature maps at various resolutions. The combination of SE-Res blocks and the deep supervision branch contributed to the model’s performance to capture both local and global contextual information. Ultimately, feature channels of the encoder were concatenated with the corresponding tensors of the decoder to merge advanced semantic information with low-level positional information.
FIGURE 2. Architecture of the 3D network model for cerebral artery labeling. The proposed labeling model has an encoder-decoder architecture as popular U-Net, and the network takes in an input of preprocessed vessel masks and outputs the predicted probability of class for each voxel. The SE-Res block and deep supervision module are used to achieve better labeling performance of the network. 3D network, 3-dimensional network; GN, group normalization; ReLU, rectified linear units; SE-Res block, squeeze, and excitation-residual block.
During the training phase, CTA images with healthy vessels or stenosis after preprocessing were randomly cropped to 128 × 256×256 pixels and then fed into the vascular labeling model. The sliding window technique was applied to handle the volume during inference time. The network was trained using compound loss function with Dice loss and cross entropy, which is robust on highly imbalanced segmentation tasks. The Adam optimizer with a learning rate of 0.001 and decay rate of 0.98 was utilized to optimize the objective function. The batch size was set to 1 due to the limitation of memory and the number of training epochs was 300. Data augmentations such as random cropping, scaling, rotation, and elastic transformation were applied to CTA scans for learning inherent features and avoiding the overfitting problem. The training was implemented by using the Keras library of the Tensorflow backend on the workstation with a single V100 NVIDIA GPU.
2.5 Evaluation metrics of arterial labeling
Labeling performance is evaluated by determining three metrics: (1) Dice similarity coefficient (DSC), (2) Mean surface distance (MSD; [mm]) and (3) Hausdorff distance (HD; [mm]). The DSC ranges from 0 to 1, where value of 1 indicates high similarity. For MSD and HD, low values indicate high similarity. These three metrics are defined as follows (Hameeteman et al., 2011; Benkarim et al., 2021):
Where G is the cerebral artery region in the ground truth and P is the predicted result.
Where S (.) denotes the set of surface voxels.
2.6 Clinical experiment
Besides the performance of model on voxel-wise segmentation, clinical utility of the model was validated in a real-world clinical scenario for IA localization. A total of 5 clinicians (W.Y., J.L., S.M.G., D.C.W., and J.J.) with different experience levels (5,3,3,1, and 1 year, respectively) participated as readers in the diagnostic accuracy study and identified the precise location of IA without and with algorithm assistance. The clinicians were blinded to clinical histories and read independently in a diagnostic reading room by software (3D Slicer). Following a washout period of 14 days, the examinations were interpreted again by the same corresponding clinician (if the first read was with aid of the algorithm, the second read was without algorithm assistance, and vice versa). Additionally, clinicians were provided with the model’s predictions in the form of 42 segments of vessels only when reading with algorithm assistance. Given the model prediction, readers took it into consideration or disregard it based on clinical judgment.
2.7 Statistical analysis
For comparison of labeling performance between the left and right vessels as well as the normal and stenotic vessels, the Wilcoxon signed rank test and Kruskal Wallis test were implemented, respectively. The proposed algorithm was assessed on CTA images with IAs by computing accuracy, interpretation time, and the interrater agreement of clinicians. The Wilcoxon signed rank test was used to assess differences in accuracy and average time of the clinicians with and without algorithm assistance. Furthermore, to investigate whether differences in their years of experience might impact the model’s usability and performance, five doctors with varying levels of experience were divided into two groups, i.e., high-level group of three doctors (5,3, and 3 years) and primary-level group of two doctors (both 1 year). The Kruskal Wallis test was implemented for comparison of the improved performance on identifying the precise location of IA between the high-level group and primary-level group. Considering over two readers and labels with no ranking or ordering, the interrater agreement of clinicians was determined using Fleiss’ Kappa (Fleiss and Cohen, 1973). To confirm whether model augmentation improved interrater agreement, the permutation test was performed on the difference between Fleiss’ Kappa of clinicians with and without model augmentation. The permutation procedure was repeated 10000 times to yield the null distribution of the Fleiss’ Kappa difference and the p-value was calculated as the proportion of the Fleiss’ Kappa differences that were higher than the observed Fleiss’ Kappa difference. A two-sided p-value less than 0.05 was considered statistically significant. Statistical analysis was conducted with IBM SPSS Statistics 23 (Armonk, New York) and Python 3.8 (Wilmington, Delaware).
3 Results
3.1 Patient and intracranial aneurysm characteristics
A total of 457 examinations (mean age, 51 years ±14 [standard deviation]; 209 female, 45.7%) and 96 examinations (mean age, 57 years ±10; 56 female, 58.3%) with 117 IAs (mean size, 5.1 mm ± 3.6) were used for the model development and the validation of its clinical utility. Table 1 shows the baseline characteristics of data set for labeling model development and Table 2 shows that of internal validation set for IAs localization.
3.2 Labeling performance of the model
The 3D network model for labeling of 42 arterial segments achieves promising performance in the testing set (94 cases) (Table 3). The 3D visualization of manual and CNN-automated labeling for 42 segments of vessels for two cases was shown in Figure 3. Overall, the model performs remarkably in labeling of 42 segments with the mean DSC of 0.88, MSD of 0.82 mm and HD of 6.59 mm. ICA consisting of 7 detailed segments obtained excellent results with DSCs ranging from 0.78 to 0.96, MSDs ranging from 0.24 mm to 0.60 mm, and HDs ranging from 1.67 mm to 4.70 mm Evaluation metrics show a decrease in labeling performance of MCA (M3-4), ACoA and PCoA. The evaluation metrics of large arteries were also calculated and described in Supplementary Table S1.
FIGURE 3. Visualization of manual and automated labeling for typical large vessels of two cases. For case 1 with a DSC of 0.85, MSD of 1.25, and HD of 10.71, the first raw provides the ground truth of detailed segments for the whole cerebral vessels (A), ICA (A1), ACA (A2), MCA (A3) and VA with BA (A4), which are displayed from left to right column. The second row represents the corresponding labeling results of the model, i.e., the whole cerebral vessels (B), ICA (B1), ACA (B2), MCA (B3) and VA with BA (B4). Similarly, for case 2 with a DSC of 0.88, MSD of 0.82, and HD of 5.91, the third raw provides the ground truth of detailed segments for the whole cerebral vessels (C), ICA (C1), ACA (C2), MCA (C3) and VA with BA (C4). The fourth row represents the corresponding labeling results of the model, i.e., the whole cerebral vessels (D), ICA (D1), ACA (D2), MCA (D3) and VA with BA (D4). ACA, anterior cerebral artery; BA, basilar artery; DSC, dice similarity coefficient; HD, Hausdorff distance; ICA, internal carotid artery; MCA, middle cerebral artery; MSD, mean surface distance; VA, vertebral artery.
Besides, the statistical differences between the left and right labeling metrics of vessels were analyzed and the results are presented visually in Supplementary Figure S1 as a violin plot. Clearly, right ICA has a higher DSC value of 0.91 compared to that of 0.90 of the left (p < 0.001). It is noticeable that model performance of left PCoA outperforms that of the right in terms of MSD and HD corresponding to 0.09 mm vs. 0.10 mm (p = 0.005) and 1.15 mm vs.1.40 mm (p = 0.017). Regarding the differences of labeling performance on normal and stenotic vessels, healthy vessels have better labeling results as depicted in Figure 4, particularly for DSC (0.91 vs. 0.89; p = 0.047) and HD (4.75 mm vs. 6.19 mm; p = 0.028).
FIGURE 4. Comparison of the labeling performance on the normal and stenotic vessels. Violin plots are used to show the distribution of three metrics and visualize the statistic results. *p < 0.05; **p < 0.01. DSC, dice similarity coefficient; MSD, mean surface distance; HD, Hausdorff distance.
3.3 Clinical performance on precise IAs localization
Performance improvements in terms of accuracy, interpretation time per case and interrater agreement across clinicians when determining the precise location of IA are reported in Table 4, and individual clinician improvement is detailed in Figure 5. More precisely, clinicians achieved a mean accuracy of 0.82 (95% confidence interval (CI), 0.79–0.86) with model augmentation and there was a statistically significant increase in the mean accuracy (0.04; 95% CI, 0.01 to 0.08; p = 0.003). Additionally, the mean time per case across clinicians was 14.14 s (95% CI, 10.03–18.25 s) with assistance and the time to diagnosis was significantly lower (difference, −9.76 s; 95% CI, −17.06 to −2.45 s; p < 0.001) compared to that of clinicians without assistance. For the clinicians, there was a significant increase of 0.07 (p = 0.029) in their interrater agreement, with a Fleiss’ Kappa of 0.59 without assistance and 0.66 with assistance. Individual performances with and without algorithm assistance were shown in Supplementary Table S2. Comparison of clinical performance of clinicians with primary-level and high-level experience when interpreting the precise location of IA was provided in Supplementary Table S3, which indicates that the accuracy is not affected by the doctor‘s experience but interpretation time of primary-level clinicians is enhanced better (difference, −14.39 s; 95% CI, −17.18 to −11.60; p < 0.001) compared to that of high-level clinicians. Figure 6 depicts two examples of aneurysms located on R-C5 and R-C6 in the validation dataset.
TABLE 4. Clinical performance with and without algorithm assistance to predict precise location of IAs in internal validation cohort.
FIGURE 5. Change in individual clinicians’ performance metric. Horizontal lines depict the change in performance metric for each clinician with and without model assistance. The orange dot represents performance without model, and the blue dot represents performance with model assistance.
FIGURE 6. Examples of aneurysms in the validation dataset. The first raw presents CTA scan (A), binary mask (B), and the labelled arteries (C) with IA located on R-C5 of ICA, which are displayed from left to right column. Similarly, the second raw depicts CTA scan (D), binary mask (E), and the labelled arteries (F) with IA located on R-C6. CTA, computed tomography angiography; IA, intracranial aneurysm; ICA, internal carotid artery; R, right.
4 Discussion
In this study, we developed a DL model firstly based on the CTA scans to automatically label intracranial arteries with 42 anatomical segments with the largest dataset. The proposed model exhibited notable labeling performance with the mean DSC of 0.88, MSD of 0.82 mm and HD of 6.59 mm. Furthermore, the model demonstrated superior labeling performance in healthy subjects compared to patients with stenosis (DSC: 0.91 vs. 0.89, p < 0.05; HD: 4.75 vs. 6.19, p < 0.05). Additionally, a clinically relevant set for the localization of IA was used to assess the model’s clinical utility and results showed that clinicians with model predictions achieved significant improvements when interpreting the precise location of IA.
Previous studies involve mainly atlas construction-based matching of arterial branches (e.g., UBA167) and sophisticated graph-based function processing for labeling of major intracranial arteries on MRA images (e.g., ICA, MCA, and ACA) with overall accuracies ranging from 93% to 96% or F1 score around 0.85 (Dunas et al., 2016; Robben et al., 2016; Dunas et al., 2017). However, these researches neglect the detailed anatomical segments of arteries and suffer from the common limitations due to the relatively small datasets and the subjects without pathological variability, which may have ramifications on the robustness and the clinical utility of algorithm performance. A network has been recently adopted to obtain the exhaustive anatomical classification of the 62 cerebral branches with accuracies ranging from 73% to 100% and F1 scores ranging from 0.67 to 0.99, whereas quantified geometric vessel features are required in advance and only the healthy subjects are utilized for this model development (Hong et al., 2023). Besides, 24 classes of arterial segments have been distinguished via a multi-scale U-Net architecture with macro F1score of 0.89 and balanced class accuracy of 0.83 in labeling detailed segments. However, it considers only the large artery rather than detailed segments of ICA whereas stenosis and aneurysms frequently occur (Hilbert et al., 2022). In this paper, we leveraged voxel-wise segmentation indices instead of classification indices, i.e., DSC, MSD and HD, to evaluate the performance of labeling model, given the anatomical morphology and 3D properties of vessels. The proposed model achieved the overall DSC of 0.88, MSD of 0.82 mm and HD of 6.59 mm, demonstrating superior results compared with the conventional segmentation of carotid lumens and the whole cerebral vessels (Hemmati et al., 2017; Chen et al., 2021; Guo et al., 2021; Huang, Wang, and Li, 2023). Besides, we found that labeling performance of MCA, ACoA and PCoA seemed to decline in line with prior researches (Dunas et al., 2017; Hilbert et al., 2022; Hong et al., 2023). This may attribute to the small diameter of distal MCA (i.e., M3-4, See Table 3) and the potential inter-individual variation of vessels.
As reported in many experiments, the DSC metric cannot fully express the performance of vascular segmentation (Li et al., 2020). Because minimal changes may lead to low DSC in case of small volumes and the DSC would remain high regardless of critical errors in relatively large volumes. Consequently, we also took the spatial distance-based metrics (i.e., MSD and HD) to evaluate the surface coincidence and the segmentation quality of outliers, respectively. In Supplementary Figure S1, our result suggested that there were significant differences between left and right ICA (DSC: 0.90 vs. 0.91, p < 0.01) and PCoA (MSD: 0.09 vs. 0.10, p < 0.01; HD: 1.15 vs. 1.40, p < 0.05). In terms of DSC metric vulnerable to the variation of vascular shape, the significant difference of left and right ICA may be attributed to the asymmetrical nature per se (about 6%) and relatively high incidence of arterial stenosis compared to other arteries (Mujagic et al., 2016). For PCoA, the meta-analysis indicates the prevalence of PCoA hypoplasia or aplasia is almost up to 43%, which is likely to be the primary factor that leads to the difference of labeling performance in terms of distance-based metrics (Jones et al., 2021). Furthermore, we achieved a higher level of labeling performance in healthy controls compared with that of patients with stenosis (See Figure 4), providing the evidence that pathological variations of cerebral vasculature results in the lower prediction performance (DSC: 0.91 vs. 0.89, p < 0.05; HD: 4.75 vs. 6.19, p < 0.05). Patients with different level of stenosis often showed a lack of vascular volume, changes in vascular surface texture, and even partial cerebrovascular loss by means of observing CTA scans, which seems to account for our finding.
We designed a validation process to simulate the clinical scenario of precise IA localization since the location of aneurysm is critical for the growth and rupture risk, clinical decision, and outcome evaluation (Investigators et al., 2012; Thompson et al., 2015). In our study, with model augmentation, the mean accuracy, time to diagnosis and interrater agreement of aneurysm localization across clinicians significantly improved, suggesting that the proposed algorithm seems to assist clinicians with varying level of experience in higher efficiency of diagnosis, more accurate and more consistent clinical interpretations. Additionally, the proposed model has great potential in multiple clinical application aspects. It enables stenosis localization and the automatic quantification of specific segments of blood vessels such as arterial diameter, volume, cross-sectional area (narrowing grade), curvature index, even hemodynamic parameters, thus providing additional guidance for future research and treatment of cerebrovascular diseases.
There exist several limitations. First, this study was conducted on data from a single institution. Hence, the generalizability of the algorithm entails further assessment on multicentric external data and there are challenges in identifying precise location of other vascular lesion such as arteriovenous malformation. Second, the model’s labeling performance on vascular segments for small diameter (e.g., distal MCA) and high incidence variation (e.g., PCoA) was slightly weakened. The model may be matured if self-attention mechanism is incorporated by learning rich hierarchical representations of curvilinear structures (Mou et al., 2021). Also, since we focused on the cerebral vasculature in CTA images, the model’s performance on other imaging modalities remains unknown.
5 Conclusion
The precise anatomical labeling of intracranial arteries is a fundamental step in automated diagnosis and decision-making processes for various arterial diseases, and it remains challenging despite considerable research efforts. We developed a powerful DL model to automatically label 42 intracranial arteries segments on CTA images, demonstrating superiority over existing models. Additionally, a significant improvement in clinicians’ performance to precisely locate IAs was observed when assisted by proposed model. This research represents an initial stride towards a more comprehensive evaluation of labeling algorithms and underscores the immense potential of such advancements in the field of computer-aided medicine.
Data availability statement
The raw data supporting the conclusion of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the Institutional Review Board (IRB) of Beijing Tiantan Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’; legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
TC: Formal Analysis, Methodology, Project administration, Validation, Writing–original draft. WeY: Conceptualization, Formal Analysis, Methodology, Project administration, Writing–original draft. LZ: Formal Analysis, Methodology, Software, Validation, Writing–review and editing. WaY: Formal Analysis, Methodology, Validation, Writing–review and editing. JF: Conceptualization, Investigation, Methodology, Supervision, Writing–review and editing. JnL: Data curation, Methodology, Writing–review and editing. JaL: Data curation, Project administration, Writing–review and editing. YT: Data curation, Project administration, Writing–review and editing. DW: Formal Analysis, Methodology, Software, Writing–review and editing, Data curation. SiG: Data curation, Formal Analysis, Methodology, Writing–review and editing. JJ: Data curation, Formal Analysis, Methodology, Writing–review and editing. ZW: Data curation, Formal Analysis, Methodology, Writing–review and editing. YW: Data curation, Investigation, Project administration, Writing–review and editing. QZ: Methodology, Writing–review and editing. YZ: Project administration, Data curation, Writing-original graft. JQ: Conceptualization, Supervision, Writing–review and editing. CL: Conceptualization, Supervision, Writing–review and editing. YJ: Conceptualization, Investigation, Supervision, Writing–review and editing. XZ: Conceptualization, Supervision, Writing–review and editing. YL: Conceptualization, Investigation, Methodology, Project administration, Supervision, Writing–review and editing. ShG: Conceptualization, Data curation, Methodology, Supervision, Writing–review and editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study has been funded by Natural Science Foundation of Beijing Municipality (Beijing Natural Science Foundation) (No. M22007) and National Natural Science Foundation of China (NSFC) (Nos 8217050951 and 62171300).
Acknowledgments
We would like to express our heartfelt gratitude to Drs. Yongjun Wang, Yaou Liu, and Tao Wang of Beijing Tiantan Hospital, along with Xia Meng, Yong Jiang, Hao Li, Min Hu, and Fei Wang from the China National Clinical Research Center for Neurological Diseases. Their invaluable contributions in initiating this project, shaping the study design, and collecting DICOM data have been instrumental. Further appreciation is extended to Drs. Siqiang Liu, Ximin Chen, Rong Feng, Miao Liu, and Weixia Li from the China National Clinical Research Center for Neurological Diseases for their meticulous work in manually segmenting the aneurysms. We also wish to acknowledge Drs. Q Z, Longkai Liang, and Pan Liu from the same institution for their expertise in algorithm development, without which this research would not have reached its full potential. We extend our gratitude to everyone who contributed to the successful publication of this work.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2023.1310357/full#supplementary-material
References
Benkarim O., Paquola C., Park B. Y., Hong S. J., Royer J., Vos de Wael R., et al. (2021). Connectivity alterations in autism reflect functional idiosyncrasy. Commun. Biol. 4 (1), 1078. doi:10.1038/s42003-021-02572-6
Bortsova G., Bos D., Dubost F., Vernooij M. W., Ikram M. K., van Tulder G., et al. (2021). Automated segmentation and volume measurement of intracranial internal carotid artery calcification at noncontrast CT. Radiol. Artif. Intell. 3 (5), e200226. doi:10.1148/ryai.2021200226
Bouthillier A., van Loveren H. R., Keller J. T. (1996). Segments of the internal carotid artery: a new classification. Neurosurgery 38 (3), 425–432. doi:10.1097/00006123-199603000-00001
Chen Y., Fan S., Chen Y., Che C., Cao X., He X., et al. (2021). Vessel segmentation from volumetric images: a multi-scale double-pathway network with class-balanced loss at the voxel level. Med. Phys. 48 (7), 3804–3814. doi:10.1002/mp.14934
Dunas T., Wahlin A., Ambarki K., Zarrinkoob L., Birgander R., Malm J., et al. (2016). Automatic labeling of cerebral arteries in magnetic resonance angiography. MAGMA 29 (1), 39–47. doi:10.1007/s10334-015-0512-5
Dunas T., Wahlin A., Ambarki K., Zarrinkoob L., Malm J., Eklund A. (2017). A stereotactic probabilistic atlas for the major cerebral arteries. Neuroinformatics 15 (1), 101–110. doi:10.1007/s12021-016-9320-y
Fleiss J. L., Cohen J. (1973). The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educ. Psychol. Meas. 33 (3), 613–619. doi:10.1177/001316447303300309
Fu F., Shan Y., Yang G., Zheng C., Zhang M., Rong D., et al. (2023). Deep learning for head and neck CT angiography: stenosis and plaque classification. Radiology 307, 220996. doi:10.1148/radiol.220996
GBD 2015 Mortality and Causes of Death Collaborators (2016). Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980-2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet 388 (10053), 1459–1544. doi:10.1016/S0140-6736(16)31012-1
Groves L. A., VanBerlo B., Veinberg N., Alboog A., Peters T. M., Chen E. C. S. (2020). Automatic segmentation of the carotid artery and internal jugular vein from 2D ultrasound images for 3D vascular reconstruction. Int. J. Comput. Assist. Radiol. Surg. 15 (11), 1835–1846. doi:10.1007/s11548-020-02248-2
Guo X., Xiao R., Lu Y., Chen C., Yan F., Zhou K., et al. (2021). Cerebrovascular segmentation from TOF-MRA based on multiple-U-net with focal loss function. Comput. Methods Programs Biomed. 202, 105998. doi:10.1016/j.cmpb.2021.105998
Hameeteman K., Zuluaga M. A., Freiman M., Joskowicz L., Cuisenaire O., Valencia L. F., et al. (2011). Evaluation framework for carotid bifurcation lumen segmentation and stenosis grading. Med. Image Anal. 15 (4), 477–488. doi:10.1016/j.media.2011.02.004
Harrigan M. R., Deveikis J. P. (2018). Handbook of cerebrovascular disease and neurointerventional technique. Third Edition. Cham, Switzerland: Spinger. doi:10.1007/978-3-319-66779-9:33-76
Hemmati H. R., Alizadeh M., Kamali-Asl A., Shirani S. (2017). Semi-automated carotid lumen segmentation in computed tomography angiography images. J. Biomed. Res. 31 (6), 548–558. doi:10.7555/JBR.31.20160107
Hilbert A., Rieger J., Madai V. I., Akay E. M., Aydin O. U., Behland J., et al. (2022). Anatomical labeling of intracranial arteries with deep learning in patients with cerebrovascular disease. Front. Neurol. 13, 1000914. doi:10.3389/fneur.2022.1000914
Hong S. W., Song H. N., Choi J. U., Cho H. H., Baek I. Y., Lee J. E., et al. (2023). Automated in-depth cerebral arterial labelling using cerebrovascular vasculature reframing and deep neural networks. Sci. Rep. 13 (1), 3255. doi:10.1038/s41598-023-30234-6
Huang X., Wang J., Li Z. (2023). 3D carotid artery segmentation using shape-constrained active contours. Comput. Biol. Med. 153, 106530. doi:10.1016/j.compbiomed.2022.106530
Investigators U. J., Morita A., Kirino T., Hashi K., Aoki N., Fukuhara S., et al. (2012). The natural course of unruptured cerebral aneurysms in a Japanese cohort. N. Engl. J. Med. 366 (26), 2474–2482. doi:10.1056/NEJMoa1113260
Jin H., Geng J., Yin Y., Hu M., Yang G., Xiang S., et al. (2020). Fully automated intracranial aneurysm detection and segmentation from digital subtraction angiography series using an end-to-end spatiotemporal deep neural network. J. Neurointerv Surg. 12 (10), 1023–1027. doi:10.1136/neurintsurg-2020-015824
Jones J. D., Castanho P., Bazira P., Sanders K. (2021). Anatomical variations of the circle of Willis and their prevalence, with a focus on the posterior communicating artery: a literature review and meta-analysis. Clin. Anat. 34 (7), 978–990. doi:10.1002/ca.23662
Klimont M., Oronowicz-Jaskowiak A., Flieger M., Rzeszutek J., Juszkat R., Jonczyk-Potoczna K. (2020). Deep learning for cerebral angiography segmentation from non-contrast computed tomography. PLoS One 15 (7), e0237092. doi:10.1371/journal.pone.0237092
Koelemay M. J., Nederkoorn P. J., Reitsma J. B., Majoie C. B. (2004). Systematic review of computed tomographic angiography for assessment of carotid artery disease. Stroke 35 (10), 2306–2312. doi:10.1161/01.STR.0000141426.63959.cc
Lee C. Y., Xie S., Gallagher P., Zhang Z., Tu Z. (2015). “Deeply-supervised nets,” in Proceedings of the eighteenth international conference on artificial intelligence and statistics. San Diego, CA, USA: (PMLR).
Li N., Zhou S., Wu Z., Zhang B., Zhao G. (2020). Statistical modeling and knowledge-based segmentation of cerebral artery based on TOF-MRA and MR-T1. Comput. Methods Programs Biomed. 186, 105110. doi:10.1016/j.cmpb.2019.105110
Ma T., Zhang H., Ong H., Vora A., Nguyen T. D., Gupta A., et al. (2021). “Ensembling low precision models for binary biomedical image segmentation,” in 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 03-08 January 2021 (IEEE).
Mackey J., Brown R. D., Moomaw C. J., Sauerbeck L., Hornung R., Gandhi D., et al. (2012). Unruptured intracranial aneurysms in the familial intracranial aneurysm and international study of unruptured intracranial aneurysms cohorts: differences in multiplicity and location. J. Neurosurg. 117 (1), 60–64. doi:10.3171/2012.4.JNS111822
Manniesing R., Viergever M. A., van der Lugt A., Niessen W. J. (2008). Cerebral arteries: fully automated segmentation from CT angiography--a feasibility study. Radiology 247 (3), 841–846. doi:10.1148/radiol.2473070436
Mou L., Zhao Y., Fu H., Liu Y., Cheng J., Zheng Y., et al. (2021). CS(2)-Net: deep learning segmentation of curvilinear structures in medical imaging. Med. Image Anal. 67, 101874. doi:10.1016/j.media.2020.101874
Mubashar M., Ali H., Gronlund C., Azmat S. (2022). R2U++: a multiscale recurrent residual U-Net with dense skip connections for medical image segmentation. Neural Comput. Appl. 34 (20), 17723–17739. doi:10.1007/s00521-022-07419-7
Mujagic S., Kozic D., Huseinagic H., Smajlovic D. (2016). Symmetry, asymmetry and hypoplasia of the intracranial internal carotid artery on magnetic resonance angiography. Acta Med. Acad. 45 (1), 1–9. doi:10.5644/ama2006-124.150
Robben D., Turetken E., Sunaert S., Thijs V., Wilms G., Fua P., et al. (2016). Simultaneous segmentation and anatomical labeling of the cerebral vasculature. Med. Image Anal. 32, 201–215. doi:10.1016/j.media.2016.03.006
Thompson B. G., Brown R. D., Amin-Hanjani S., Broderick J. P., Cockroft K. M., Connolly E. S., et al. (2015). Guidelines for the management of patients with unruptured intracranial aneurysms: a guideline for Healthcare professionals from the American heart association/American stroke association. Stroke 46 (8), 2368–2400. doi:10.1161/STR.0000000000000070
Turan T. N., Makki A. A., Tsappidi S., Cotsonis G., Lynn M. J., Cloft H. J., et al. (2010). Risk factors associated with severity and location of intracranial arterial stenosis. Stroke 41 (8), 1636–1640. doi:10.1161/STROKEAHA.110.584672
Wang J., Lv P., Wang H., Shi C. (2021). SAR-U-Net: squeeze-and-excitation block and atrous spatial pyramid pooling based residual U-Net for automatic liver segmentation in Computed Tomography. Comput. Methods Programs Biomed. 208, 106268. doi:10.1016/j.cmpb.2021.106268
Yavagal D. R., Haussen D. C. (2011). Large artery revascularization. Contin. (Minneap Minn) 17, 1267–1292. doi:10.1212/01.CON.0000410035.26853.45
Keywords: computed tomography angiography, intracranial arteries, deep learning, anatomical labeling, intracranial aneurysm, arterial stenosis
Citation: Chen T, You W, Zhang L, Ye W, Feng J, Lu J, Lv J, Tang Y, Wei D, Gui S, Jiang J, Wang Z, Wang Y, Zhao Q, Zhang Y, Qu J, Li C, Jiang Y, Zhang X, Li Y and Guan S (2024) Automated anatomical labeling of the intracranial arteries via deep learning in computed tomography angiography. Front. Physiol. 14:1310357. doi: 10.3389/fphys.2023.1310357
Received: 09 October 2023; Accepted: 28 November 2023;
Published: 04 January 2024.
Edited by:
Kuanquan Wang, Harbin Institute of Technology, ChinaReviewed by:
Xiaoping Leng, Harbin Medical University, ChinaGiuseppe Baselli, Polytechnic University of Milan, Italy
Xiangyu Li, Harbin Institute of Technology, China
Copyright © 2024 Chen, You, Zhang, Ye, Feng, Lu, Lv, Tang, Wei, Gui, Jiang, Wang, Wang, Zhao, Zhang, Qu, Li, Jiang, Zhang, Li and Guan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Youxiang Li, bGl5b3V4aWFuZ0BtYWlsLmNjbXUuZWR1LmNu; Xu Zhang, emhhbmd4dUBjY211LmVkdS5jbg==
†These authors have contributed equally to this work and share first authorship