Deep learning for prediction of post-thrombectomy outcomes based on admission CT angiography in large vessel occlusion stroke

Sommer, Jakob; Dierksen, Fiona; Zeevi, Tal; Tran, Anh Tuan; Avery, Emily W.; Mak, Adrian; Malhotra, Ajay; Matouk, Charles C.; Falcone, Guido J.; Torres-Lopez, Victor; Aneja, Sanjey; Duncan, James; Sansing, Lauren H.; Sheth, Kevin N.; Payabvash, Seyedmehdi

doi:10.3389/frai.2024.1369702

ORIGINAL RESEARCH article

Front. Artif. Intell., 01 August 2024

Sec. Medicine and Public Health

Volume 7 - 2024 | https://doi.org/10.3389/frai.2024.1369702

This article is part of the Research Topic: Innovative Applications of Machine Learning and Cutting-Edge Tools for Stroke Prediction and Treatment Strategies

Jakob Sommer¹,²

Fiona Dierksen¹

Tal Zeevi¹,³

Anh Tuan Tran¹

Emily W. Avery¹,⁴

Adrian Mak¹,⁵

Victor Torres-Lopez⁷

James Duncan¹,³

Lauren H. Sansing⁸,¹⁰

Kevin N. Sheth⁷,⁸

Seyedmehdi Payabvash¹,⁸*

¹Section of Neuroradiology, Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, United States
²Institute of Clinical Pharmacology, University Hospital of RWTH Aachen, Aachen, Germany
³Department of Biomedical Engineering, Yale School of Engineering, New Haven, CT, United States
⁴Department of Radiology, University of California, San Diego, San Diego, CA, United States
⁵CLAIM - Charité Lab for Artificial Intelligence in Medicine, Charité Universitätsmedizin Berlin, Berlin, Germany
⁶Division of Neurovascular Surgery, Department of Neurosurgery, Yale University School of Medicine, New Haven, CT, United States
⁷Division of Neurocritical Care and Emergency Neurology, Department of Neurology, Yale University School of Medicine, New Haven, CT, United States
⁸Center for Brain and Mind Health, Yale University School of Medicine, New Haven, CT, United States
⁹Department of Radiation Oncology, Yale School of Medicine, New Haven, CT, United States
¹⁰Division of Stroke and Vascular Neurology, Department of Neurology, Yale University School of Medicine, New Haven, CT, United States

Purpose: Computed Tomography Angiography (CTA) is the first line of imaging in the diagnosis of Large Vessel Occlusion (LVO) strokes. We trained and independently validated end-to-end automated deep learning pipelines to predict 3-month outcomes after anterior circulation LVO thrombectomy based on admission CTAs.

Methods: We split a dataset of 591 patients into training/cross-validation (n = 496) and independent test set (n = 95). We trained separate models for outcome prediction based on admission “CTA” images alone, “CTA + Treatment” (including time to thrombectomy and reperfusion success information), and “CTA + Treatment + Clinical” (including admission age, sex, and NIH stroke scale). A binary (favorable) outcome was defined based on a 3-month modified Rankin Scale ≤ 2. The model was trained on our dataset based on the pre-trained ResNet-50 3D Convolutional Neural Network (“MedicalNet”) and included CTA preprocessing steps.

Results: We generated an ensemble model from the 5-fold cross-validation and tested it in the independent test cohort, with receiver operating characteristic area under the curve (AUC, 95% confidence interval) of 0.70 (0.59–0.81) for “CTA,” 0.79 (0.70–0.89) for “CTA + Treatment,” and 0.86 (0.79–0.94) for “CTA + Treatment + Clinical” input models. A “Treatment + Clinical” logistic regression model achieved an AUC of 0.86 (0.79–0.93).

Conclusion: Our results show the feasibility of an end-to-end automated model to predict outcomes from admission CTA and post-thrombectomy reperfusion success. Such a model can facilitate prognostication in telehealth transfer and when a thorough neurological exam is not feasible due to a language barrier or pre-existing morbidities.

1 Introduction

In current stroke guidelines, advanced imaging plays a crucial role in treatment triage for Large Vessel Occlusion (LVO). Among these modalities, Computed Tomography Angiography (CTA) is a pivotal tool. It serves not only to evaluate treatment eligibility but also to assess arterial collateral supply and to inform functional outcome prognosis. CTA has also been shown to be more sensitive than non-contrast Computed Tomography (CT) in detecting early signs of infarction (Camargo et al., 2007). Infarct cores determined on CTA images are strongly correlated with lesions defined on diffusion-weighted MRI (DWI) scans (Schramm et al., 2004). Recent studies have also revealed CTA’s potential to provide valuable information for long-term prognostication (van Seeters et al., 2015; Sallustio et al., 2017).

Recently, the emergence of artificial intelligence (AI) models has opened new possibilities for predicting long-term outcomes from baseline stroke imaging. These models can extract prognostic information directly from admission CTA scans. Unlike previous approaches that relied on manually engineered imaging biomarkers or complex preprocessing steps (Avery et al., 2022; Zhang et al., 2023), our approach takes the 3D CTA volume as input with minimal preprocessing in an end-to-end automated pipeline that generates long-term outcome predictions. Complex preprocessing carries the risk of information loss and of introducing unknown biases. Deep learning makes it possible to keep preprocessing to a minimum while preserving biomarkers that are currently unknown and not easily discernible visually.

In this study, we trained and tested separate deep learning models to predict 3-month outcomes after LVO thrombectomy from admission CTA scans with and without additional treatment and clinical variables. We compared the models’ performance in an independent test cohort and analyzed model biases. Such tools can facilitate objective prognostication of LVO stroke patients in the acute setting. Prognostication by the presented models is especially useful when a reliable neurological exam is unavailable, and it supports informed discussion of outcomes with patients and family members as well as the setting of long-term goals of stroke management.

2 Methods

2.1 Study design

The clinical and imaging information for this study was retrieved from the stroke registry of Yale New Haven Hospital, covering January 1, 2014, to October 31, 2020. Inclusion criteria were a baseline CTA scan with at least 1-mm thickness axial slices available, anterior circulation LVO, attempted mechanical thrombectomy, and available clinical outcome metrics. The 3-month follow-up modified Rankin Scale (mRS), or the mRS at the closest available interval to 3 months from stroke onset, was used to assess clinical outcome and was binarized to favorable (mRS ≤ 2) versus poor (mRS > 2). Exclusion criteria were suboptimal CTA scan quality due to motion degradation, metal artifacts, or scanner-related complications. We obtained approval from the institutional review board (IRB) for retrospective data collection. Informed consent was not sought from participants, as it was waived by the IRB. All procedures conducted during this study adhered to current institutional and national guidelines.

2.2 Image preprocessing and training parameters

All head CTAs were resized to a common image dimension of 128×128×128 voxels using a template and resampled to a common voxel spacing of 1.5×1.5×1.5 mm using trilinear co-registration. The original images had a median (interquartile) voxel spacing of 0.47 (0.43–0.50) mm, 0.47 (0.43–0.50) mm, and 0.64 (0.63–0.63) mm, for the x-, y-, and z-axis, respectively. The template, resizing, and resampling were performed by applying the methodology described by Sharrock et al. (2021) and Rorden et al. (2012) using the Python implementation of the open-source medical imaging package Advanced Normalization Tools (ANTs; Avants et al., 2011). No intensity scaling, image cropping, or voxel intensity clipping was applied.

We leveraged a pretrained ResNet-50 3D Convolutional Neural Network (CNN) model named “MedicalNet” (Chen et al., 2019), initially trained on CT and MRI images for segmentation of multiple organs in the 3DSeg-8 dataset of the MedicalNet developers. We applied the “MedicalNet” weights to the Medical Open Network for AI (MONAI) ResNet-50 configuration (Cardoso et al., 2022) and fine-tuned the pre-trained weights for binary classification in this study. For the training process, we partitioned the dataset with stratified splitting into a 5-fold training/cross-validation cohort (n = 496) and an independent test cohort (n = 95) and used a batch size of 6, a maximum of 300 epochs, a learning rate of 1×10⁻⁶, and a weight-decay regularization of 0 using Adam optimization.

To enhance the training process, we incorporated data augmentation techniques, encompassing rotations (with a 30% probability for each axis, within the range of −0.2 to 0.2 radians), zooms (with a 30% probability between 0.8 and 1.2 times zoom), flips (with a 20% probability), shear, and translations (with a 30% probability at 0.3). These augmented images were input into the ResNet model for each of the five cross-validation folds.
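The stochastic augmentation schedule described above can be illustrated with a minimal sketch that only samples which transforms would be applied to one training volume, and with what parameters. It uses the Python standard library and does not transform any image. The function name `sample_augmentation` and the dictionary keys are hypothetical, the choice of flip axis is an assumption, and reading “0.3” as the maximum shear and translation magnitude is also an assumption; this is not the study’s MONAI pipeline.

```python
import random


def sample_augmentation(rng):
    """Sample one augmentation plan as a dict of transform parameters."""
    plan = {}
    # Rotation: each axis is rotated independently with probability 0.3,
    # by an angle drawn uniformly from [-0.2, 0.2] radians.
    for axis in ("x", "y", "z"):
        if rng.random() < 0.3:
            plan["rotate_" + axis] = rng.uniform(-0.2, 0.2)
    # Zoom: applied with probability 0.3, factor between 0.8 and 1.2.
    if rng.random() < 0.3:
        plan["zoom"] = rng.uniform(0.8, 1.2)
    # Flip: applied with probability 0.2 (the flipped axis is an assumption).
    if rng.random() < 0.2:
        plan["flip_axis"] = rng.choice(("x", "y", "z"))
    # Shear and translation: applied with probability 0.3; treating 0.3 as
    # the maximum magnitude is an assumption about the source description.
    if rng.random() < 0.3:
        plan["shear"] = rng.uniform(-0.3, 0.3)
        plan["translate"] = rng.uniform(-0.3, 0.3)
    return plan


rng = random.Random(0)  # fixed seed so the sampled plans are reproducible
plans = [sample_augmentation(rng) for _ in range(1000)]
```

Because every transform is gated by its own probability, many training samples receive only a subset of the transforms, and some receive none.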

2.3 Training pipeline and model inputs – CTA images, “treatment,” and “clinical” variables

We trained, validated, and tested three separate models with the following inputs: (1) “CTA,” with only admission CTA scans as input; (2) “CTA + Treatment,” with admission CTA scans and “Treatment” variables, i.e., admission-to-scan and scan-to-puncture time gaps and post-thrombectomy reperfusion efficacy; and (3) “CTA + Treatment + Clinical,” with CTA, “Treatment,” and “Clinical” variables, i.e., admission NIH stroke scale (NIHSS), patient’s age, and sex. The post-thrombectomy reperfusion efficacy was determined based on the modified Thrombolysis in Cerebral Infarction (mTICI) system (Zaidat et al., 2013) as documented by neurointerventionalists. These values were converted into 0-to-4 ordinal variables.

For the training of our CTA-based model, we processed patient CTA images through a CNN using the preprocessing and training parameters described above. We selected the model with the lowest validation loss from each of the 5-fold cross-validation iterations, creating five distinct sub-models. These sub-models collectively determine the likelihood of a poor outcome by averaging their probability outputs. This approach mitigates the risk of overestimating the model’s performance on the test set. We then converted the final probability into a class prediction using a threshold optimized for accuracy on the validation set.
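The averaging and thresholding steps described above can be sketched as follows. This is an illustration under stated assumptions, not the study’s code: the function names and all numbers are hypothetical, label 1 denotes a poor outcome (mRS > 2), and candidate thresholds are simply taken from the observed validation probabilities.

```python
def ensemble_probability(fold_probs):
    """Average the poor-outcome probabilities produced by the sub-models."""
    return sum(fold_probs) / len(fold_probs)


def best_accuracy_threshold(probs, labels):
    """Return (threshold, accuracy) maximizing accuracy on the given data.

    A patient is classified as poor outcome (1) when prob >= threshold.
    Ties keep the lowest threshold that reached the best accuracy.
    """
    best_t, best_acc = 0.5, -1.0
    for t in sorted(set(probs)):
        preds = [1 if p >= t else 0 for p in probs]
        acc = sum(int(pr == y) for pr, y in zip(preds, labels)) / len(labels)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t, best_acc


# Hypothetical validation probabilities and labels (1 = poor outcome).
val_probs = [0.10, 0.35, 0.40, 0.62, 0.71, 0.90]
val_labels = [0, 0, 1, 0, 1, 1]
threshold, accuracy = best_accuracy_threshold(val_probs, val_labels)

# Hypothetical outputs of the five sub-models for one test patient.
patient_prob = ensemble_probability([0.58, 0.66, 0.71, 0.49, 0.61])
patient_class = int(patient_prob >= threshold)
```

In this toy example the selected threshold is 0.40, and the averaged probability of about 0.61 is classified as a poor outcome.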

Subsequently, the five sub-models collectively predicted the probability of poor outcomes for each patient within the test set. The Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) was then generated based on the mean probabilities per patient to assess the model performance. The whole process of training and testing is depicted in Figure 1.
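As a small worked example of the evaluation metric, the ROC AUC can be computed directly from per-patient mean probabilities as the fraction of (poor, favorable) patient pairs in which the patient with the poor outcome receives the higher probability, with ties counted as one half. The sketch below uses this pairwise definition with hypothetical data; the study’s own analysis may have used a library routine instead.

```python
def roc_auc(probs, labels):
    """ROC AUC via pairwise ranking of positives (1) against negatives (0)."""
    pos = [p for p, y in zip(probs, labels) if y == 1]
    neg = [p for p, y in zip(probs, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present to compute the AUC")
    score = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                score += 1.0   # positive ranked above negative
            elif p == n:
                score += 0.5   # tie counts as half
    return score / (len(pos) * len(neg))


# Hypothetical mean ensemble probabilities and labels (1 = poor outcome).
mean_probs = [0.20, 0.35, 0.55, 0.60, 0.80]
labels = [0, 0, 1, 0, 1]
auc = roc_auc(mean_probs, labels)
```

Here five of the six (poor, favorable) pairs are ranked correctly, so the AUC is 5/6.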

Figure 1. Visual description of the training, validation and testing process depending on the model input.

For the training of the ensemble model with multimodal input options, we first obtained the outcome probabilities for individual patients by passing the CTA images through the same CNN sub-models. Second, these probabilities were used alongside the other numerical input variables to train a logistic regression model on the validation set. The process was repeated for each cross-validation iteration, thereby creating five sub-models. Testing followed the “CTA” input pipeline: the sub-models collectively predicted the probability of poor outcome for each patient in the test set. The optimal threshold to convert the probabilities into a class prediction was determined on the validation set in the same manner as previously described.
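The stacking idea in this paragraph, in which the CNN output probability is treated as one more numerical feature of a logistic regression, can be sketched with a small gradient-descent fit. This is a hedged illustration in plain Python: the feature layout, the scaling of the reperfusion score to 0–1, the hyperparameters, and all data are assumptions, and the study reports using Scikit-learn rather than a hand-written optimizer.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(rows, labels, lr=0.5, epochs=5000):
    """Fit logistic regression by batch gradient descent; returns (weights, bias)."""
    n_features = len(rows[0])
    weights, bias = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for x, y in zip(rows, labels):
            # Prediction error for this patient under the current parameters.
            err = sigmoid(sum(w * v for w, v in zip(weights, x)) + bias) - y
            for j, v in enumerate(x):
                grad_w[j] += err * v
            grad_b += err
        weights = [w - lr * g / len(rows) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(rows)
    return weights, bias


def predict(weights, bias, x):
    """Probability of poor outcome for one feature vector."""
    return sigmoid(sum(w * v for w, v in zip(weights, x)) + bias)


# Hypothetical validation rows: [CNN probability of poor outcome,
# reperfusion score rescaled to 0-1]; label 1 = poor outcome.
rows = [[0.2, 1.0], [0.3, 0.75], [0.4, 1.0], [0.6, 0.5], [0.7, 0.25], [0.9, 0.0]]
labels = [0, 0, 0, 1, 1, 1]
weights, bias = fit_logistic(rows, labels)

low_risk = predict(weights, bias, [0.25, 1.0])   # low CNN risk, full reperfusion
high_risk = predict(weights, bias, [0.85, 0.0])  # high CNN risk, no reperfusion
```

One consequence of this design is that the image model is left unchanged when tabular variables are added or removed; only the small regression on top is refit.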

In addition to deep learning models, we trained separate logistic regression models for prediction of outcome based on “Treatment” and “Treatment + Clinical” inputs on the training/cross-validation cohort and tested their performance in the independent test cohort. Since hyperparameter optimization was not needed for logistic regression, we used the whole training/cross-validation cohort as input. Therefore, only one model was created for each of the “Treatment” and “Treatment + Clinical” inputs. We compared the ROC AUCs of the different models using the DeLong et al. (1988) method.

2.4 Visual verifications of model attention

We applied the M3d-CAM (Gotkowski et al., 2020) to generate attention maps to visualize the regions in the input CTA scans that influenced the model prediction the most, deduced from the 4th layer of the model based on the first cross-validation set. These attention maps improve the interpretability of model predictions for human eyes and highlight head CTA regions with the highest impact on classification decisions by the deep learning model.

2.5 Model bias analysis

We organized the patients within the test set based on the predictions made by each model in comparison to the ground truth. Specifically, we grouped patients who were incorrectly predicted to have either a favorable or unfavorable outcome versus those who were accurately predicted to have either a favorable or unfavorable outcome.
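The grouping step can be sketched as below, assuming each test patient is represented by a small record holding the 3-month mRS, the admission NIHSS, and the predicted probability of poor outcome. The field names, the 0.5 default threshold, and the five example patients are hypothetical.

```python
from statistics import median


def group_by_prediction(records, threshold=0.5):
    """Split patients into correctly and incorrectly predicted groups."""
    groups = {"correct": [], "incorrect": []}
    for rec in records:
        truth = 1 if rec["mrs"] > 2 else 0                 # poor outcome: mRS > 2
        pred = 1 if rec["prob_poor"] >= threshold else 0   # model's class call
        groups["correct" if pred == truth else "incorrect"].append(rec)
    return groups


# Hypothetical test-set records.
records = [
    {"id": "A", "mrs": 1, "nihss": 8, "prob_poor": 0.30},
    {"id": "B", "mrs": 2, "nihss": 10, "prob_poor": 0.65},
    {"id": "C", "mrs": 4, "nihss": 18, "prob_poor": 0.80},
    {"id": "D", "mrs": 5, "nihss": 21, "prob_poor": 0.72},
    {"id": "E", "mrs": 0, "nihss": 6, "prob_poor": 0.55},
]
groups = group_by_prediction(records)

# Compare a clinical characteristic between the two groups.
median_nihss = {name: median(r["nihss"] for r in recs) for name, recs in groups.items()}
```

In this toy data the misclassified patients (B and E) both had favorable outcomes and lower NIHSS, which is the pattern such a comparison is designed to surface.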

2.6 Code and libraries

The analyses were conducted with Python 3.10.8, Visual Studio Code 1.72, R 4.3.1, and RStudio Server 2023.06.2. Key Python modules included ANTs 0.3.8, MONAI 1.1.0, PyTorch and PyTorch-Lightning 2.0.0, Scikit-learn 1.2.2, and M3d-CAM.

3 Results

3.1 Patients characteristics

A total of 591 patients were included in our analysis. The average age of patients was 70.2 ± 15.0 years, 322 (54%) were male, the median (interquartile) admission NIHSS was 14 (10–19), the average onset-to-CTA time was 5.4 ± 5.5 h, and the average onset-to-catheterization time was 7.1 ± 5.0 h. Table 1 summarizes the patients’ characteristics in the training/cross-validation (n = 496) and independent test (n = 95) cohorts. There was no significant difference between the clinical characteristics of these two cohorts.

Table 1. Patients’ characteristics in training/cross-validation versus independent test cohorts.

3.2 Model performance

Figure 2 depicts the AUC and loss function through 5-fold training and cross-validation. The final models with “CTA,” “CTA + Treatment,” “CTA + Treatment + Clinical,” “Treatment,” and “Treatment + Clinical” inputs achieved AUCs (95% confidence interval) of 0.70 (0.59–0.81), 0.79 (0.70–0.89), 0.86 (0.79–0.94), 0.73 (0.61–0.85), and 0.86 (0.79–0.93) in the independent test cohort, respectively. The ROC curves of the “CTA,” “CTA + Treatment,” and “CTA + Treatment + Clinical” models are depicted in Figure 3. There was no significant difference in AUC between the “CTA + Treatment” and “Treatment” models (p = 0.32), the “CTA + Treatment” and “Treatment + Clinical” models (p = 0.23), or the “CTA + Treatment + Clinical” and “Treatment + Clinical” models (p = 0.86). The attention maps projected over 15-mm-thick Maximum Intensity Projection slices of CTA scans reveal the brain areas with the highest attention across the input image (Figure 4).

Figure 2. Training and validation AUC and losses throughout the steps in the five-fold cross-validation process.

Figure 3. ROC curves of the mean probabilities across all folds for the “CTA”, “CTA + Treatment” and “CTA + Treatment + Clinical” models in the test set, including their respective confidence intervals.

Figure 4. Maximum intensity projection (MIP) on a class activation map of three example patients in our cohort and a 3D visualization of each patient’s cerebral vessels. For each MIP, only the upper half of the activation range between the maximum and minimum activation is displayed. The area of occlusion is marked with a red arrow in the 3D visualization. Patient 1: probability for poor outcome = 0.67, label = 1; Patient 2: probability for poor outcome = 0.35, label = 0; Patient 3: probability for poor outcome = 0.78, label = 1.

3.3 Model bias analysis

The comparison of 74 patients with correct prediction versus 21 with incorrect prediction of outcome is summarized in Table 2. Overall, patients with incorrect prediction had better outcomes (lower mRS), less severe baseline neurological symptoms (lower NIHSS), younger age, and higher post-thrombectomy reperfusion scores compared to those with correct prediction. This suggests that the prediction model is biased toward overestimation of poor outcomes.

Table 2. Model bias analysis comparing the characteristics of patients with correct versus incorrect prediction by “CTA + Treatment” input model.

4 Discussion

Our study demonstrates that by utilizing admission CTA scans, an automated model can accurately predict 3-month outcomes after thrombectomy in patients with LVO stroke. The end-to-end pipeline, which involves the preprocessing of admission CTAs, can be smoothly integrated into the clinical workflow. Furthermore, although not achieving statistical significance, we showed that the addition of post-thrombectomy reperfusion can increase the AUC of model predictions in the independent test cohort. The inclusion of clinical predictors such as age, sex, and neurological severity alongside CTA data did not significantly enhance model performance. Hence, CTA-based models can be especially useful in situations where clinical information is unreliable, such as in tele-stroke settings, language barriers, or when pre-existing morbidities make neurological exams challenging. It is worth noting that CTA scans are the de facto first line of imaging used to screen and diagnose LVO stroke, meaning that images are likely available before a complete clinical exam. Thus, a model that incorporates CTA data and accounts for reperfusion success can provide valuable prognostic information prior to the initial neurological exam.

Recent clinical trials (RESCUE–Japan LIMIT, SELECT-2, ANGEL-ASPECT, and TENSION) have demonstrated improved functional outcomes after thrombectomy, even among patients with a large infarct core and anterior circulation LVO (Yoshimura et al., 2022; Bendszus et al., 2023; Huo et al., 2023; Sarraj et al., 2023). In this context, the focus of stroke imaging workflow should shift toward identifying patients with LVO and thus eligible for thrombectomy (regardless of infarct core estimates). Subsequently, it should also identify patients at higher risk of poor outcomes despite thrombectomy – or clinically ineffective reperfusion. These patients are potential candidates for post-thrombectomy treatments, such as neuroprotective or neuroregenerative therapies. Deep learning models, such as the one developed and validated in this study, can provide this critical information.

Other groups have also applied deep learning models for the prognostication of stroke. One research group applied a “Structured Receptive Field Neural Network” (RFNN), a variant of the ResNet model, with emphasis on the advantages of RFNN models in the context of smaller training datasets (Hilbert et al., 2019). They reported an average AUC of 0.71 for their best-performing model. Despite the complexity of their model and its presumable advantage in training with small datasets, their results did not exceed ours. Notably, their model generated a single 2D image from CTA scans for the prediction task. Their group also provided no comparison with models containing admission clinical information (Hilbert et al., 2019).

Another study employed a specialized model architecture called a “Siamese network” to focus on hemispheric asymmetries (Oliveira et al., 2023). This approach uses a more complex model structure, but it has the advantage that images of varying quality can be assessed more uniformly. This model type uses Non-Contrast CT (NCCT) images as the main input. The CTA images were only used to predict the presence or absence of stroke to use as one of the clinical inputs. For this, they were compressed to a 2D image using MIP (Maximum Intensity Projection). In the highest-performing version of this model, the research group achieved an AUC of 0.74 in a test set of 60 patients. Notably, the group included patients both with and without LVO. Because patients with LVO are far more likely to have worse outcomes (Smith et al., 2006), the model could have differentiated between those with and without LVO. In contrast, we only included patients with LVO, yielding a more clinically homogeneous patient cohort.

Furthermore, in another study the researchers extracted radiomics features from middle cerebral artery (MCA) regions of CTA scans. These were used for the prediction of 3-month outcomes in patients with LVO stroke (Avery et al., 2022). Models with “Radiomics,” “Radiomics + Treatment,” and “Radiomics + Treatment + Clinical” inputs achieved AUCs of 0.68, 0.74, and 0.82 in the independent test cohort, respectively (Avery et al., 2022).

One of the strengths of our research is that we created a fully automated end-to-end preprocessing and prediction pipeline for anterior LVO strokes. Compared to the previously mentioned works, our preprocessing pipeline includes fewer steps and yet achieves the same or better results. For example, we avoided skull removal techniques for CTA images due to artifact risk, we used a pre-trained network to improve classification, and we refrained from using MIP preprocessed images as input to minimize data loss. The advantage of this is that we can reduce the error potential and bias of our algorithm. The ensemble model structure, in which several individually trained versions come to a joint decision, has not yet been reported for this task. Not only does it minimize the risk of overfitting, but it also allows the flexible addition of other clinical variables without any negative impact on the performance of the image analysis. In our analysis, we compared how the model with clinical variables performed in contrast to the model without clinical variables. Another novelty of our method is the inclusion of thrombectomy success as a prognostic input. This variable can provide the best versus worst-case scenario prediction for complete reperfusion versus thrombectomy failure. The “CTA + Treatment” model holds great potential for prognostication in acute stroke settings, as it can provide a wide range of outcome predictions, from the least to the most favorable treatment results based on the presumed lowest to highest mTICI reperfusion. This approach enables the model to estimate the probabilities of 3-month outcomes based on potential thrombectomy success.

It is worth noting that the model can also rely solely on CTA scans, which are almost always available at the time of LVO diagnosis. Therefore, a model based on imaging information can provide rapid and objective predictions regardless of local expertise and inter-examiner variability.

In our work, we also compared ensemble models using different variables with each other, which allowed a comprehensive assessment of different input strategies. For example, we wanted to simulate situations in which the determination of NIHSS is not possible for various reasons (e.g., tele-stroke setting, pre-existing motor deficit due to musculoskeletal degenerative disease, or language barrier). Although the combination of age, NIHSS, sex, and treatment results emerges as a strong prognostic model (Cummock et al., 2023; Oliveira et al., 2023), a fully automated model based on imaging input alone can be useful for immediate risk stratification as soon as the CTA scan is completed.

The use of an ensemble model as described here also has the advantage of increasing generalizability. Since multiple versions of a model make a decision together by averaging their probabilities, the ensemble shows less fluctuation between patients and is less prone to overfitting. Since it is widely known that deep learning models trained on a specific set of imaging characteristics may struggle to generalize to external datasets with different imaging properties (Li et al., 2023), we also took additional measures to avoid overfitting in the training process. For example, we applied data augmentation techniques as well as L2-regularization, as they are known to increase the generalizability of a trained model (Sanford et al., 2020; Li et al., 2023). Notably, prognostic models based on automated analysis of admission CTA can offer generalizable treatment guidance information given the widespread availability of CTA scans, even in rural areas. This additional information is provided regardless of the presence of expert reviewers, and without the need for additional advanced imaging (such as perfusion), extra radiation, or contrast administration. Risk stratification of patients can identify potential candidates for additional post-thrombectomy neuroprotective or neuroregenerative therapies.

Attention maps help visually illustrate the deep learning model’s perception and provide insight into the decision-making process of machine learning models. As depicted in Figure 4, cerebral areas within the MCA supply territory had the highest impact on the decisions of our models. These findings confirm that model predictions were based on attention to at-risk cerebral tissue on CTA scans of LVO stroke patients (Waqas et al., 2020).

Our study has limitations. In model bias analysis, we found that false predictions of our model more commonly involved patients with favorable clinical characteristics (Table 2). The class imbalance within the dataset, particularly within the lower mRS categories, may have exerted an influence on the model’s performance during both the training and testing phases. In theory, there are some strategies that can be applied in the future to mitigate the issue of overestimating classes. For example, generative adversarial networks (GANs) are increasingly applied to generate synthetic images that oversample minority classes (Islam and Zhang, 2020), therefore increasing the ability of the model to reliably detect minority classes. Other approaches are aimed at optimizing the analysis of the generated probabilities like prediction uncertainty (Zou et al., 2023) or using a conditional probability for bias correction in prediction (Alexandari et al., 2020).

Also, retrospective datasets may suffer from biases inherent in the data collection process. These biases could be related to patient selection, imaging protocols, or institutional practices. Moreover, the mTICI scores included in the dataset were not from a core laboratory and are therefore subject to inter-examiner variability. In addition, stroke management and outcomes can evolve due to advancements in medical treatments, changes in clinical guidelines, or improvements in healthcare practices. Retrospective studies without external validation might not account for these temporal changes, affecting the model’s applicability to more recent patient cohorts.

5 Conclusion

We showed the feasibility of an end-to-end, fully automated model to predict post-thrombectomy outcomes from readily available admission CTA images and treatment data. The model can provide robust prognostication and can be practically useful in the absence of a reliable neurological exam. The comparison of our work to existing literature suggests that a simple deep learning approach, as implemented in our model, strikes a pragmatic balance between performance, architectural simplicity, and preprocessing ease.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: unknown. The datasets analyzed during the current study are available from the corresponding author on reasonable request. The programming code used for analysis can be found at: https://github.com/Fledermaus12/LVOstroke-DL. Requests to access these datasets should be directed to SP, sam.payabvash@yale.edu.

Author contributions

JS: Software, Writing – original draft, Writing – review & editing. FD: Formal analysis, Software, Writing – review & editing. TZ: Software, Writing – review & editing. AT: Software, Writing – review & editing. EA: Data curation, Writing – review & editing. AdM: Data curation, Writing – review & editing. AjM: Data curation, Writing – review & editing. CM: Data curation, Writing – review & editing. GF: Data curation, Writing – review & editing. VT-L: Data curation, Writing – review & editing. SA: Data curation, Software, Writing – review & editing. JD: Data curation, Writing – review & editing. LS: Data curation, Writing – review & editing. KS: Data curation, Writing – review & editing. SP: Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. CM was supported by the NIH (R21NS128641). LS was supported by the NIH (R01NS095993, R01NS097728). KS was supported by the AHA (17CSA33550004), NIH (U24NS107215, U24NS107136, U01NS106513, R01NR018335), and grants from Novartis, Biogen, Bard, Hyperfine and Astrocyte. KS reports equity interests in Alva Health. SP was supported by the NIH (K23NS118056) and Doris Duke Charitable Foundation (2020097). The funders were not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Alexandari, A., Kundaje, A., and Shrikumar, A. Maximum Likelihood with Bias-Corrected Calibration is Hard-To-Beat at Label Shift Adaptation. arXiv [Preprint]. Available at: https://arxiv.org/abs/1901.06852 (Accessed July 19, 2024).

Google Scholar

Avants, B. B., Tustison, N. J., Song, G., Cook, P. A., Klein, A., and Gee, J. C. (2011). A reproducible evaluation of Ants similarity metric performance in brain image registration. Neuro Image 54, 2033–2044. doi: 10.1016/j.neuroimage.2010.09.025

PubMed Abstract | Crossref Full Text | Google Scholar

Avery, E. W., Behland, J., Mak, A., Haider, S. P., Zeevi, T., Sanelli, P. C., et al. (2022). Ct angiographic radiomics signature for risk stratification in anterior large vessel occlusion stroke. Neuroimage Clin. 34:103034. doi: 10.1016/j.nicl.2022.103034

PubMed Abstract | Crossref Full Text | Google Scholar

Bendszus, M., Fiehler, J., Subtil, F., Bonekamp, S., Aamodt, A. H., Fuentes, B., et al. (2023). Endovascular thrombectomy for acute ischaemic stroke with established large infarct: multicentre, open-label, randomised trial. Lancet 402, 1753–1763. doi: 10.1016/S0140-6736(23)02032-9

PubMed Abstract | Crossref Full Text | Google Scholar

Camargo, E. C. S., Furie, K. L., Singhal, A. B., Roccatagliata, L., Cunnane, M. E., Halpern, E. F., et al. (2007). Acute brain infarct: detection and delineation with Ct angiographic source images versus nonenhanced Ct scans. Radiology 244, 541–548. doi: 10.1148/radiol.2442061028

Cardoso, M. J., Li, W., Brown, R., Ma, N., Kerfoot, E., Wang, Y., et al. (2022). MONAI: An open-source framework for deep learning in healthcare. arXiv [Preprint]. Available at: https://arxiv.org/abs/2211.02701 (Accessed July 19, 2024).

Chen, S., Ma, K., and Zheng, Y. (2019). Med3D: Transfer learning for 3D medical image analysis. CoRR abs/1904.00625.

Cummock, J. S., Wong, K. K., Volpi, J. J., and Wong, S. T. (2023). Reliability of the National Institutes of Health (NIH) stroke scale between emergency room and neurology physicians for initial stroke severity scoring. Cureus 15:e37595. doi: 10.7759/cureus.37595

DeLong, E. R., DeLong, D. M., and Clarke-Pearson, D. L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44, 837–845. doi: 10.2307/2531595

Gotkowski, K., Gonzalez, C., Bucher, A., and Mukhopadhyay, A. (2020). M3d-CAM: A PyTorch library to generate 3D data attention maps for medical deep learning. arXiv [Preprint]. Available at: https://arxiv.org/abs/2007.00453 (Accessed July 19, 2024).

Hilbert, A., Ramos, L. A., Van Os, H. J. A., Olabarriaga, S. D., Tolhuisen, M. L., Wermer, M. J. H., et al. (2019). Data-efficient deep learning of radiological image data for outcome prediction after endovascular treatment of patients with acute ischemic stroke. Comput. Biol. Med. 115:103516. doi: 10.1016/j.compbiomed.2019.103516

Huo, X., Ma, G., Tong, X., Zhang, X., Pan, Y., Nguyen, T. N., et al. (2023). Trial of endovascular therapy for acute ischemic stroke with large infarct. N. Engl. J. Med. 388, 1272–1283. doi: 10.1056/NEJMoa2213379

Islam, J., and Zhang, Y. (2020). GAN-based synthetic brain PET image generation. Brain Inform. 7:3. doi: 10.1186/s40708-020-00104-2

Li, X., Wu, Y., Tang, C., Fu, Y., and Zhang, L. (2023). Improving generalization of convolutional neural network through contrastive augmentation. Knowl.-Based Syst. 272:110543. doi: 10.1016/j.knosys.2023.110543

Oliveira, G., Fonseca, A. C., Ferro, J., and Oliveira, A. L. (2023). Deep learning-based extraction of biomarkers for the prediction of the functional outcome of ischemic stroke patients. Diagnostics 13:3604. doi: 10.3390/diagnostics13243604

Rorden, C., Bonilha, L., Fridriksson, J., Bender, B., and Karnath, H. O. (2012). Age-specific CT and MRI templates for spatial normalization. NeuroImage 61, 957–965. doi: 10.1016/j.neuroimage.2012.03.020

Sallustio, F., Motta, C., Pizzuto, S., Diomedi, M., Giordano, A., D'Agostino, V. C., et al. (2017). CT angiography-based collateral flow and time to reperfusion are strong predictors of outcome in endovascular treatment of patients with stroke. J. Neurointerv. Surg. 9, 940–943. doi: 10.1136/neurintsurg-2016-012628

Sanford, T. H., Zhang, L., Harmon, S. A., Sackett, J., Yang, D., Roth, H., et al. (2020). Data augmentation and transfer learning to improve generalizability of an automated prostate segmentation model. Am. J. Roentgenol. 215, 1403–1410. doi: 10.2214/AJR.19.22347

Sarraj, A., Hassan, A. E., Abraham, M. G., Ortega-Gutierrez, S., Kasner, S. E., Hussain, M. S., et al. (2023). Trial of endovascular thrombectomy for large ischemic strokes. N. Engl. J. Med. 388, 1259–1271. doi: 10.1056/NEJMoa2214403

Schramm, P., Schellinger, P. D., Klotz, E., Kallenberg, K., Fiebach, J. B., Külkens, S., et al. (2004). Comparison of perfusion computed tomography and computed tomography angiography source images with perfusion-weighted imaging and diffusion-weighted imaging in patients with acute stroke of less than 6 hours’ duration. Stroke 35, 1652–1658. doi: 10.1161/01.STR.0000131271.54098.22

Sharrock, M. F., Mould, W. A., Ali, H., Hildreth, M., Awad, I. A., Hanley, D. F., et al. (2021). 3D deep neural network segmentation of intracerebral hemorrhage: development and validation for clinical trials. Neuroinformatics 19, 403–415. doi: 10.1007/s12021-020-09493-5

Smith, W. S., Tsao, J. W., Billings, M. E., Johnston, S. C., Hemphill, J. C. 3rd, Bonovich, D. C., et al. (2006). Prognostic significance of angiographically confirmed large vessel intracranial occlusion in patients presenting with acute brain ischemia. Neurocrit. Care 4, 14–17. doi: 10.1385/NCC:4:1:014

Van Seeters, T., Biessels, G. J., Kappelle, L. J., Van Der Schaaf, I. C., Dankbaar, J. W., Horsch, A. D., et al. (2015). The prognostic value of CT angiography and CT perfusion in acute ischemic stroke. Cerebrovasc. Dis. 40, 258–269. doi: 10.1159/000441088

Waqas, M., Mokin, M., Primiani, C. T., Gong, A. D., Rai, H. H., Chin, F., et al. (2020). Large vessel occlusion in acute ischemic stroke patients: a dual-center estimate based on a broad definition of occlusion site. J. Stroke Cerebrovasc. Dis. 29:104504. doi: 10.1016/j.jstrokecerebrovasdis.2019.104504

Yoshimura, S., Sakai, N., Yamagami, H., Uchida, K., Beppu, M., Toyoda, K., et al. (2022). Endovascular therapy for acute stroke with a large ischemic region. N. Engl. J. Med. 386, 1303–1313. doi: 10.1056/NEJMoa2118191

Zaidat, O. O., Yoo, A. J., Khatri, P., Tomsick, T. A., Von Kummer, R., Saver, J. L., et al. (2013). Recommendations on angiographic revascularization grading standards for acute ischemic stroke: a consensus statement. Stroke 44, 2650–2663. doi: 10.1161/STROKEAHA.113.001972

Zhang, L., Wu, J., Yu, R., Xu, R., Yang, J., Fan, Q., et al. (2023). Non-contrast CT radiomics and machine learning for outcomes prediction of patients with acute ischemic stroke receiving conventional treatment. Eur. J. Radiol. 165:110959. doi: 10.1016/j.ejrad.2023.110959

Zou, K., Chen, Z., Yuan, X., Shen, X., Wang, M., and Fu, H. (2023). A review of uncertainty estimation and its application in medical imaging. Meta Radiol. 100003. doi: 10.1016/j.metrad.2023.100003

Keywords: deep learning, stroke, thrombectomy, CT angiography, outcome

Citation: Sommer J, Dierksen F, Zeevi T, Tran AT, Avery EW, Mak A, Malhotra A, Matouk CC, Falcone GJ, Torres-Lopez V, Aneja S, Duncan J, Sansing LH, Sheth KN and Payabvash S (2024) Deep learning for prediction of post-thrombectomy outcomes based on admission CT angiography in large vessel occlusion stroke. Front. Artif. Intell. 7:1369702. doi: 10.3389/frai.2024.1369702

Received: 12 January 2024; Accepted: 17 July 2024; Published: 01 August 2024.

Edited by:

Tuan D. Pham, Queen Mary University of London, United Kingdom

Reviewed by:

Vivek Yedavalli, Johns Hopkins Medicine, United States
Jose Jaramillo-Villegas, Technological University of Pereira, Colombia

Copyright © 2024 Sommer, Dierksen, Zeevi, Tran, Avery, Mak, Malhotra, Matouk, Falcone, Torres-Lopez, Aneja, Duncan, Sansing, Sheth and Payabvash. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Seyedmehdi Payabvash, sam.payabvash@yale.edu
