- Université Grenoble Alpes, Inserm, CHU Grenoble Alpes, U1216, Grenoble Institut Neurosciences, Grenoble, France
The gold standard to diagnose intracerebral lesions after traumatic brain injury (TBI) is computed tomography (CT) scan, and due to its accessibility and improved quality of images, the global burden of CT scan for TBI patients is increasing. The recent developments of automated determination of traumatic brain lesions and medical-decision process using artificial intelligence (AI) represent opportunities to help clinicians in screening more patients, identifying the nature and volume of lesions and estimating the patient outcome. This short review will summarize what is ongoing with the use of AI and CT scan for patients with TBI.
1. Introduction
Traumatic brain injury (TBI) is a leading cause of death and disability in young people. It affects more than one European in 400 (1). It therefore has a strong human and financial impact on society (2). If the initial aggression induces primary lesions that are instantaneous and unavoidable, it can also cause delayed lesions, named secondary injuries, which strongly influence the neurological outcome. Therefore, the critical care management of TBI patients is to limit secondary brain injuries with therapies that are adapted to patients' severity (3).
Assessment of severity of TBI relies on clinical examination and initial brain imaging. Clinical examination is poor at the early phase of TBI and is based on the pupillary reactivity and the Glasgow Coma Score (GCS) that classifies TBI in 3 stages: mild, moderate, and severe. As a result, GCS categories cover a large pattern of TBI and are unable to differentiate specific evolutions such as diffuse injuries or focal lesions (4). Brain imaging, mainly computed tomography (CT) scan (5), is an additional tool to better classify TBI patients. Based on the radiological reading findings, researchers have built imaging scores that describe and quantify brain lesions. Several scores have been developed, i.e., the Marshall classification (6), the Rotterdam score (7), the Stockholm score (8), and the Helsinki score (9). All scores are correlated with the neurological outcome defined as the Glasgow Outcome Scale (GOS) (9–12).
However, there are some drawbacks with the use of these CT scan scores. One point is their dichotomized answers, i.e., yes or no, that may underestimate the global effect of small intracerebral lesions. Despite their relative simplicity, inter-observer variability should be considered with the use of these scores. Chun et al. (13) measured this inter-observer agreement and found an average Cohen's Kappa coefficient of 0.57 for the detection of CT abnormalities, far from 1, corresponding to a full inter-observer agreement. Finally, categorizing each patient within a CT scan score is demanding in human resources that may explain why these scores are not used in clinical practice. In this context, the development of automated approaches to determine the nature and volume of traumatic brain injuries from CT scans has gained interest among clinicians.
Due to the philosophical implications of intelligence's definition, the general definition of artificial intelligence (AI) remains unclear. Nevertheless, one can use the expression “AI” as defined in (14): a field of computer science dedicated to the creation of systems performing tasks that usually require human intelligence, and the term “machine learning” (ML) as: a sub-field of AI that includes all those approaches that allow computers to learn from data without being explicitly programmed. The rise of computing power capacities at the beginning of the 2,000 and the creation of large database have revealed the effectiveness of AI algorithms applied to medical images to improve the clinical care (14). Among the three ways of learning for ML algorithms (supervised, unsupervised, and reinforcement learning), the main algorithms used on biomedical imaging to predict outcome is the supervised learning, where input data and the variable to predict are known and the algorithms learn the relation between them (15). This expanding field of research may help the quantitative analysis of CT images and offer new perspectives in clinical care of TBI patients by following the process presented in Figure 1A. This review presents studies focusing on the classification and the segmentation of lesions based on the manual or automated analysis of CT scans. The main studies are summarized in Table 1. We will also cover the current research gaps and potential perspectives of the use of AI combined to CT scans in the TBI area.
Figure 1. (A) Contribution of computed tomography (CT) scan analysis by artificial intelligence to the clinical care of traumatic brain injury (TBI) patients. References and terms are defined in Table 1. (B) Example of the use of artificial intelligence (AI) algorithms on clinical routine. CT scans of two patients (P1 and P2) at D0 were quantified with state of the art algorithms. On the right, CT scans of the same two patients acquired at D1 are shown. P1 and P2 had different clinical care. P1 underwent a decompressive craniectomy and not P2. Biggest extra axial hemorrhage (EAH) lesion was segmented with Brain Lesion Analysis and Segmentation Tool for Computed Tomography (BLAST-CT) (16) and radiomic metrics on this region of interest (ROI) were extracted as in (17). At first sight, the two lesions have the same profile, with equivalent volumes and means, but the variance of P1 is higher than twice the one of P2. That could for instance be a biomarker evaluated in further studies to predict the need for craniectomy. ICH, intracranial hemorrhage; GCS, Glasgow Coma Score; MLR, multivariate logical regression; RF, random forest; ANN, artificial neural network; GOS, Glasgow Outcome Score; CT scan, computed tomography image; Ref, References; HU, Hounsfield Units; ROI, region of interest; EAH, extra axial hemorrhage; D, day.
2. Classification
2.1. Definition and Evaluation
Classification is the way for an algorithm to attribute a class to each input object, which is defined by one or more data points. There are different metrics to evaluate the performances of a classification. Let us assume a classification algorithm aimed at predicting a pathology and let us consider its evaluation on a test database. One defines as TP the number of true positive predictions, TN the number of true negative predictions, FP the number of false positive predictions, and FN the number of false negative predictions. The algorithm's Accuracy (Acc), which represents the percentage of good predictions, can be computed as:
To separate the cases of false prediction on healthy or pathological subjects, one can define the Sensitivity (Se) and Specificity (Sp):
Modifying the discrimination threshold of a classification algorithm allows to weigh the cost of predicting the pathology. It is indeed more costly to predict normality for a patient than pathology for a healthy subject. It allows to decrease FN, but as a consequence, increase FP, and modify TP and TN. The modification of this threshold leads to new values of TP, TN, FP, and FN, and one can summarize these values by representing Se in function of (1−Sp), defining the receiver-operator curve (ROC). The measure of the area under the curve (AUC) of the ROC curve represents the aggregate performances of the classification algorithm, whatever its discrimination threshold, and is a metric widely used to evaluate classification algorithms. An AUC of 1 describes a perfect classifier, and a value of 0.5 means that the classifier performs no better than chance.
2.2. State of the Art
2.2.1. Manual Quantification
The first studies involving AI tools aimed to predict the outcome after TBI. Patient outcome at 6 and 12 months was defined favorable vs. unfavorable outcome, i.e., death and severe disability, according to the GOS score. From a methodological point of view, these studies were based on the analysis of heterogeneous metrics, representing demography (age, sex, etc.), physiology (pupil response to light, GCS score, blood glucose concentration, etc.), and brain CT scan metrics manually quantified by radiologists using imaging scores. These metrics were aggregated to build prognostic models using multivariate logical regression approaches as reviewed by Perel et al. (31). Two large datasets of TBI patients produced predictive models: CRASH (18) and IMPACT (19, 32, 33). These models were subsequently externally validated with a good performance in predicting mortality and unfavorable outcome (AUC between 0.65 and 0.87) (34). The prevalence of the metrics was estimated, and if the main predictors are age or GCS (35), CT characteristics manually quantified also carry crucial information able to predict TBI outcome (18, 36, 37). CT characteristics were also evaluated individually, and showed important prognostic ability in TBI outcome, as reviewed by Zhu et al. (38), but imaging scores were only able to predict outcome with an AUC between 0.56 and 0.82 (9, 11, 12).
More recently, instead of multivariate logical regression (MLR), complex algorithms were evaluated. In 2018, 9 ML algorithms were tested upon 232 TBI patients in order to predict outcome and mortality from 14 predictors including radiological reading findings at admission and at the following day (20). The best models were obtained using Random Forest (RF), providing an AUC of 0.895 for outcome prediction and using ridge regression, providing an AUC of 0.875 for mortality prediction, both on a dedicated test set.
Some other benchmarks of ML algorithms included artificial neural network (ANN) experiments (39). ANN-based models showed contrasted performances in classifying the outcome from manual quantified metrics, with studies obtaining results outperforming the state-of-the-art models, showing an AUC = 0.946 predicting the dichotomized GOS (21) or an Acc = 0.92 predicting 6-month mortality (22), and studies showing lower performances predicting in hospital mortality (AUC = 0.706) (23) or dichotomized outcome (AUC = 0.78) (24). These differences can be explained by the gap between the number of subjects of the study (all datasets included) (565, 2059, 11,026, 12,576, respectively, to the order of citation), the proportion of pathological and healthy subjects (6, 10, 26, and 47%) or by the validation method (internal, internal, external, external). These differences indicate that the methodology is a crucial step to truly measure ML model performances and to trust its predictions (40).
2.2.2. Automated Quantification
Besides clinical metrics, CT scan provides high spatial resolution images of the brain that contain much more information than that summarized in imaging scores. Several researchers recently started to use automatic algorithms to exploit this large amount of CT scans with the aim to find biomarkers able to predict outcome in TBI.
2.2.2.1. Histograms
To our knowledge, the first article to study TBI on CT scan was Kim et al. (25). Thanks to an automatic preprocessing pipeline of brain extraction of head injured pediatric patients, the authors showed that the proportion of brain voxel values ranging between 17 and 24 Hounsfield Units (HU) was a good predictor of edema severity (AUC = 0.85). This work demonstrated that (i) CT scans contain valuable quantitative information linked to the evolution of the status of the brain and (ii) automatic tools may be used to extract these quantitative features.
2.2.2.2. Radiomics
Distribution of voxels values, represented by a histogram, is a simple metric to quantitatively characterize an image. One can go further and extract more complex metrics to represent shape, texture, or contrast of an image (41). This research field is called Radiomic (42), and has proven its relevance in tumor diagnosis (43), while its biological meaning is currently discussed (44). Each radiomic metric can be calculated on a 2D or 3D region of interest (ROI), leading to extract a large number of metrics characterizing the ROI. This has been employed in TBI patients (17), suggesting that extracting metrics of first-order statistics (FOS), texture, and shape from ROI of segmented lesions could discriminate the injury's nature, among epidural hematoma (EDH), acute subdural hematoma (ASH), and contusion, and providing an Acc = 89.7. Another way to use radiomic is to extract each metric from a sliding window moving on a CT scan, leading to generate as many parametric maps as metrics used. This method has been used by Muschelli et al. (45) to segment hemorrhage. However, this has not been specifically used for TBI yet to our knowledge.
2.2.2.3. Convolutional Neural Networks
Deep learning is a subset of ML methods based on ANN. There are several types of networks among which recurrent neural networks (RNN) or convolutional neural networks (CNN). In computer vision, due to their ability to analyze pixels and their neighborhood, CNN are the most used networks in classification and segmentation tasks, and so in the biomedical imaging field (46, 47). In TBI, to classify injuries from CT scans, Chilamkurthy et al. (26) built several DL (and RF) algorithms each able to detect the presence or the absence of one type of lesion in a CT scan. Injury types contained different types of intracranial hemorrhage, fracture, midline shift, and mass effect. These algorithms were externally validated with an AUC from 0.92 to 0.97.
3. Automated Segmentation
3.1. Definition and Evaluation
Segmentation is, in image processing, an operation aimed to split an image in 2 or more ROI. Let us assuming a segmentation algorithm aimed to segment a brain lesion and its evaluation on a test database. To compare the output segmentation X to a ground truth segmentation Y, one can computes the Dice similarity coefficient (DSC), defined as, for |X| the number of pixels in the segmentation X:
This coefficient measures the number of common pixels between 2 segmentations in relation to the global number of pixels of the 2 segmentations. Some algorithms are trained on images that include images without lesion, therefore without ground truth segmentation. In these cases, if the algorithm predicts a lesion, even very small, the DSC is equal to 0 and strongly affects the mean DSC, although the segmentation error is quite small. That is why on some studies, DSC is only computed on large lesions, leading to better DSC values than studies measuring DSC on all lesions.
3.2. State of the Art
In computer vision segmentation, as well as in classification (see section 2.2.2.3), CNN is the most used ANN subtype.
3.2.1. Segmentation
In TBI data, Jadon et al. (27) compared CNN architectures and built 3 algorithms, each segmenting one type of injury among (intraparenchymal hemorrhage [IPH], extra axial hemorrhage [EAH], and hemorrhagic contusions) from 2D CT slice. DSC ranged from 0.85 for EAH to 0.95 for IPH. On their side, Jain et al. (28) built an algorithm based on the segmentation of WM, GM, and CSF to segment acute intracranial lesions (DSC = 0.73) and cisterns (DSC = 0.70 for cisterns larger than 5 mL).
Kuo et al. (29) demonstrated the superiority of a training on random patches of 240*240 pixels and evaluation on sliding windows on a CT scan to detect and segment intracranial hemorrhage against more classical CNN, which takes as input a whole CT 2D image. The resulting algorithm provides an AUC of 0.966 on external validation and a DSC of 0.72 on internal validation.
As there are different ways to build a CNN, Yao et al. (30) built a multiview CNN aimed to segment acute hematoma that analyzes CT scan at different resolutions to help detecting small lesions. It shows a DSC of 0.69 on an internal test set containing 20 subjects.
More recently, Monteiro et al. (16) made available a model named Brain Lesion Analysis and Segmentation Tool for Computed Tomography (BLAST-CT), aimed to segment and classify four types of TBI injuries: IPH, EAH, intraventricular hemorrhage (IVH), and perilesional edema (PO). They built a CNN, inspired from the architecture of Deepmedic (48), and trained it on 839 TBI scans where the four former lesion types had been delineated by trained personnel. The resulting model's segmentation have been validated on an internal cohort of 655 scans, providing a DSC of 0.36 (as mentioned before, DSC is very sensitive to small volumes and might not be the best metric to evaluate segmentation of small lesions), while the classification has been validated on an external cohort of 490 scans, providing an AUC between 0.83 and 0.95 for the different lesion types to classify.
3.2.2. Quantification From Segmentation
Besides this type of segmentation, some researchers went further in the quantification of CT images. Jain et al. (28) explored the measurement of cistern volume and midline shift because of a pipeline composed of CNNs and morphological operations and provides an Acc of 0.89 for midline shift estimation. Yao et al. (30) went, to our knowledge, the furthest. They automatically measured hematoma volumes, their localization, shape features representing the hematomas, and trained a RF algorithm with these metrics and biological information. Thereby, they were able to predict 6 months mortality, achieving an AUC of 0.85 on internal validation on 828 patients.
4. Discussions and conclusion
The development of ML algorithms offers new possibilities in predicting TBI outcome. While these methods provide few improvements in predicting outcome from clinical variables and imaging scores compared to classic multivariate logistic regression, they might change the paradigm about image quantification. In addition to the automatization and speeding up of the quantification, image processing methods can help to detect humanly undetectable patterns and, by represent early cerebral rearrangement, provide new biomarkers of TBI evolution, especially in TBI's modality of choice: CT.
4.1. Current Research Gaps
In the coming years, human segmentation and quantification of CT scans can probably be replaced by automated processing of ML algorithms, but their incorporation in clinical routine will be dependent on models and prediction's explainability and interpretability. Progress must be done on mastering CNN's theory to evaluate their capacities and limitations. A preliminary step could be the development of saliency maps (49), representing the voxels which have a strong impact on the prediction. Each prediction from a CT scan could be presented along with its saliency map, in order to explain the algorithm decision and potentially check its relevance, as already proposed by Kuo et al. (29).
Machine learning algorithms depend on their training dataset's nature. In order to deploy tools usable on multiple sites, one should put efforts on reducing intermachine and interprotocol variability. Indeed, the different models of scanner and parameters of acquisition, such as reconstruction algorithms, output resolution, voltage, or number of detectors, influence output images properties (50, 51). While researchers studied harmonization of metrics extracted from images (52), few studies intend to reduce variability at image level (53, 54). Since deep learning algorithms might be able to learn intersite variabilities and take them into account if trained on heterogeneous cohorts, efforts must then be made on the establishment of large and multicentric cohorts to cover the spectrum of all secondary injuries induced by TBI, as initiated by Maas et al. (55) or Flanders et al. (56).
As ML models on medical imaging usually follow the same pattern—metric extractions, features selection, training, validation and the same evaluation metrics (AUC, Acc, DSC, etc.)—it seems easy to compare them. However, these metrics strongly depend on the methodology of the model training and evaluation. If each study has its specifics, one must pay attention to the quality of the dataset (number of patients, heterogeneity/homogeneity), training and validation modalities (internal, external), and metrics of evaluation (DSC computed on all lesions or only on large lesions). An improvement in methodology's robustness, as well as in the availability of the mathematical models, will surely benefit all the scientific community and help build stronger studies, which are able to give birth to potential breakthrough in the clinical care of TBI patients.
4.2. Potential Future Developments
Brain CT represents the brain's density at the acquisition time. CT scans are generally acquired at different times to follow cerebral rearrangement after injuries or surgery. To improve our comprehension of cerebral rearrangement following heterogeneous secondary injuries, the establishment of longitudinal cohorts and their quantification is crucial to define different profile of evolution (5). Such profiles could then be isolated, leading to specific mechanisms and maybe specific treatments.
From a clinician point of view, the idea to combine clinical data and CT metrics is strongly relevant. One can imagine combining many variables from different nature, like most of the models presented in section 2.2.1, but replace manual quantification by automated and deep quantification of CT images provided by modern ML algorithms. As evaluated by Yao et al. (30), we also believe that in the field of TBI, localization of lesions have a large impact on the patient outcome. The development methods to register structural atlases on distorted TBI CT scans and the incorporation of this spatial information on prediction models might strongly improve their performances.
For research purposes, one might be soon able to predict clinical data as the neurological outcome or the level of medical care needed by a patient, as illustrated in Figure 1B. However, since no study evaluated the relevance of AI algorithms in the quantification of complex images, as mixed type injuries, artifacts altered images, or cohabitation of different lesions types as tumors or white matter diseases, their use in complex case studies remains challenging. One can then imagine that these algorithms will first be used in clinical routine for screening, triage, or indicative prediction in support of human readers.
4.3. Conclusion
AI algorithms have shown promising results in the biomedical field, especially in medical imaging. Their use on TBI CT scans, for classification and/or segmentation, is expanding and might become the reference methods in the next years if some problematic but surmountable issues are addressed. Their integration in clinical routine depends on the confidence on their predictions that might be increased with rigorous methodology.
Author Contributions
CB, BL, and PB wrote the first draft of the manuscript. All authors listed have made a substantial, direct and intellectual contribution to the work, approved it for publication, proofread, and corrected the final manuscript.
Funding
This work has been supported by the Fondation des Gueules Cassées (Paris, France) and the hospital CHUGA (Grenoble, France) through the project Radiomic-TBI.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
1. Peeters W, van den Brande R, Polinder S, Brazinova A, Steyerberg EW, Lingsma HF, et al. Epidemiology of traumatic brain injury in Europe. Acta Neurochirurgica (2015) 157:1683–96. doi: 10.1007/s00701-015-2512-7
2. Maas AIR, Menon DK, Adelson PD, Andelic N, Bell MJ, Belli A, et al. Traumatic brain injury: integrated approaches to improve prevention, clinical care, and research. Lancet Neurol. (2017) 16:987–1048. doi: 10.1016/S1474-4422(17)30371-X
3. Bouzat P, Sala N, Payen JF, Oddo M. Beyond intracranial pressure: optimization of cerebral blood flow, oxygen, and substrate delivery after traumatic brain injury. Ann Intensive Care. (2013) 3:23. doi: 10.1186/2110-5820-3-23
4. Stocchetti N, Carbonara M, Citerio G, Ercole A, Skrifvars MB, Smielewski P, et al. Severe traumatic brain injury: targeted management in the intensive care unit. Lancet Neurol. (2017) 16:452–64. doi: 10.1016/S1474-4422(17)30118-7
5. Schweitzer AD, Niogi SN, Whitlow CT, Tsiouris AJ. Traumatic brain injury: imaging patterns and complications. RadioGraphics (2019) 39:1571–95. doi: 10.1148/rg.2019190076
6. Marshall LF, Marshall SB, Klauber MR, Clark MvB, Eisenberg HM, Jane JA, et al. A new classification of head injury based on computerized tomography. J Neurosur. (1991) 75:S14–20. doi: 10.3171/sup.1991.75.1s.0s14
7. Maas AIR, Hukkelhoven CWPM, Marshall LF, Steyerberg EW. Prediction of outcome in traumatic brain injury with computed tomographic characteristics: a comparison between the computed tomographic classification and combinations of computed tomographic predictors. Neurosurgery (2005) 57:1173–82; discussion 1173–82. doi: 10.1227/01.neu.0000186013.63046.6b
8. Nelson DW, Nyström H, MacCallum RM, Thornquist B, Lilja A, Bellander BM, et al. Extended analysis of early computed tomography scans of traumatic brain injured patients and relations to outcome. J Neurotrauma (2010) 27:51–64. doi: 10.1089/neu.2009.0986
9. Raj R, Siironen J, Skrifvars MB, Hernesniemi J, Kivisaari R. Predicting outcome in traumatic brain injury: development of a novel computerized tomography classification system (Helsinki computerized tomography score). Neurosurgery (2014) 75:632–46; discussion 646–7. doi: 10.1227/NEU.0000000000000533
10. Jennett B. Assessment of outcome after severe brain damage a practical scale. Lancet (1975) 305:480–4. doi: 10.1016/s0140-6736(75)92830-5
11. Thelin EP, Nelson DW, Vehviläinen J, Nyström H, Kivisaari R, Siironen J, et al. Evaluation of novel computerized tomography scoring systems in human traumatic brain injury: an observational, multicenter study. PLoS Med. (2017) 14:e1002368. doi: 10.1371/journal.pmed.1002368
12. Pargaonkar R, Kumar V, Menon G, Hegde A. Comparative study of computed tomographic scoring systems and predictors of early mortality in severe traumatic brain injury. J Clin Neurosci. (2019) 66:100–6. doi: 10.1016/j.jocn.2019.05.011
13. Chun KA, Manley GT, Stiver SI, Aiken AH, Phan N, Wang V, et al. Interobserver variability in the assessment of CT imaging features of traumatic brain injury. J Neurotrauma (2010) 27:325–30. doi: 10.1089/neu.2009.1115
14. Pesapane F, Codari M, Sardanelli F. Artificial intelligence in medical imaging: threat or opportunity? Radiologists again at the forefront of innovation in medicine. Eur Radiol Exp. (2018) 2:35. doi: 10.1186/s41747-018-0061-6
15. Komorowski M. Artificial intelligence in intensive care: are we there yet? Intensive Care Med. (2019) 45:1298–300. doi: 10.1007/s00134-019-05662-6
16. Monteiro M, Newcombe VFJ, Mathieu F, Adatia K, Kamnitsas K, Ferrante E, et al. Multiclass semantic segmentation and quantification of traumatic brain injury lesions on head CT using deep learning: an algorithm development and multicentre validation study. Lancet Dig Health. (2020) 2:e314–22. doi: 10.1016/S2589-7500(20)30085-6
17. Rosa Edl, Sima DM, Vyvere TV, Kirschke JS, Menze B. A radiomics approach to traumatic brain injury prediction in CT scans. In: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019) (2019). p. 732–5.
18. MRC CRASH Trial Collaborators. Predicting outcome after traumatic brain injury: practical prognostic models based on large cohort of international patients. BMJ (2008) 336:425–9. doi: 10.1136/bmj.39461.643438.25
19. Steyerberg EW, Mushkudiani N, Perel P, Butcher I, Lu J, McHugh GS, et al. Predicting outcome after traumatic brain injury: development and international validation of prognostic scores based on admission characteristics. PLoS Med. (2008) 5:e165; discussion e165. doi: 10.1371/journal.pmed.0050165
20. Matsuo K, Aihara H, Nakai T, Morishita A, Tohma Y, Kohmura E. Machine learning to predict in-hospital morbidity and mortality after traumatic brain injury. J Neurotrauma (2020) 37:202–10. doi: 10.1089/neu.2018.6276
21. Hale AT, Stonko DP, Brown A, Lim J, Voce DJ, Gannon SR, et al. Machine-learning analysis outperforms conventional statistical models and CT classification systems in predicting 6-month outcomes in pediatric patients sustaining traumatic brain injury. Neurosurgical Focus. (2018) 45:E2. doi: 10.3171/2018.8.FOCUS17773
22. Rau CS, Kuo PJ, Chien PC, Huang CY, Hsieh HY, Hsieh CH. Mortality prediction in patients with isolated moderate and severe traumatic brain injury using machine learning models. PLoS ONE (2018) 13:e0207192. doi: 10.1371/journal.pone.0207192
23. van der Ploeg T, Nieboer D, Steyerberg EW. Modern modeling techniques had limited external validity in predicting mortality from traumatic brain injury. J Clin Epidemiol. (2016) 78:83–9. doi: 10.1016/j.jclinepi.2016.03.002
24. Gravesteijn BY, Nieboer D, Ercole A, Lingsma HF, Nelson D, van Calster B, et al. Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury. J Clin Epidemiol. (2020) 122:95–107. doi: 10.1016/j.jclinepi.2020.03.005
25. Kim H, Kim Gd, Yoon BC, Kim K, Kim BJ, Choi YH, et al. Quantitative analysis of computed tomography images and early detection of cerebral edema for pediatric traumatic brain injury patients: retrospective study. BMC Med. (2014) 12:186. doi: 10.1186/s12916-014-0186-2
26. Chilamkurthy S, Ghosh R, Tanamala S, Biviji M, Campeau NG, Venugopal VK, et al. Deep learning algorithms for detection of critical findings in head CT scans: a retrospective study. Lancet (2018) 392:2388–96. doi: 10.1016/S0140-6736(18)31645-3
27. Jadon S, Leary OP, Pan I, Harder TJ, Wright DW, Merck LH, et al. A comparative study of 2D image segmentation algorithms for traumatic brain lesions using CT data from the ProTECTIII multicenter clinical trial. Med Imag. (2020) 48. doi: 10.1117/12.2566332
28. Jain S, Vyvere TV, Terzopoulos V, Sima DM, Roura E, Maas A, et al. Automatic quantification of computed tomography features in acute traumatic brain injury. J Neurotrauma (2019) 36:1794–803. doi: 10.1089/neu.2018.6183
29. Kuo W, Häne C, Yuh E, Mukherjee P, Malik J. PatchFCN for intracranial hemorrhage detection. arXiv:180603265 [cs]. (2019) Apr;ArXiv: 1806.03265.
30. Yao H, Williamson C, Gryak J, Najarian K. Automated hematoma segmentation and outcome prediction for patients with traumatic brain injury. Artif Intel Med. (2020) 107:101910. doi: 10.1016/j.artmed.2020.101910
31. Perel P, Edwards P, Wentz R, Roberts I. Systematic review of prognostic models in traumatic brain injury. BMC Med Inform Decis Mak. (2006) 6:38. doi: 10.1186/1472-6947-6-38
32. Marmarou A, Lu J, Butcher I, McHugh GS, Mushkudiani NA, Murray GD, et al. IMPACT database of traumatic brain injury: design and description. J Neurotrauma (2007) 24:239–50. doi: 10.1089/neu.2006.0036
33. Maas AIR, Marmarou A, Murray GD, Teasdale SGM, Steyerberg EW. Prognosis and clinical trial design in traumatic brain injury: the IMPACT study. J Neurotrauma (2007) 24:232–8. doi: 10.1089/neu.2006.0024
34. Roozenbeek B, Lingsma HF, Lecky FE, Lu J, Weir J, Butcher I, et al. Prediction of outcome after moderate and severe traumatic brain injury: external validation of the international mission on prognosis and analysis of clinical trials (IMPACT) and corticoid randomisation after significant head injury (CRASH) prognostic models*. Crit Care Med. (2012) 40:1609–17. doi: 10.1097/CCM.0b013e31824519ce
35. Rovlias A, Kotsou S. Classification and regression tree for prediction of outcome after severe head injury using simple clinical and laboratory variables. J Neurotrauma (2004) 21:886–93. doi: 10.1089/0897715041526249
36. Murray GD, Butcher I, McHugh GS, Lu J, Mushkudiani NA, Maas AIR, et al. Multivariable prognostic analysis in traumatic brain injury: results from the IMPACT study. J Neurotrauma (2007) 24:329–37. doi: 10.1089/neu.2006.0035
37. Lingsma HF, Roozenbeek B, Steyerberg EW, Murray GD, Maas AI. Early prognosis in traumatic brain injury: from prophecies to predictions. Lancet Neurol. (2010) 9:543–54. doi: 10.1016/S1474-4422(10)70065-X
38. Zhu GW, Wang F, Liu WG. Classification and prediction of outcome in traumatic brain injury based on computed tomographic imaging. J Int Med Res. (2009) 37:983–95. doi: 10.1177/147323000903700402
39. Mohd Noor NSE, Ibrahim H. Predicting outcomes in patients with traumatic brain injury using machine learning models. In: Jamaludin Z, Ali Mokhtar MN, editors. Intelligent Manufacturing and Mechatronics. Lecture Notes in Mechanical Engineering. Singapore: Springer (2020). p. 12–20.
40. Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. (2019) 110:12–22. doi: 10.1016/j.jclinepi.2019.02.004
41. Gong T, Liu R, Tan CL, Farzad N, Lee CK, Pang BC, et al. Classification of CT brain images of head trauma. In: Rajapakse JC, Schmidt B, Volkert G, editors. Pattern Recognition in Bioinformatics. Lecture Notes in Computer Science. Berlin, Heidelberg: Springer (2007). p. 401–8.
42. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RGPM, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036
43. Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, they are data. Radiology (2016) 278:563–77. doi: 10.1148/radiol.2015151169
44. Tomaszewski MR, Gillies RJ. The biological meaning of radiomic features. Radiology (2021) 298:202553. doi: 10.1148/radiol.2021202553
45. Muschelli J, Sweeney EM, Ullman NL, Vespa P, Hanley DF, Crainiceanu CM. PItcHPERFeCT: primary intracranial hemorrhage probability estimation using random forests on CT. Neuroimage Clin. (2017) 14:379–90. doi: 10.1016/j.nicl.2017.02.007
46. Arbabshirani MR, Fornwalt BK, Mongelluzzo GJ, Suever JD, Geise BD, Patel AA, et al. Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration. npj Dig Med. (2018) 1:1–7. doi: 10.1038/s41746-017-0015-z
47. Kim M, Yun J, Cho Y, Shin K, Jang R, Bae Hj, et al. Deep learning in medical imaging. Neurospine (2019) 16:657–68. doi: 10.14245/ns.1938396.198
48. Kamnitsas K, Ferrante E, Parisot S, Ledig C, Nori AV, Criminisi A, et al. DeepMedic for brain tumor segmentation. In: Crimi A, Menze B, Maier O, Reyes M, Winzeck S, Handels H, editors. Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, Vol. 10154. Cham: Springer International Publishing (2016). p. 138–49.
49. Simonyan K, Vedaldi A, Zisserman A. Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv:13126034 [cs] (2014) Apr;ArXiv: 1312.6034.
50. Mackin D, Fave X, Zhang L, Fried D, Yang J, Taylor B, et al. Measuring CT scanner variability of radiomics features. Invest Radiol. (2015) 50:757–65. doi: 10.1097/RLI.0000000000000180
51. Lu L, Ehmke RC, Schwartz LH, Zhao B. Assessing agreement between radiomic features computed for multiple CT imaging settings. PLoS ONE (2016) 11:e0166550. doi: 10.1371/journal.pone.0166550
52. Orlhac F, Frouin F, Nioche C, Ayache N, Buvat I. Validation of a method to compensate multicenter effects affecting CT radiomics. Radiology (2019) 291:53–9. doi: 10.1148/radiol.2019182023
53. Yi Z, Zhang H, Tan P, Gong M. DualGAN: unsupervised dual learning for image-to-image translation. arXiv:170402510 [cs] (2018) ArXiv: 1704.02510.
54. Dewey BE, Zhao C, Reinhold JC, Carass A, Fitzgerald KC, Sotirchos ES, et al. DeepHarmony: a deep learning approach to contrast harmonization across scanner changes. Magn Reson Imag. (2019) 64:160–70. doi: 10.1016/j.mri.2019.05.041
55. Maas AIR, Menon DK, Steyerberg EW, Citerio G, Lecky F, Manley GT, et al. Collaborative european neuroTrauma effectiveness research in traumatic brain injury (CENTER-TBI): a prospective longitudinal observational study. Neurosurgery (2015) 76:67–80. doi: 10.1227/NEU.0000000000000575
Keywords: traumatic brain injury, artificial intelligence, computed tomography, segmentation, classification, review
Citation: Brossard C, Lemasson B, Attyé A, de Busschère J-A, Payen J-F, Barbier EL, Grèze J and Bouzat P (2021) Contribution of CT-Scan Analysis by Artificial Intelligence to the Clinical Care of TBI Patients. Front. Neurol. 12:666875. doi: 10.3389/fneur.2021.666875
Received: 11 February 2021; Accepted: 15 April 2021;
Published: 10 June 2021.
Edited by:
Hengli Tian, Shanghai Jiao Tong University, ChinaReviewed by:
John K. Yue, University of California, San Francisco, United StatesRalph George Depalma, United States Department of Veterans Affairs, United States
Copyright © 2021 Brossard, Lemasson, Attyé, de Busschère, Payen, Barbier, Grèze and Bouzat. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Benjamin Lemasson, benjamin.lemasson@univ-grenoble-alpes.fr