A Deep Learning Model for Preoperative Differentiation of Glioblastoma, Brain Metastasis and Primary Central Nervous System Lymphoma: A Pilot Study

Tariciotti, Leonardo; Caccavella, Valerio M.; Fiore, Giorgio; Schisano, Luigi; Carrabba, Giorgio; Borsa, Stefano; Giordano, Martina; Palmisciano, Paolo; Remoli, Giulia; Remore, Luigi Gianmaria; Pluderi, Mauro; Caroli, Manuela; Conte, Giorgio; Triulzi, Fabio; Locatelli, Marco; Bertani, Giulio

doi:10.3389/fonc.2022.816638

ORIGINAL RESEARCH article

Front. Oncol., 24 February 2022

Sec. Surgical Oncology

Volume 12 - 2022 | https://doi.org/10.3389/fonc.2022.816638

This article is part of the Research TopicNew Frontiers for Artificial Intelligence in Surgical Decision Making and its Organizational ImpactsView all 7 articles

A Deep Learning Model for Preoperative Differentiation of Glioblastoma, Brain Metastasis and Primary Central Nervous System Lymphoma: A Pilot Study

Leonardo Tariciotti^1,2*†

Valerio M. Caccavella^3†

Giorgio Fiore^1,2

Luigi Schisano^1,2

Giorgio Carrabba¹

Stefano Borsa¹

Martina Giordano^2,4

Paolo Palmisciano⁵

Giulia Remoli⁶

Luigi Gianmaria Remore¹

Mauro Pluderi¹

Manuela Caroli¹

Giorgio Conte^7,8

Fabio Triulzi^7,8

Marco Locatelli^1,8,9‡

Giulio Bertani^1‡

¹Unit of Neurosurgery, Fondazione IRCCS Cà Granda Ospedale Maggiore Policlinico, Milan, Italy
²Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
³Department of Paediatric Orthopaedics and Traumatology, ASST Centro Specialistico Ortopedico Traumatologico Gaetano Pini-CTO, Milan, Italy
⁴Department of Neurosurgery, Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy
⁵Department of Neurosurgery, Trauma Center, Gamma Knife Center, Cannizzaro Hospital, Catania, Italy
⁶National Center for Disease Prevention and Health Promotion, Italian National Institute of Health, Rome, Italy
⁷Unit of Neuroradiology, Fondazione IRCCS Cà Granda Ospedale Maggiore Policlinico, Milan, Italy
⁸Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
⁹Aldo Ravelli” Research Center for Neurotechnology and Experimental Brain Therapeutics, University of Milan, Milan, Italy

Background: Neuroimaging differentiation of glioblastoma, primary central nervous system lymphoma (PCNSL) and solitary brain metastasis (BM) remains challenging in specific cases showing similar appearances or atypical features. Overall, advanced MRI protocols have high diagnostic reliability, but their limited worldwide availability, coupled with the overlapping of specific neuroimaging features among tumor subgroups, represent significant drawbacks and entail disparities in the planning and management of these oncological patients.

Objective: To evaluate the classification performance metrics of a deep learning algorithm trained on T1-weighted gadolinium-enhanced (T1Gd) MRI scans of glioblastomas, atypical PCNSLs and BMs.

Materials and Methods: We enrolled 121 patients (glioblastoma: n=47; PCNSL: n=37; BM: n=37) who had undergone preoperative T1Gd-MRI and histopathological confirmation. Each lesion was segmented, and all ROIs were exported in a DICOM dataset. The patient cohort was then split in a training and hold-out test sets following a 70/30 ratio. A Resnet101 model, a deep neural network (DNN), was trained on the training set and validated on the hold-out test set to differentiate glioblastomas, PCNSLs and BMs on T1Gd-MRI scans.

Results: The DNN achieved optimal classification performance in distinguishing PCNSLs (AUC: 0.98; 95%CI: 0.95 - 1.00) and glioblastomas (AUC: 0.90; 95%CI: 0.81 - 0.97) and moderate ability in differentiating BMs (AUC: 0.81; 95%CI: 0.70 - 0.95). This performance may allow clinicians to correctly identify patients eligible for lesion biopsy or surgical resection.

Conclusion: We trained and internally validated a deep learning model able to reliably differentiate ambiguous cases of PCNSLs, glioblastoma and BMs by means of T1Gd-MRI. The proposed predictive model may provide a low-cost, easily-accessible and high-speed decision-making support for eligibility to diagnostic brain biopsy or maximal tumor resection in atypical cases.

Introduction

Brain metastases (BM), glioblastomas and primary central nervous system lymphomas (PCNSL) are amongst the most common intracranial neoplasms in adults (17%, 14.6%, and 1.9% respectively) (1, 2). Treatments and prognoses differ, and accurate diagnosis is crucial to guide management strategies. Current guidelines suggest maximal surgical resection plus chemoradiation therapy for BMs and glioblastoma and methotrexate-chemotherapy plus whole-brain radiotherapy for PCNSLs (3–6). Biopsy, especially stereotactic, is the diagnostic gold- standard, but the overall complication rate is up to 13% (7). In addition, the use of pre-operative steroids in patients with BMs and glioblastomas, aimed at relieving symptoms, may hinder histopathological diagnoses in PCNSLs, leading to higher false-negative rates (8).

Conventional Magnetic Resonance Imaging (MRI) assists the preoperative diagnostic assessment and guides treatment planning, but lesions may show overlapping radiological features. On T1-weighted gadolinium-enhanced (T1Gd) images, glioblastomas often show peripheral rims of contrast-enhancement and central necroses similar to solitary BMs, whereas PCNSLs frequently exhibit homogeneous enhancement (9, 10). In atypical cases, glioblastomas may display minimal or absent necroses and PCNSLs may show central necroses mimicking glioblastomas (11). Some advanced MRI techniques may support the radiological assessment, for instance by differentiating reduced cerebral blood volume (CBV), characteristic of PCNSLs, from high CBV, frequently reported in glioblastomas (12, 13). However, uncommon hypervascular PCNSLs may be encountered, posing additional diagnostic challenges despite the use of advanced multiparametric imaging. Finally, advanced MRI protocols require greater expertise and expenses, affecting their worldwide applicability (14).

Radiomics has been adopted in neuro-oncology for diagnostic classification and prognostic prediction from the analysis of textural or handcrafted radiological features (15). However, it needs lengthy and meticulous preprocessing steps such as imaging segmentation, manual features selection and extraction. More recently, the introduction of machine learning algorithms significantly improved classification performances (16–18): deep learning methods, in particular deep neural networks (DNN), may automatically perform several computer visions tasks by extracting information directly from radiological sequences (19, 20).

In this study we evaluated the discriminative ability of a deep learning algorithm trained on T1-weighted gadolinium-enhanced (T1-Gd) MRI scans to differentiate glioblastomas, atypical PCNSLs, and BMs, and to improve both diagnostic and interventional workflows.

Material and Methods

Patient’s Selection

Ethical approval was waived by the local Ethics Committee in view of the retrospective nature of the study and all the procedures being performed were part of the routine care. Informed consent was obtained from all individual participants included in the study. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

We retrospectively reviewed the records of 254 consecutive patients with histologically confirmed glioblastoma, PCNSL with atypical features (immunocompetent patients, central necrotic core, no perivascular location/atypical anatomical location according to the literature or increased rCBV) or BM, who underwent preoperative brain MRI between June 2015 and April 2021. Exclusion criteria included: 1) patients with absent or inadequate MR images; 2) patients with previous intracranial intervention (surgical intervention, gamma knife surgery, or radiation therapy) and 3) the presence of multiple enhancing lesions. Adult immunodeficiency syndrome-related or Epstein-Barr virus-related PCNSL were excluded from our analysis as both subtypes of PCNSL might have increased the heterogeneity of the population investigated.

Following these criteria 121 patients were selected and split, following a 70/30 ratio, into a training set, for deep learning model training, and a balanced hold-out test set, to internally validate the developed DNN model.

MR Acquisition and Image Preprocessing

All brain MRI studies were performed with a 3 Tesla scanner (Philips® Achieva, Eindoven, Netherlands) using a conventional 32-channel head coil. The protocol included: axial T2-weighted (T2w) sequence, three-dimensional (3D) Fluid Attenuated Inversion Recovery (FLAIR) sequence, axial diffusion-weighted images (DWI) with b-values of 0-1000 sec/mm2 and contrast-enhanced (Gadovist 0.1 mL/kg; Prohance 0.2 mL/kg) axial and three-dimensional (3D) T1-weighted sequences.

All MR images in the form of digital imaging and communications in medicine (DICOM) were input to the Horos DICOM Viewer (version 3.3.5, www.horosproject.org) a free, open-source medical imaging viewer and analytic tool. Using this software, the regions of interest (ROIs) of these three types of lesions were manually delineated on every section in which the tumoral mass was visualized on preoperative axial CE-T1. After volume acquisition, segmentation and signal intensity normalization, all the ROIs were then centered in a 224x224 pixels black box and exported in PNG file format (Figure 1).

FIGURE 1

Figure 1 The overall Resnet-101 model architecture. The window size and the stride for convolutional, maxpooling and fully connected layers are also presented. Conv, convolutional layer; FC, fully connected layer; GBM, glioblastoma multiforme; PCNSL, primary central nervous system lymphoma; BM, brain metastasis.

Convolutional Neural Network Model

A 2D convolutional neural networks (specifically the Resnet-101 model) with 101 layers consisting of 3-layer residual blocks pre-trained with the ImageNet database was used (21, 22). Hyperparameters of the fully connected layer of ResNet were fine-tuned with the training set data while the convoluting and pooling layers were frozen in order to preserve the features extraction capability of the pretrained Resnet-101 model. The batch size was 32, and a drop-out rate of 0.25 was applied with rectifier linear unit as the activation function to minimize model overfitting. The model was trained for 50 epochs with stochastic gradient descent optimized with the Adam optimizer and the initial learning rate set to 0.005 (23). Batch normalization was used in each layer to improve learning stability (24). The structure of the developed DNN model is depicted in Figure 1.

Each ROI was used as inputs for all the 3 channels expected by the Resnet model and was treated as an independent image to increase the number of input data even though a group of slices belonged to the same patient. However, to prevent data leakage from the training to the hold-out test set, data splitting during training of the model was done per patient and not per section image. The predicted diagnostic class for each patient was the most frequently voted one among its entire ROIs set. Gradient-weighted class activation mapping (Grad-CAM) visualization was used to evaluate which portion of the tumoral lesion the developed DNN model was focusing on in order to make each patient’s diagnostic prediction (an example is reported in Figure 2) (25).

FIGURE 2

Figure 2 Images of a 59-year-old woman with a histologically diagnosed GBM. (A) CE T1WI with the segmented region of interest (ROI). (B) Extracted ROI of the segmented lesion. (C) Extracted ROI and the corresponding gradient-weighted class activation map (Grad-CAM) showing the salient tumor regions identified by Resnet-10 and on which the DNN model relies to make its prediction.

The final model was internally validated on the hold-out test set. The performance metrics reported were computed considering the number of mis considered per patient and not per image.

Performance Metrics

The classification performance of the DNN model was evaluated considering the following performance metrics: 1) Area Under the Receiving Operative Characteristics curve (AUC-ROC); 2) Accuracy; 3) Precision or Positive predictive value (PPV); 4) Negative predictive value (NPV); 5) Recall or sensitivity; 6) Specificity; 7) F-1 score.

A One-vs-the-Rest (OvR) multiclass strategy was employed to extract performance metrics for each outcome class, then the average value and its 95% bootstrap confidence interval were computed for each of the above-mentioned performance metrics on the hold-out test set. A grouped binary comparison was also performed in order to investigate the reliability of the DNN model in distinguishing surgically resectable lesions (glioblastoma or BM) from non-resectable ones (PCNSL).

Finally, the retrospective diagnostic performance of neuroradiologists with at least 10 years of dedicated experience reviewed by means of radiological report charts was computed and defined as “gold standard” for comparison with the DL model.

Statistics, Software and Hardware

Descriptive statistics, frequencies and percentages were used to report tumor volume characteristics. A Shapiro-Wilk normality test was used to assess normality. When appropriate, continuous variables were reported as mean + standard deviation (SD) or median and interquartile range (IQR). Statistical differences in tumor volumes were tested using the ANOVA or Kruskall Wallis test, according to normality of the sample. All the statistical analyses were performed in Jupyter Notebook, using Python v.3.7.6 (https://www.python.org/). The Python packages used for this study included: ‘PyTorch v1.7’ to develop and train the DNN model, ‘Numpy’ for Excel dataset handling; ‘Scikit-learn’ to compute performance metrics and ‘Seaborn’ to plot ROC-AUC. The workstation used to train the DNN model mounted an Intel Core i7-10700K processor while the GPU was a Tesla K80 12GB.

Results

The cohort of selected patients included: 47 glioblastomas (age: 61.3 [48.9-73.7] years), 37 PCNSLs (age: 51.1 [43.3-58,9] years) and 37 BMs (age 59.5 [49.9-69.1] years). The male-to-female ratio was 50/71 (58.3% were female). Median tumor volumes were as follows: glioblastoma (56.31[45.50-69.00], PCNSL (39.00 [31.40-45.25] and BM (56.50 [44.01-65.25].A statistically significant different in tumour volume was found (p=0.03). A total of 3’597 axial slices/ROIs of tumors were extracted from 121 patients with: glioblastoma (1’481 ROIs), PCNSL (1’073 ROIs) and BM (1’043 ROIs). No significant difference in age and sex distribution was found between the three groups of patients. Patients in the metastasis group included those with various primary tumor subtypes: 14 (37.8%) lung cancers, 9 (24.3%) breast cancers, 6 (16.2%) colorectal cancers, 5 (13.5%) melanomas and 3 (8.1%) endometrial cancers.

DNN Model Performance Metrics Evaluation

The trained DNN model, evaluated on the hold-out test set, achieved an AUC of 0.98 (95%CI: 0.95 - 1.00), 0.90 (95%CI: 0.81 - 0.97), 0.81 (95%CI: 0.70 - 0.95) respectively for PCNSL, glioblastoma and BM diagnostic class demonstrating high discriminative ability. High reliability was reported across all performance metrics for atypical PCNSL and moderate for glioblastoma and BM (Table 1)

TABLE 1

Table 1 Convolutional neural networks model’s performance metrics in differentiating PCNSL, Glioblastoma, and BM.

In fact, the model reported 91.57% (76.92% - 100.00%) and 93.54% (87.38%-100.00%) PPV and NPV for PCNSL, respectively, confirming high reliability in ruling in and out candidates in a population-based scenario. By reviewing the capacity to detect BM among all cases, the model was found to be highly reliable in excluding the suspect of BM in favor of a different prediction outcome than confirming it [specificity: 88.46% (76.92% - 95.31%); NPV: 84.37% (74.78%-94.15%)]. Finally, a moderate absolute (sensitivity: 80.01% (71.23% - 100.00%); specificity: 81.84% (69.18% - 95.45%)) and clinical performance was reported for glioblastomas among the cases the model was tested on.

Moreover, the trained DNN model achieved an AUC of 0.92 (95%CI: 0.83 - 0.99) and an accuracy of 94.7% (95%CI: 89.19% - 100.0%) in distinguishing surgically resectable lesion (glioblastoma or BM) from non resectable ones (PCNSL) (Table 2). For each diagnostic outcome class the AUC-ROC curves, achieved on both training and hold-out test set, and the global confusion matrix are depicted in Figure 3.

TABLE 2

Table 2 Convolutional neural networks model’s performance metrics in suggesting lesion resectability.

FIGURE 3

Figure 3 AUC-ROC curves (on both training and hold-out test sets) for each diagnostic outcome class [One-vs-Rest: (A–C)] and global confusion matrix (D). GBM, glioblastoma; PCNSL, primary central nervous system lymphoma; METS, brain metastasis.

DNN Model and Neuroradiological Assessment (Gold Standard) Comparison

As gold standard reference, the classification performance of neuroradiologists with at least 10 years of experience was retrospectively conducted on the same population. Every physician had access to the whole multi-sequence DICOM package and independently classified each tumor according to their knowledge. Overall, an optimal accuracy was reached on each tumor type (atypical PCNSL: 84.38%; glioblastoma: 85.87%; BM: 91.67%). The lowest performance was noted when an atypical PCNL was to identify: the overall sensitivity and specificity were 55.56% and 91.03% (PPV: 58.82%; NPV: 89.87%). Additional information are shown in Table 3.

TABLE 3

Table 3 Neuroradiologists (Gold standard) performance metrics in differentiating PCNSL, Glioblastoma and BM in the cohort examined.

The DL model yielded an increase in accuracy of +14% for PCNSL, +5% for glioblastoma and a decrease of 10% for BM compared to the gold standard. According to the performance metrics evaluation, the most reliable prediction computed by the DL model was atypical PCNSL.

Discussion

In the current study, we demonstrated the feasibility of a deep learning model to differentiate glioblastomas, PCNSLs and BMs in routine clinical settings. We found that our DNN model trained on T1Gd-weighted volumetric MRI axial scans showed 83.08%, 94.65%, 81.07% accuracy rates in differentiating each lesion (glioblastomas, PCNSLs and BMs respectively) against the other two. Our model returned the highest accuracy (94.65%) in identifying PCNSLs against the other classes and moderately high diagnostic accuracy for glioblastomas (83.08%) and BMs (81.07%). Moreover, when asked to define the tumor amenability to maximal safe resection or diagnostic biopsy, the deep learning model returned excellent performance (accuracy: 94.72%) and high reliability (PPV: 91.88%; NPV: 94.76%). The algorithm misclassified BMs more frequently than glioblastomas and PCNSLs, probably due to their higher histological heterogeneity and related variability in radiological features; higher accuracy rates would have been obtained by using a larger dataset.

Finally, when the deep learning model and the gold standard (diagnostic reports by neuroradiologists) were compared and evaluated considering the prevalence of each tumor type in the population investigated, a reliable classification performance of the deep learning algorithm was denoted, especially for PCNSL. The greater discriminative power of neuroradiological reports concerning BMs was unsurprising: access to clinical data, history and multiparametric MR imaging in the real-world clinical practice provided essential information the algorithm had no access to. Glioblastomas were the most represented class in our population (47 vs 37 vs 37 cases) and were more accurately classified by the deep learning model than the gold standard, although these results should be carefully reviewed: in fact, though sensitivity and specificity were comparable between the deep learning model and the reference gold standard, the PPV and the NPV showed slightly lower reliability of these results.

Overall, these findings support the clinical experimentation and applicability of the model in assisting physicians to decide whether to proceed with diagnostic biopsy when PCNSLs are suspected or maximal surgical resection when glioblastomas or BMs are more likely.

No single MRI modality is currently capable of differentiating PCNSLs, BMs, and glioblastomas with absolute accuracy. Recent radiomic studies focused on tumor histology prediction (which showed fair performance rates - up to 75%) reported contradictory results in terms of the most predictive MRI sequences analyzed, limiting their applicability in routine clinical practice (26, 27). Fruehwald-Pallamar et al. (28) found that T2-weighted images were more predictive than FLAIR or T1-weighted scans in differentiating benign from malignant tumors. In contrast, Tiwari et al. (29) and Xiao et al. (30) argued that T1Gd images might be superior by showing distinct borders of contrast-enhancing tumors, increasing the accuracy of ROIs segmentation compared to the unclear borders exhibited in T2-weighted and FLAIR scans. Finally, although advanced MRI techniques may improve classification and differentiation of suspected brain neoplasms, their diagnostic role is limited by the operator-dependent interpretation bias, the high heterogeneity among brain tumors and the additional hardware and set-up protocols required, which are available only at major institutions (31, 32).

Previous studies investigated the role of machine learning models to differentiate glioblastomas from PCNSLs. Kunimatsu et al. (27) developed a support vector machine, which, by analyzing radiomic features, returned a 0.75 accuracy in classifying glioblastomas vs PCNSL. Likewise, Xia et al. (33) designed a deep learning algorithm capable of classifying glioblastomas and PCNSLs from multiple MRI sequences with moderate outcomes (accuracy: 0.884). However, no machine learning models aimed at differentiating glioblastomas, PCNSLs and BMs have been reported yet.

Our deep learning algorithm detected with high discriminative capacity specific microscopic parameters of glioblastomas, PCNSLs and BMs and hidden radiological differences between brain tumors. The rationale behind the use of T1Gd images stems from their superior distinction of tumors borders and clear representation of central necroses, which are pathological hallmarks of most glioblastomas and BMs (34, 35).

The segmentation workflow represents a critical aspect of machine learning and deep learning models’ development. As recent computer-based automated segmentation algorithms need to be clinically validated, manual segmentation is still the current gold- standard, showing overall satisfactory results at the cost of intensive work, task-induced fatigue, and extensive processing time. In a recent study, McAvoy et al. (36) developed an EfficientNetB4 DNN with high classification performance for glioblastoma (accuracy: 0.94) vs PCNSL (accuracy: 0.95) on whole brain scan analysis with no prior image segmentation. The authors advocated the superiority of their model compared to previous machine learning studies, as the overall preprocessing effort was sharply reduced. However, the use of non-segmented whole-brain scans may lead to additional classification bias, as DNNs might learn to accomplish their classification task by relying on features (e.g., anatomical location and laterality) determined by unbalanced and heterogeneous training sets instead of clinically related radiological differences, hence limiting the general applicability of the model (37). On the contrary, lesion segmentation partitions each selected slice into a coherent region of interest (ROI) that is extracted from the background and individually processed to acquire overall lesion’s characteristics (either ROI or boundaries). Hence, in our investigation, trained personnel performed manual tumor segmentation.

In 2012 the MICCAI-Brain Tumor Segmentation Challenge (BRATS) (38) was intended to collect the best performing automated segmentation algorithms for brain tumors. The winning algorithm reported high performance in glioma segmentation, but these results are still experimental as automated tools might tend to overestimate volumes and suffer from gross accuracy errors in delineating tumor boundaries (39, 40). In addition, the inclusion of different tumor types in our study would have required several automated segmentation algorithms bearing different and incomparable segmentation performances, introducing larger biases in our training and validation sets than manual segmentation.

We propose the first combined diagnostic “next-move” support tool to assist neuroradiologists in differentiating atypical tumor cases, and neurosurgeons in surgical decision-making processes between resection or biopsy. We trained our model, a low-cost decision-making support solution with extremely high computation speed (within 10 seconds/patient), on a large dataset of conventional T1Gd scans, enabling wider clinical implementations even within institutions with limited resources and restricted access to advanced MRI modalities. Finally, our study design for DNN training and internal validation was built on open-source python packages, and our methodology could be reproduced and externally validated with image datasets from other institutions.

Our study design has some limitations. The number of included patients is larger than most other studies but remains relatively limited and might not address the vast heterogeneity of radiological features that glioblastomas, PCNSLs and BMs exhibit in real clinical settings. Indeed, the limited sample size resulted from selection and inclusion of selected radiologically atypical tumors. We performed a monocentric analysis of images acquired with a specific MRI scanner: for this reason, a validation at different institutions must be run to test model generalizability.

The deep learning architecture itself might introduce some downsides. First, generalizability is often mislead when input data comes from different machines (vendor, models, protocols etc.), as different parametrizations of the input data might alter patterns of deep learning as a consequence of data distribution shift: to minimize this limit, all patients underwent imaging acquisition with a standardized protocol using the same MR scan. Finally, several authors recognized the implicit “black-box” computation as a methodological limitation restraining wider application in clinical practice. In the current study, we deployed a “human-intelligible” visualization method – the Grad-CAM algorithm – to overcome this drawback. However, additional intelligible algorithms have been proposed and comparison of the latter was beyond the scope of the current investigation.

In this study, we trained and internally validated a deep learning algorithm to differentiate atypical and radiologically overlapping cases of glioblastomas, PCNSLs and BMs on T1Gd sequences. A secondary analysis to define the best next-step intervention was conducted with outstanding performance. Other than externally validate our findings, further investigation should prospectively compare the diagnostic and management performances of neuroradiologists and neurosurgeons whether they implement our DL algorithm or not. The proposed model provides a low-cost, easily accessible and high-speed decision-making support for eligibility to diagnostic brain biopsy or maximal tumor resection in atypical tumor cases.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

Conceptualization: LT, VC, and ML. Methodology: LT, VC, GF, LM, GCo and LS. Formal analysis and investigation: LT and VC. Writing—original draft preparation: LT, VC, MG, PP, GR, and GB. Writing—review and editing: MG, GB, SB, MP, MC, GR, GCo, GCa, and ML. Resources: GCo, ML, and FT. Supervision: ML, FT, GB, and GCo. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by the Italian Ministry of Health (Ricerca Corrente 2022).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Ostrom QT, Patil N, Cioffi G, Waite K, Kruchko C, Barnholtz-Sloan JS. CBTRUS Statistical Report: Primary Brain and Other Central Nervous System Tumors Diagnosed in the United States in 2013-2017. Neuro Oncol (2020) 22:IV1–96. doi: 10.1093/neuonc/noaa200

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Nayak L, Lee EQ, Wen PY. Epidemiology of Brain Metastases. Curr Oncol Rep (2012) 14:48–54. doi: 10.1007/s11912-011-0203-y

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Patchell RA. The Management of Brain Metastases. Cancer Treat Rev (2003) 29:533–40. doi: 10.1016/S0305-7372(03)00105-1

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Alifieris C, Trafalis DT. Glioblastoma Multiforme: Pathogenesis and Treatment. Pharmacol Ther (2015) 152:63–82. doi: 10.1016/J.PHARMTHERA.2015.05.005

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Bush NAO, Chang SM, Berger MS. Current and Future Strategies for Treatment of Glioma. Neurosurg Rev (2017) 40(1):1–14. doi: 10.1007/s10143-016-0709-8

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Ferreri AJM, Reni M, Villa E. Therapeutic Management of Primary Central Nervous System Lymphoma: Lessons From Prospective Trials. Ann Oncol (2000) 11:927–37. doi: 10.1023/A:1008376412784

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Riche M, Amelot A, Peyre M, Capelle L, Carpentier A, Mathon B. Complications After Frame-Based Stereotactic Brain Biopsy: A Systematic Review. Neurosurg Rev (2021) 44:301–7. doi: 10.1007/s10143-019-01234-w

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Batchelor T, Loeffler JS. Primary CNS Lymphoma. J Clin Oncol (2006) 24:1281–8. doi: 10.1200/JCO.2005.04.8819

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Al-Okaili RN, Krejza J, Woo JH, Wolf RL, O’Rourke DM, Judy KD, et al. Intraaxial Brain Masses: MR Imaging–based Diagnostic Strategy—Initial Experience. Radiology (2007) 243:539–50. doi: 10.1148/radiol.2432060493

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Baris MM, Celik AO, Gezer NS, Ada E. Role of Mass Effect, Tumor Volume and Peritumoral Edema Volume in the Differential Diagnosis of Primary Brain Tumor and Metastasis. Clin Neurol Neurosurg (2016) 148:67–71. doi: 10.1016/j.clineuro.2016.07.008

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Bühring U, Herrlinger U, Krings T, Thiex R, Weller M, Küker W. MRI Features of Primary Central Nervous System Lymphomas at Presentation. Neurology (2001) 57:393–6. doi: 10.1212/WNL.57.3.393

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Wang S, Kim S, Chawla S, Wolf RL, Knipp DE, Vossough A, et al. Differentiation Between Glioblastomas, Solitary Brain Metastases, and Primary Cerebral Lymphomas Using Diffusion Tensor and Dynamic Susceptibility Contrast-Enhanced MR Imaging. Am J Neuroradiol (2011) 32:507–14. doi: 10.3174/ajnr.A2333

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Kickingereder P, Wiestler B, Sahm F, Heiland S, Roethke M, Schlemmer H-P, et al. Primary Central Nervous System Lymphoma and Atypical Glioblastoma: Multiparametric Differentiation by Using Diffusion-, Perfusion-, and Susceptibility-Weighted MR Imaging. Radiology (2014) 272:843–50. doi: 10.1148/radiol.14132740

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Zhang P, Liu B. Differentiation Among Glioblastomas, Primary Cerebral Lymphomas, and Solitary Brain Metastases Using Diffusion-Weighted Imaging and Diffusion Tensor Imaging: A PRISMA-Compliant Meta-Analysis. ACS Chem Neurosci (2020) 11:477–83. doi: 10.1021/acschemneuro.9b00698

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images Are More Than Pictures, They Are Data. Radiology (2016) 278:563–77. doi: 10.1148/radiol.2015151169

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Zhou M, Scott J, Chaudhury B, Hall L, Goldgof D, Yeom KW, et al. Radiomics in Brain Tumor: Image Assessment, Quantitative Feature Descriptors, and Machine-Learning Approaches. Am J Neuroradiol (2018) 39:208–16. doi: 10.3174/ajnr.A5391

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Khan DZ, Luengo I, Barbarisi S, Addis C, Culshaw L, Dorward NL, et al. Automated Operative Workflow Analysis of Endoscopic Pituitary Surgery Using Machine Learning: Development and Preclinical Evaluation (IDEAL Stage 0). J Neurosurg (2021) 5:1–8. doi: 10.3171/2021.6.JNS21923

CrossRef Full Text | Google Scholar

18. Tariciotti L, Fiore G, Carrabba G, Bertani GA, Schisano L, Borsa S, et al. A Supervised Machine Learning Algorithm Predicts Intraoperative CSF Leak in Endoscopic Transsphenoidal Surgery for Pituitary Adenomas: Model Development and Prospective Validation. J Neurosurg Sci (2021). doi: 10.23736/S0390-5616.21.05295-4

CrossRef Full Text | Google Scholar

19. Chartrand G, Cheng PM, Vorontsov E, Drozdzal M, Turcotte S, Pal CJ, et al. Deep Learning: A Primer for Radiologists. Radiographics (2017) 37:2113–31. doi: 10.1148/rg.2017170077

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Tariciotti L, Palmisciano P, Giordano M, Remoli G, Lacorte E, Bertani G, et al. Artificial Intelligence-Enhanced Intraoperative Neurosurgical Workflow: State of the Art and Future Perspectives. J Neurosurg Sci (2021). doi: 10.23736/S0390-5616.21.05483-7

CrossRef Full Text | Google Scholar

21. He K, Zhang X, Ren S, Sun J. “Deep Residual Learning for Image Recognition,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015) pp. 770–8. doi: 10.1109/CVPR.2016.90

CrossRef Full Text | Google Scholar

22. Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L. Imagenet: A Large-Scale Hierarchical Image Database. IEEE Conf Comput Vis Pattern Recognit (2009) 248–55. doi: 10.1109/CVPR.2009.5206848

CrossRef Full Text | Google Scholar

23. Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. (2014).

Google Scholar

24. Ioffe S, Szegedy C. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. IEEE (2017).

Google Scholar

25. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization. In: 2017 IEEE International Conference on Computer Vision (ICCV) (IEEE). p. 618–26. doi: 10.1109/ICCV.2017.74

CrossRef Full Text | Google Scholar

26. Larroza A, Bodí V, Moratal D. “Texture Analysis in Magnetic Resonance Imaging: Review and Considerations for Future Applications”, In (Ed.), Assessment of Cellular and Organ Function and Dysfunction using Direct and Derived MRI Methodologies. IntechOpen (2016). doi: 10.5772/64641

CrossRef Full Text | Google Scholar

27. Kunimatsu A, Kunimatsu N, Yasaka K, Akai H, Kamiya K, Watadani T, et al. Machine Learning-Based Texture Analysis of Contrast-Enhanced Mr Imaging to Differentiate Between Glioblastoma and Primary Central Nervous System Lymphoma. Magn Reson Med Sci (2019) 18:44–52. doi: 10.2463/mrms.mp.2017-0178

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Fruehwald-Pallamar J, Hesselink J, Mafee M, Holzer-Fruehwald L, Czerny C, Mayerhoefer M.Texture-Based Analysis of 100 MR Examinations of Head and Neck Tumors – Is It Possible to Discriminate Between Benign and Malignant Masses in a Multicenter Trial? RöFo - Fortschritte auf dem Gebiet der Röntgenstrahlen und der Bildgeb Verfahren (2015) 188:195–202. doi: 10.1055/s-0041-106066

CrossRef Full Text | Google Scholar

29. Tiwari P, Prasanna P, Rogers L, Wolansky L, Badve C, Sloan A, et al. Texture Descriptors to Distinguish Radiation Necrosis From Recurrent Brain Tumors on Multi-Parametric MRI. In: Medical Imaging 2014: Computer-Aided Diagnosis. SPIE. (2015) p. 90352B. doi: 10.1117/12.2043969

CrossRef Full Text | Google Scholar

30. Xiao D-D, Yan P-F, Wang Y-X, Osman MS, Zhao H-Y. Glioblastoma and Primary Central Nervous System Lymphoma: Preoperative Differentiation by Using MRI-Based 3D Texture Analysis. Clin Neurol Neurosurg (2018) 173:84–90. doi: 10.1016/j.clineuro.2018.08.004

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Cha S, Lupo JM, Chen MH, Lamborn KR, McDermott MW, Berger MS, et al. Differentiation of Glioblastoma Multiforme and Single Brain Metastasis by Peak Height and Percentage of Signal Intensity Recovery Derived From Dynamic Susceptibility-Weighted Contrast-Enhanced Perfusion MR Imaging. Am J Neuroradiol (2007) 28:1078–84. doi: 10.3174/ajnr.A0484

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Qin J, Li Y, Liang D, Zhang Y, Yao W. Histogram Analysis of Absolute Cerebral Blood Volume Map Can Distinguish Glioblastoma From Solitary Brain Metastasis. Med (Baltimore) (2019) 98:e17515. doi: 10.1097/MD.0000000000017515

CrossRef Full Text | Google Scholar

33. Xia W, Hu B, Li H, Shi W, Tang Y, Yu Y, et al. Deep Learning for Automatic Differential Diagnosis of Primary Central Nervous System Lymphoma and Glioblastoma: Multi-Parametric Magnetic Resonance Imaging Based Convolutional Neural Network Model. J Magn Reson Imaging (2021) 54:880–7. doi: 10.1002/JMRI.27592

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Raza SM, Lang FF, Aggarwal BB, Fuller GN, Wildrick DM, Sawaya R. Necrosis and Glioblastoma: A Friend or a Foe? A Review and a Hypothesis. Neurosurgery (2002) 51:2–13. doi: 10.1097/00006123-200207000-00002

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Palmisciano P, Haider AS, Nwagwu CD, Wahood W, Aoun SG, Abdullah KG, et al. Bevacizumab vs Laser Interstitial Thermal Therapy in Cerebral Radiation Necrosis From Brain Metastases: A Systematic Review and Meta-Analysis. J Neurooncol (2021) 154:13–23. doi: 10.1007/s11060-021-03802-x

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Mcavoy M, Prieto PC, Kaczmarzyk JR, Fernández IS, Mcnulty J, Smith T, et al. Classification of Glioblastoma Versus Primary Central Nervous System Lymphoma Using Convolutional Neural Networks. Sci Rep (2021) 11:1–7. doi: 10.1038/s41598-021-94733-0

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Elsayed A, Coenen F, Garca-Fiana M, Sluming V. Region Of Interest Based Image Classification: A Study in MRI Brain Scan Categorization. In: Data Mining Applications in Engineering and Medicine. InTech. doi: 10.5772/50019

CrossRef Full Text | Google Scholar

38. Menze BH, Jakab A, Bauer S, Kalpathy-Cramer J, Farahani K, Kirby J, et al. The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS). IEEE Trans Med Imaging (2015) 34:1993–2024. doi: 10.1109/TMI.2014.2377694

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Jiang Z, Ding C, Liu M, Tao D. Two-Stage Cascaded U-Net: 1st Place Solution to Brats Challenge 2019 Segmentation Task. Cham: Springer International Publishing (2020). doi: 10.1007/978-3-030-46640-4_22

CrossRef Full Text | Google Scholar

40. Zeppa P, Neitzert L, Mammi M, Monticelli M, Altieri R, Castaldo M, et al. How Reliable are Volumetric Techniques for High-Grade Gliomas? A Comparison Study of Different Available Tools. Neurosurgery (2020) 87:E672–9. doi: 10.1093/neuros/nyaa282

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: brain metastases, deep learning, glioblastoma, machine learning, primary central nervous system lymphoma (PCNSL), artificial intelligence

Citation: Tariciotti L, Caccavella VM, Fiore G, Schisano L, Carrabba G, Borsa S, Giordano M, Palmisciano P, Remoli G, Remore LG, Pluderi M, Caroli M, Conte G, Triulzi F, Locatelli M and Bertani G (2022) A Deep Learning Model for Preoperative Differentiation of Glioblastoma, Brain Metastasis and Primary Central Nervous System Lymphoma: A Pilot Study. Front. Oncol. 12:816638. doi: 10.3389/fonc.2022.816638

Received: 16 November 2021; Accepted: 31 January 2022;
Published: 24 February 2022.

Edited by:

Lorenzo Cobianchi, University of Pavia, Italy

Reviewed by:

Daniele Piccolo, University of Padua, Italy
Constantin Tuleasca, Centre Hospitalier Universitaire Vaudois (CHUV), Switzerland

Copyright © 2022 Tariciotti, Caccavella, Fiore, Schisano, Carrabba, Borsa, Giordano, Palmisciano, Remoli, Remore, Pluderi, Caroli, Conte, Triulzi, Locatelli and Bertani. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Leonardo Tariciotti, bGVvbmFyZG8udGFyaWNpb3R0aUB1bmltaS5pdA==; bGVvbmFyZG90YXJpY2lvdHRpbWRAZ21haWwuY29t

†These authors share first authorship

‡These authors share senior authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

A Deep Learning Model for Preoperative Differentiation of Glioblastoma, Brain Metastasis and Primary Central Nervous System Lymphoma: A Pilot Study

Introduction

Material and Methods

Patient’s Selection

MR Acquisition and Image Preprocessing

Convolutional Neural Network Model

Performance Metrics

Statistics, Software and Hardware

Results

DNN Model Performance Metrics Evaluation

DNN Model and Neuroradiological Assessment (Gold Standard) Comparison

Discussion

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Conflict of Interest

Publisher’s Note

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good