- 1Department of Medical Physics, Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, Aviano, Italy
- 2Department of Radiation Oncology, Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, Aviano, Italy
- 3Guerbet SA, Villepinte, France
- 4Breast Surgery Unit, Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, Aviano, Italy
- 5Department of Radiology, Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, Aviano, Italy
- 6Department of Radiation Oncology, Udine General Hospital, Udine, Italy
- 7Department of Radiation Oncology, University of Michigan, Ann Arbor, MI, United States
Purpose: to predict the occurrence of late subcutaneous radiation induced fibrosis (RIF) after partial breast irradiation (PBI) for breast carcinoma by using machine learning (ML) models and radiomic features from 3D Biologically Effective Dose (3D-BED) and Relative Electron Density (3D-RED).
Methods: 165 patients underwent external PBI following a hypo-fractionation protocol consisting of 40 Gy/10 fractions, 35 Gy/7 fractions, and 28 Gy/4 fractions, for 73, 60, and 32 patients, respectively. Physicians evaluated toxicity at regular intervals by the Common Terminology Adverse Events (CTAE) version 4.0. RIF was assessed every 3 months after the completion of radiation course and scored prospectively. RIF was experienced by 41 (24.8%) patients after average 5 years of follow up.
The Hounsfield Units (HU) of the CT-images were converted into relative electron density (3D-RED) and Dose maps into Biologically Effective Dose (3D-BED), respectively. Shape, first-order and textural features of 3D-RED and 3D-BED were calculated in the planning target volume (PTV) and breast. Clinical and demographic variables were also considered (954 features in total). Imbalance of the dataset was addressed by data augmentation using ADASYN technique. A subset of non-redundant features that best predict the data was identified by sequential feature selection. Support Vector Machines (SVM), ensemble machine learning (EML) using various aggregation algorithms and Naive Bayes (NB) classifiers were trained on patient dataset to predict RIF occurrence. Models were assessed using sensitivity and specificity of the ML classifiers and the area under the receiver operator characteristic curve (AUC) of the score functions in repeated 5-fold cross validation on the augmented dataset.
Results: The SVM model with seven features was preferred for RIF prediction and scored sensitivity 0.83 (95% CI 0.80–0.86), specificity 0.75 (95% CI 0.71–0.77) and AUC of the score function 0.86 (0.85–0.88) on cross-validation. The selected features included cluster shade and Run Length Non-uniformity of breast 3D-BED, kurtosis and cluster shade from PTV 3D-RED, and 10th percentile of PTV 3D-BED.
Conclusion: Textures extracted from 3D-BED and 3D-RED in the breast and PTV can predict late RIF and may help better select patient candidates to exclusive PBI.
Introduction
Subcutaneous radiation induced fibrosis (RIF) is characterized by a progressive induration and thickening of the subcutaneous tissues and is one of the late adverse effects of breast radiotherapy (RT) mostly affecting cosmesis. It is a dose dependent and slowly progressive side effect originating from a proliferative response of surviving fibrocytes to growth factors (e.g., the transforming growth factor β (TGF-β), released in response to tissue injury) (1).
The available tools to predict late subcutaneous fibrosis in patients treated with RT are of limited quality. Models to predict Normal Tissue Complication Probability (NTCP) for RIF after breast RT have been first fitted to published data of rates of incidence from whole breast irradiation (WBI) (2). Later, models for NTCP of RIF have been refined by including dose volume data from simulated dose distributions of WBI (3) and partial breast irradiations (PBI) (4).
Quantitative analysis of medical images could provide information about intensity, shape, size or volume, and texture of tumor or organs at risk that is distinct or complementary to that provided by other data sources (5). Recently, the combination of quantitative analysis of radiological images with Machine Learning (ML) methods, also known as “radiomics,” has been applied also to predict side effects of RT such as lung-injury following Stereotactic Body RT (SBRT) for lung cancer (6), gastrointestinal and genitourinary toxicities (7) and xerostomia (8).
Other 3D information, as dose distribution delivered in RT calculated on pre-treatment Computer Tomography (CT), can be integrated in the radiomics analysis. The textural analysis of dose distribution could provide more detailed spatial information on the 3D dose distribution: it attempts to extract spatial features from dose distribution to predict RT response instead of dose-volume histogram (DVH) typically used in NTCP models. Dosiomics, or integration of dose features from the irradiated lung, has shown to be predictive of radiation pneumonitis with higher accuracy than DVH-based NTCP models (9).
The purpose of the present work is to develop a model to improve the accuracy of prediction of RIF by integrating data from pre-treatment CT, 3D dose distribution and clinical variables. For this purpose, we developed a ML classifier, that is, a predictive model assigning an unseen patient to one of two possible classes: patient with or without RIF during follow-up. Our study is the first, to the best of our knowledge, to derive a classifier for RIF which includes radiomic variables and individual dose data using ML algorithms.
Methods
Patient Data
One hundred sixty-five patients treated with breast conservative surgery for an early stage ductal carcinoma who underwent external PBI were retrospectively analyzed. Patient characteristics, with results of univariate statistical tests to investigate correlation with RIF, are shown in Table 1. All patients underwent a complete free breathing pre-treatment planning CT to include all the organs at risk (OAR), according to the RTOG 0413 protocol (10). CTs were acquired with a GE Lightspeed RT (GE Medical Systems, Waukesha, WI) or a Toshiba Aquilion LB (Toshiba Medical Systems Europe, Zoetermeer, the Netherlands) using 120 kVp, 215–300 mAs 5 mm slice thickness, and voxel size ranging from 0.977 to 1.074 mm.
The clinical target volume (CTV) consisted of the lumpectomy cavity, identified by the post-surgery seroma or by the surgical clips, uniformly expanded by 15 mm, limited to 5 mm from the skin surface and 5 mm from the lung-chest wall interface. The planning target volume (PTV) was calculated from the CTV using uniform 3D expansion of 1 cm, then it was limited to exclude the part outside the ipsilateral breast, the first 5 mm of tissue under the skin and the expansion beyond the posterior extent of breast tissue. Breast tissue visible on the pre-treatment planning CT was outlined, according to the RTOG “Breast Cancer Atlas for Radiation therapy planning: consensus definition” (11).
Patients were treated following a hypo-fractionation protocol (12) designed using iso-effective doses for subcutaneous RIF based on NTCP models (4). The hypofractionation schemes consisted of 40 Gy in 10 fractions (73 patients), 35 Gy in 7 fractions (60) and 28 Gy in 4 fractions fractions. The RT technique consisted of “field-in-field” planning (forward-planned intensity modulated RT) (14) using multiple planar and non-coplanar 6-MV photon beams, and delivered by a Trilogy linear accelerator equipped with a kV on-board imager system and a 120-leaves Millennium multi-leaf collimator (Varian Medical Systems, Palo Alto, CA, US).
All treatments were developed using the Eclipse treatment planning system (Varian Medical), and dose calculations were carried out using the anisotropic analytical algorithm (AAA) with a grid resolution of 2.5 mm, taking into account heterogeneity correction. The CT scan, dose matrix and Region Of Interest (ROI) contours were exported in a DICOM format.
Physicians evaluated toxicity at regular intervals by the Common Terminology Criteria for Adverse Events (CTCAE) (version 4.0). Clinical and demographic variables, age, presence of comorbidities (diabetes and rheumatological disorders), tumor histology, laterality and quadrant, administration of chemotherapy and hormone therapy, were considered (954 features in total). The presence of RIF of any grade (grade 1 or more) was assessed every 3 months after the completion of radiation course and scored in a prospective database. Forty-one (24.8%) patients experienced RIF after average 5 years of follow up. Fibrosis of grade 1, 2, and 3 occurred in 26, 12, and 3 patients, respectively. The maximum toxicity score (Grade 4) was not recorded during follow up.
Radiomic Analysis of BED and RED
Prior to the calculation of radiomic features, resampling to isotropic voxel size was applied to have standardized voxel spacing across the cohort (15). For example, all CT images were resampled to 3 × 3 × 3 mm3 (16). Voxel intensities were grouped into 64 equally spaced bins to reduce image noise and normalize intensities across all different patients.
In order to remove dependency on the Hounsfield scale used by the two scanners (17), the images were converted from Hounsfield Units to electron density relative to water (3D-RED) using the Hounsfield Units—RED conversion scales of the CTs (Figure 1) as measured on phantom on each CT scanner. Since patients were treated with different fractionation schemes, the 3D dose distributions were converted into 3D Biologically Effective Dose (3D-BED) using the number of fractions of the treatment, and an assumed value of α/β of 3Gy, typical of late-responding tissues as subcutaneous tissue, which has also been used to model RIF (2, 18). A total of 21 shapes, 57 radiomic and 57 dosimetric textural features were calculated from the PTV and breast volume in the 3D-RED and 3D-BED. The radiomic features were calculated following definitions and nomenclature from the Image Biomarker Standardization Initiative (IBSI) (19) using an in-house Matlab code. The in-house code had been previously validated by comparing its results with the Ibex open source software (20). The same features were also calculated after application of one between Gaussian, Laplacian of Gaussian (LoG), or Median filtering to 3D-RED and 3D-BED. The clinical variables follow up, age, tumor location, pathological tumor size, chemo and hormone therapy, were also collected and included in the analysis, so that the variables were 954 in total. A common problem in application of ML classifiers is that some classes have a significantly higher number of examples, a problem which is referred to as class imbalance. The effect of imbalanced datasets on ML performance is detrimental (21, 22), and there are two methods for overcoming this issue, namely under-sampling and over-sampling, of which the latter has been proven to be more effective in ML (21).
Figure 1. Axial views of 3D-BED and 3D-RED in a patient who did not experience late RIF (A) and one who developed late RIF during follow up (B).
We then applied the Adaptive Synthetic Sampling Method for Imbalanced Data (ADASYN) over-sampling technique (23), an improved variant of the Synthetic Minority Over-sampling Technique (SMOTE), which generates synthetic data points by interpolating new feature values between the minority instance and its neighbors, according to the Euclidean distance, in the feature space. In ADASYN, the new minority samples are generated using a density distribution based on the number of out-of-class neighbors so that a minority instance surrounded by more out-of-class instances is considered hard-to-train, and is thus given a higher probability to be augmented (24).
ADASYN was applied to the level that the imbalance was completely eliminated, resulting in an augmented dataset of 252 patients, of which 50% had late RIF. All the analysis was performed using Matlab (Mathworks, Natick, MA).
ML Models
The occurrence of RIF in patients was converted into a binary outcome, positive for patients who experienced any grade (one or more) of RIF, and negative for patients who did not experience RIF during follow up.
The ML process includes two phases (5). First, to prevent overfitting, prior to applying ML classification, Stepwise forward feature selection was used to select a subset of variables best suited to predict late fibrosis. In Stepwise feature selection, terms from a generalized linear model are removed or added in order to find the subset of variables in the data set resulting in the smallest model with lowest prediction error (25).
Forward stepwise selection is a wrapper method of feature selection, that is, a method which uses a learning technique, in our case a generalized linear model (GLM), to evaluate the importance of the features. Forward selection starts with an empty model. Then at each iteration, the single feature that best improves the fit of the GLM according to a specified criterion is determined (26). As a criterion we used the deviance of the values predicted by the GLM from the test data in a 5-fold cross validation. This is repeated until a best subset predictors (features) are selected.
In order to choose a proper number of variables, the process was initially performed with 4 variables allowed in the feature selection, then repeated with increasing number of variables.
After feature selection, the following binary ML classifiers were applied to the dataset to predict RIF:
1) SVM, which, by means of a kernel function, projects the data into a higher-dimensional feature space and determine a hyperplane in this feature space which separates data points into two categories (27). During the optimization, the proper box constraint level and kernel scale are chosen.
2) Ensemble machine learning (EML) which aggregates multiple learners into a single learner. Decision Trees were used as weak learners (5). During training, the best ensemble EML algorithm is selected between Random forests, Adaptive Logistic Regression and various boosting algorithms: Adaptive, Gentle, and Random Undersampling boosting (28) as well as the optimal number of learning cycles, learning rate, and minimum leaf size.
3) Naïve-Bayesian (NB) classifier which calculates the probability of each class assuming the conditional independence of the attributes using the Naive Bayes formula. A new instance is classified into the class with maximum calculated probability (29, 30). The optimizer also searches the best type of probability distribution (Gaussian or Kernel) and width of the kernel function.
The model to predict the occurrence of RIF was chosen according to the following criteria. First, the performance of the models was evaluated by calculating the average and 95% confidence intervals of sensitivity and specificity of the classifier and the AUC of the score function used by the classifiers in a 5-fold cross validation repeated 500 times in the augmented dataset. Finally, the sensitivity, specificity, and AUC were recalculated on the original, non-augmented dataset.
The models were required to have at least sensitivity and specificity of 0.75, and AUC of the model score of 0.85. Second, models were required to provide a realistic description of occurrence of RIF vs. BED variables. For this purpose, the score used by the best performing models to predict RIF was calculated vs. variables from 3D-BED variables. For biological consistency, models were required to have a continuously monotonic response to increasing dose. Models with non-monotonically increasing dose response were discarded, as this would imply that two different doses can lead to the same risk of side effects and that increasing dose could reduce the risk (31).
Results
The variables selected for ML are shown in Table 2. Two were textural variables of 3D-BED from the breast, cluster shade and Run Length Non-uniformity (RLN) after application of LoG filter, two were histogram (kurtosis and range) and one textural (Gray Level Co-occurrence Matrix Cluster shade) features (19) from the 3D-RED in the PTV and two histogram (10th percentile and inverse variance) variables of 3D-BED in the PTV. Among these, three variables (RLN of 3D-BED in breast, kurtosis of 3D-RED in PTV, 10th percentile of 3D-RED in PTV) were significantly correlated with occurrence of RIF according to the Wilcoxon-Mann-Whitney test for independent samples. No clinical variable was selected in the model.
EML with Adaptive Boosting was the best performing model for any number of variables, and it scored an AUC of the radiomic signature of 0.87 (0.85–0.90) with only 6 variables. SVM was the second best performing classifier as it achieved acceptable scores with 7 variables, while Native Bayes gave generally poor performance in terms of specificity (Table 3).
To interpret the features, their values were investigated in the two subsets of patients having extreme values of the function score. These patients were chosen as the 5% with the lowest score function among those without RIF, and the 5% with the highest score function of those who had RIF. Their features are shown in Table 4.
The score functions of the SVM and EML classifiers, were plotted against the 10th percentile of 3D-BED in the PTV for two values of kurtosis, that is, the average values of the patients at low and high risk of RIF, with the other features fixed at their average values among all the patients (Figure 2). The EML model was then discarded, as it showed a non-monotonically increasing dose-score function. The 7 variables SVM, scoring sensitivity 0.83 (95% CI 0.80–0.86), specificity 0.75 (95% CI 0.71–0.77) and AUC of the score function 0.86 (0.85–0.88) on cross-validation, was chosen as the preferred model. The model had sensitivity, specificity and AUC of 0.81, 0.77, and 0.86 respectively in the original dataset.
Figure 2. Score function of EML (A) and SVM (B) used to classify patients vs. 10th percentile of 3D-BED in the PTV. The curves are calculated for values of kurtosis typical of patients at low and high risk of RIF, chosen as average of kurtosis in the 10 patients without RIF with lowest score (blue lines) and in 10 patients with RIF with highest score (red lines).
Discussion
Supervised ML methods have been increasingly used in medicine, especially in the field of radiomics (32) to identify patients as responders or not responders but also to predict side effects in OARs (13, 13, 27). They are prone to overfitting, an event in which the model will better reflect noise in the image than the data themselves (33). Hence, careful feature selection and validation must be performed to tackle this limitation. In our results, EML is an example of ML models which overfitted the data, as it provided the best performance on repeated cross-validation for any number of variables, but produced unrealistic dose-response (Figure 2A) which was not monotonic. It was then discarded in favor of the SVM, whose score was monotonically increasing as a function of dose (Figure 2B). SVM models are more robust to overfitting than other ML methods such as decision trees (34), because they tolerate some points on the wrong side of the hyperplane, thus improving model robustness and generalization (5). Because the AUC of the SVM model of 0.86 on the non-augmented dataset is considered excellent (35) and, as the sum of sensitivity and specificity is 1.58, larger than 1.5, the model fulfills the rule of thumb for being useful of a clinical test (36), it can be used to stratify patients according to the risk of subcutaneous RIF.
To the best of our knowledge, this is the first study using radiomic features extracted from dose distribution after conversion to 3D-BED, which was necessary since our patients had different fractionation schemes. As the BED variables were the most correlated with RIF, our analysis confirms that radiation-induced RIF is governed by BED calculated with α/β = 3 Gy to the whole breast and to the high dose region, the PTV. This result is in agreement with previous clinical findings showing that fibrosis is related to dose and dose per fraction (18). On the other hand, a correlation between RIF and maximum dose has been observed in clinical data for both WBI (37) and PBI (38). RIF after PBI has been related to minimum PTV dose (39).
The features that were most correlated with RIF were BED features10th percentile and variance in the PTV and RLN of the breast, cluster shade to both PTV and breast.
Among the BED features, the 10th percentile in PTV is a descriptor of the minimum BED to the PTV, and describes the dependency of fibrosis on the lowest fractionation-corrected dose covering at least 90% of PTV. Cluster shade of BED in the PTV describes asymmetry of the GLCM. A larger module of cluster shade implies large GLCM asymmetry (19), which means that there are regions in the PTV with large differences in BED from their neighbors and may be related to the presence of hot spots in the PTV. Of note, if a PTV is close to the patient's surface, like in Figure 1B, there is a sudden change of dose in the build-up region which may increase cluster shade of BED. These associations are confirmed by larger variance of BED to the PTV in patients with RIF (Table 4).
RLN of BED in the breast describes the similarity among run lengths, defined as the lengths of consecutive voxels having the same dose value in a specified direction, in number of voxels (19) throughout the breast. RLN is related to homogeneity of dose, and lower values indicate more homogeneity among run lengths in the image. In our results patients without RIF had lower RLN of BED (more inhomogeneity). This may be due to larger “out-of-field” areas of the breast in patients less at risk of fibrosis (Figure 1A) that, being irradiated with low, uniform doses from scattered radiation, tend to have larger runs of voxels with the same values of dose from scattered radiation. An example of this situation can be observed in Figure 1A, and suggests that a steep dose gradient outside of the PTV may be beneficial to prevent fibrosis.
These findings indicate that the radiomic BED variables show that higher BED and presence of hot spots of BED in the PTV, as well as higher volumes receiving intermediate doses out of the PTV, as in Figure 1B, are related to occurrence of fibrosis.
The hypothesis underlying the application of radiomics to predict side effects in OARs is that a patient who is more at risk of side effect has a particular appearance of the organ at risk in pretreatment CT from the patient at lower risk. Often, these models are still perceived as “black boxes,” meaning that it is difficult to determine how they arrive at their predictions, which impairs their use by clinicians as part of their clinical practice (40, 41). To address this issue, we provide interpretation of the radiomic features that are selected by the models. In our results, it was found that 3D-RED kurtosis in the PTV was correlated with a higher risk for RIF. Because kurtosis describes inhomogeneity of the electron density of the breast, the patients with more inhomogeneous breast (small kurtosis) are more sensitive, that is, have higher function score for all dose values (Figure 2). Fat, which is radiolucent, appears dark on a CT, while epithelial and stromal tissue appear radiodense and may represent connective tissues (42, 43). Senescence may in the human mammary epithelium be at the origin of RIF (44) and RT may have a more pronounced effect on stroma (42). Thus, an already dense breast could be more prone to developing fibrosis. As younger patients have more inhomogeneous breast, this result seems in agreement with studies reporting worse cosmetic results in young patients [e.g., (45)], who typically have a denser breast. In our results, however, fibrosis was not correlated with age, neither kurtosis (Pearson correlation p = 0.28). This lack of correlations with age could be due to the limited range of age of our patients (95%CI 61.0–82.9 years), that do not include younger (<50) patients. The relationship of fibrosis and radiomic features from CT of the breast with age therefore could be the subject of future investigation.
Conclusion
The models implemented show that radiomic and dose textural variables extracted from the breast and PTV volumes after correction for fractionation and CT density scale can predict RIF and may help better select patients candidate to exclusive PBI.
Data Availability Statement
The datasets generated for this study are available on request to the corresponding author.
Ethics Statement
The studies involving human participants were reviewed and approved by Comitato Etico Unico Regionale—CEUR Friuli Venezia Giulia. Azienda Regionale di Coordinamento per la Salute (ARCS), via Pozzuolo n. 330 – 33100 Udine (palazzina B). The patients/participants provided their written informed consent to participate in this study.
Author Contributions
MA and LV conceived the study. LV and AC performed the contouring of the cases and were responsible for the clinical evaluation of side effects during the follow-up. GP and ADr extracted the imaging and dose data. GP processed the data and performed radiomics analysis. MA performed the machine learning and statistical analysis and wrote the manuscript with inputs from JS, IN, MT, ADP, MU, SM, MM, and GS. All the authors discussed the results and contributed to final manuscript.
Conflict of Interest
JS is employed by the company Guerbet SA, Villepinte, France.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The handling editor declared a past collaboration with two of the authors MA and IN.
References
1. Avanzo M, Stancanello J, Jena R. Adverse effects to the skin and subcutaneous tissue. In: Rancati T, Fiorino C, editors. Modelling Radiotherapy Side Effects: Practical Applications for Planning Optimisation. Boca Raton, FL: CRC Press, Taylor & Francis Group (2019). p. 291–307. doi: 10.1201/b21956-12
2. Alexander MA, Brooks WA, Blake SW. Normal tissue complication probability modelling of tissue fibrosis following breast radiotherapy. Phys Med Biol. (2007) 52:1831–43. doi: 10.1088/0031-9155/52/7/005
3. Mukesh MB, Harris E, Collette S, Coles CE, Bartelink H, Wilkinson J, et al. Normal tissue complication probability (NTCP) parameters for breast fibrosis: pooled results from two randomised trials. Radiother Oncol. (2013) 108:293–8. doi: 10.1016/j.radonc.2013.07.006
4. Avanzo M, Stancanello J, Trovo M, Jena R, Roncadin M, Trovo MG, et al. Complication probability model for subcutaneous fibrosis based on published data of partial and whole breast irradiation. Phys Med. (2012) 28:296–306. doi: 10.1016/j.ejmp.2011.11.002
5. Avanzo M, Stancanello J, El Naqa I. Beyond imaging: the promise of radiomics. Phys Med. (2017) 38:122–39. doi: 10.1016/j.ejmp.2017.05.071
6. Moran A, Daly ME, Yip SSF, Yamamoto T. Radiomics-based assessment of radiation-induced lung injury after stereotactic body radiotherapy. Clin Lung Cancer. (2017) 18:e425–31. doi: 10.1016/j.cllc.2017.05.014
7. Rossi L, Bijman R, Schillemans W, Aluwini S, Cavedon C, Witte M, et al. Texture analysis of 3D dose distributions for predictive modelling of toxicity rates in radiotherapy. Radiotherap Oncol. (2018) 129:548–53. doi: 10.1016/j.radonc.2018.07.027
8. Gabryś HS, Buettner F, Sterzing F, Hauswald H, Bangert M. Design and selection of machine learning methods using radiomics and dosiomics for normal tissue complication probability modeling of xerostomia. Front Oncol. (2018) 8:35. doi: 10.3389/fonc.2018.00035
9. Liang B, Yan H, Tian Y, Chen X, Yan L, Zhang T, et al. Dosiomics: extracting 3D spatial features from dose distribution to predict incidence of radiation pneumonitis. Front Oncol. (2019) 9:269. doi: 10.3389/fonc.2019.00269
10. Trovo M, Avanzo M, Vinante L, Furlan C, Fiorica F, Perin T, et al. Seven fractions to deliver partial breast irradiation: the toxicity is low. Radiat Oncol. (2017) 12:86. doi: 10.1186/s13014-017-0825-9
11. The Radiation Therapy Oncology Group (RTOG) Contouring Atlases. Available online at: https://www.rtog.org/CoreLab/ContouringAtlases.aspx
12. Avanzo M, Trovo M, Stancanello J, Jena R, Roncadin M, Toffoli G, et al. Hypofractionation of partial breast irradiation using radiobiological models. Phys Med. (2015) 31:1022–8. doi: 10.1016/j.ejmp.2015.08.016
13. Ospina JD, Zhu J, Chira C, Bossi A, Delobel JB, Beckendorf V, et al. Random forests to predict rectal toxicity following prostate cancer radiation therapy. Int J Radiat Oncol Biol Phys. (2014) 89:1024–31. doi: 10.1016/j.ijrobp.2014.04.027
14. Haciislamoglu E, Colak F, Canyilmaz E, Dirican B, Gurdalli S, Yilmaz AH, et al. Dosimetric comparison of left-sided whole-breast irradiation with 3DCRT, forward-planned IMRT, inverse-planned IMRT, helical tomotherapy, and volumetric arc therapy. Phys Med. (2015) 31:360–7. doi: 10.1016/j.ejmp.2015.02.005
15. Vallières M, Freeman CR, Skamene SR, El Naqa I. A radiomics model from joint FDG-PET and MRI texture features for the prediction of lung metastases in soft-tissue sarcomas of the extremities. Phys Med Biol. (2015) 60:5471–96. doi: 10.1088/0031-9155/60/14/5471
16. Coroller TP, Agrawal V, Narayan V, Hou Y, Grossmann P, Lee SW, et al. Radiomic phenotype features predict pathological response in non-small cell lung cancer. Radiother Oncol. (2016) 119:480–6. doi: 10.1016/j.radonc.2016.04.004
17. Lamba R, McGahan JP, Corwin MT, Li C, Tran T, Seibert JA, et al. CT hounsfield numbers of soft tissues on unenhanced abdominal CT scans: variability between two different manufacturers’ MDCT scanners. Am J Roentgenol. (2014) 203:1013–20. doi: 10.2214/AJR.12.10037
18. Johansson S, Svensson H, Denekamp J. Dose response and latency for radiation-induced fibrosis, edema, and neuropathy in breast cancer patients. Int J Radiat Oncol Biol Phys. (2002) 52:1207–19. doi: 10.1016/S0360-3016(01)02743-2
19. Zwanenburg A, Leger S, Vallières M, Lock S, the Image Biomarker Standardisation Initiative for. Image biomarker standardisation initiative. arXiv e-prints. arXiv:1612.07003v11 (2016).
20. Zhang L, Fried DV, Fave XJ, Hunter LA, Yang J, Court LE. IBEX: an open infrastructure software platform to facilitate collaborative work in radiomics. Med Phys. (2015) 42:1341–53. doi: 10.1118/1.4908210
21. Buda M, Maki A, Mazurowski MA. A systematic study of the class imbalance problem in convolutional neural networks. arXiv e-prints. arXiv:1710.05381 (2017). doi: 10.1016/j.neunet.2018.07.011
22. Lemaitre G, Nogueira F, Aridas CK. Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. arXiv e-prints. arXiv:1609.06570 (2016).
23. Haibo He, Yang Bai E. A. Garcia, Shutao Li. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In: 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence) Hong Kong (2008). doi: 10.1109/IJCNN.2008.4633969
24. Yung M, Brown ET, Rasin A, Furst JD, Raicu DS. Synthetic sampling for multi-class malignancy prediction. arXiv e-prints. arXiv:1807.02608 (2018).
25. Franke GR. Stepwise Regression. In: Sheth J, Malhotra N, editors. Wiley International Encyclopedia of Marketing. Hoboken, NJ: American Cancer Society (2010). doi: 10.1002/9781444316568.wiem02071
26. Thu N. Faster feature selection with a dropping forward-backward algorithm. arXiv e-prints. arXiv:1910.08007 (2019).
27. Chen S, Zhou S, Yin FF, Marks LB, Das SK. Investigation of the support vector machine algorithm to predict lung radiation-induced pneumonitis. Med Phys. (2007) 34:3808–14. doi: 10.1118/1.2776669
28. Galar M, Fernandez A, Barrenechea E, Bustince H, Herrera F. A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans Syst Man Cybernet. (2012) 42:463–84. doi: 10.1109/TSMCC.2011.2161285
29. Ben-Bassat M, Klove KL, Weil MH. Sensitivity analysis in bayesian classification models: multiplicative deviations. IEEE Trans Pattern Analysis Mach Intellig. (1980) 2:261–6. doi: 10.1109/TPAMI.1980.4767015
30. Kukar M, Kononenko I, Silvester T. Machine learning in prognosis of the femoral neck fracture recovery. Artif Intellig Med. (1996) 8:431–51. doi: 10.1016/S0933-3657(96)00351-X
31. Massari E, Rancati T, Giandini T, Cicchetti A, Vavassori V, Fellin G, et al. PO-0851: Artificial neural networks for toxicity prediction in RT: a method to validate their “intelligence”. Radiother Oncol. (2017) 123:S461–2. doi: 10.1016/S0167-8140(17)31288-4
32. Parmar C, Grossmann P, Bussink J, Lambin P, Aerts HJ. Machine learning methods for quantitative radiomic biomarkers. Sci Rep. (2015) 5:13087. doi: 10.3389/fonc.2015.00272
33. Parekh V, Jacobs MA. Radiomics: a new application from established techniques. Expert Rev Precis Med Drug Dev. (2016) 1:207–26. doi: 10.1080/23808993.2016.1164013
34. Castiglioni I, Gallivanone F, Soda P, Avanzo M, Stancanello J, Aiello M, et al. AI-based applications in hybrid imaging: how to build smart and truly multi-parametric decision models for radiomics. Eur J Nucl Med Mol Imaging. (2019) 46:2673–99. doi: 10.1007/s00259-019-04414-4
35. Mandrekar JN. Receiver operating characteristic curve in diagnostic test assessment. J Thorac Oncol. (2010) 5:1315–6. doi: 10.1097/JTO.0b013e3181ec173d
36. Power M, Fell G, Wright M. Principles for high-quality, high-value testing. Evid Based Med. (2013) 18:5–10. doi: 10.1136/eb-2012-100645
37. Collette S, Collette L, Budiharto T, Horiot JC, Poortmans PM, Struikmans H, et al. Predictors of the risk of fibrosis at 10 years after breast conserving therapy for early breast cancer: a study based on the EORTC Trial 22881–10882 'boost versus no boost'. Eur J Cancer. (2008) 44:2587–99. doi: 10.1016/j.ejca.2008.07.032
38. Hepel JT, Tokita M, MacAusland SG, Evans SB, Hiatt JR, Price LL, et al. Toxicity of three-dimensional conformal radiotherapy for accelerated partial breast irradiation. Int J Radiat Oncol Biol Phys. (2009) 75:1290–6. doi: 10.1016/j.ijrobp.2009.01.009
39. Lozza L, Fariselli L, Sandri M, Rampa M, Pinzi V, De Santis MC, et al. Partial breast irradiation with CyberKnife after breast conserving surgery: a pilot study in early breast cancer. Radiat Oncol. (2018) 13:49. doi: 10.1186/s13014-018-0991-4
40. Yosinski J, Clune J, Nguyen A, Fuchs T, Lipson H. Understanding neural networks through deep visualization. arXiv e-prints. arXiv:1506.06579 (2015).
41. Sankar V, Kumar D, Clausi DA, Taylor GW, Wong A. SISC: End-to-end interpretable discovery radiomics-driven lung cancer prediction via stacked interpretable sequencing cells. arXiv e-prints. arXiv:1901.04641 (2019).
42. Maskarinec G, Woolcott CG, Kolonel LN. Mammographic density as a predictor of breast cancer outcome. Future Oncol. (2010) 6:351–4. doi: 10.2217/fon.10.3
43. Alowami S, Troup S, Al-Haddad S, Kirkpatrick I, Watson PH. Mammographic density is related to stroma and stromal proteoglycan expression. Breast Cancer Res. (2003) 5:R129–35. doi: 10.1186/bcr622
44. Nguyen HQ, To NH, Zadigue P, Kerbrat S, De La Taille A, Le Gouvello S, et al. Ionizing radiation-induced cellular senescence promotes tissue fibrosis after radiotherapy. A review. Crit Rev Oncol Hematol. (2018) 129:13–26. doi: 10.1016/j.critrevonc.2018.06.012
Keywords: radiomics, radiotherapy, machine learning, breast cancer, fibrosis
Citation: Avanzo M, Pirrone G, Vinante L, Caroli A, Stancanello J, Drigo A, Massarut S, Mileto M, Urbani M, Trovo M, el Naqa I, De Paoli A and Sartor G (2020) Electron Density and Biologically Effective Dose (BED) Radiomics-Based Machine Learning Models to Predict Late Radiation-Induced Subcutaneous Fibrosis. Front. Oncol. 10:490. doi: 10.3389/fonc.2020.00490
Received: 03 January 2020; Accepted: 18 March 2020;
Published: 21 April 2020.
Edited by:
Claudio Fiorino, San Raffaele Hospital (IRCCS), ItalyReviewed by:
Laura Cella, Italian National Research Council, ItalyCarlo Cavedon, Integrated University Hospital Verona, Italy
Copyright © 2020 Avanzo, Pirrone, Vinante, Caroli, Stancanello, Drigo, Massarut, Mileto, Urbani, Trovo, el Naqa, De Paoli and Sartor. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Michele Avanzo, mavanzo@cro.it