- 1Department of Radiology, Xiangya Hospital, Central South University, Changsha, China
- 2Department of Urology, Xiangya Hospital, Central South University, Changsha, China
- 3Department of Pathology, Xiangya Hospital, Central South University, Changsha, China
- 4Xiangya School of Medicine, Central-South University, Changsha, China
- 5School of Mathematics and Statistics, Central South University, Changsha, China
- 6Department of Radiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States
- 7Department of Diagnostic Radiology, City of Hope National Medical Center, Duarte, CA, United States
Background: Clear cell renal cell carcinoma (ccRCC) is the most common renal cancer and it has the worst prognosis among all renal cancers. However, traditional radiological characteristics on computed tomography (CT) scans of ccRCC have been insufficient to predict the pathological grade of ccRCC before surgery.
Methods: Patients with ccRCC were retrospectively enrolled into this study and were separated into two groups according to the World Health Organization (WHO)/International Society of Urological Pathology (ISUP) grading system, i.e., low-grade (Grade I and II) group and high-grade (Grade III and IV) group. Traditional CT radiological characteristics such as tumor size, pre- and post-enhancing CT densities were assessed. In addition, radiomic texture analysis based on the CT imaging of the ccRCC were also performed. A CT-based machine learning method combining the traditional radiological characteristics and radiomic features was used in the predictive modeling for differentiating the low-grade from the high-grade ccRCC. Model performance was evaluated with the receiver operating characteristic curve (ROC) analysis.
Results: A total of 264 patients with pathologically confirmed ccRCC were included in this study. In this cohort, 206 patients had the low-grade tumors and 58 had the high-grade tumors. The model built with traditional radiological characteristics achieved an area under the curve (AUC) of 0.9175 (95% CI: 0.8765–0.9585) and 0.8088 (95% CI: 0.7064–0.9113) in differentiating the low-grade from the high-grade ccRCC for the training cohort and the validation cohort respectively. The model built with the radiomic textural features yielded an AUC value of 0.8170 (95% CI: 0.7353–0.8987) and 0.8017 (95% CI: 0.6878–0.9157) for the training cohort and the validation cohort, respectively. The combined model integrating both the traditional radiological characteristics and the radiomic textural features achieved the highest efficacy, with an AUC of 0.9235 (95% CI: 0.8646–0.9824) and an AUC of 0.9099 (95% CI: 0.8324–0.9873) for the training cohort and validation cohort, respectively.
Conclusion: We developed a machine learning radiomic model achieving a satisfying performance in differentiating the low-grade from the high-grade ccRCC. Our study presented a potentially useful non-invasive imaging-focused method to predict the pathological grade of renal cancers prior to surgery.
Introduction
Clear cell renal cell carcinoma (ccRCC) constitutes 70%–80% of all renal cancers (1–3) and it has a poor prognosis with a cure rate under 70% even for a localized ccRCC treated by radical nephrectomy (4–6). A novel four-tiered World Health Organization (WHO)/International Society of Urological Pathology (ISUP) grading (7) has been reported to have the potential to predict prognosis in patients with ccRCC (8–10) who may have the poorest prognosis among all patients with renal cancer (11–13). The ccRCC tumors are usually subclassified into two groups including the low-grade (Grade I and II) and the high-grade (Grade III and IV) groups, reflecting the significant difference in treatment strategy and prognosis between the two groups (7, 10, 14). It has been shown that the higher ISUP grade of ccRCC has greater biologic aggressiveness, and is associated with worse survival (8, 9) and higher risk for recurrence after partial (nephron-sparing) nephrectomy (15). Knowledge of ISUP grade prior to surgery could guide clinical decision making (5, 16, 17). In addition, reliable ISUP grade obtained from a non-invasive method such as imaging may alleviate the need for renal biopsy (18), thus avoiding the risk of complications from invasive biopsies such as bleeding, infection, tumor seeding the biopsy needle path, and the relatively low accuracy in assessing tumor grade based on the biopsy specimen (19). Therefore, there is an unmet need to develop non-invasive methods for assessing the pathological grade of ccRCC before surgery.
Non-invasive imaging-based method has been used in assessing pathological grade of ccRCC before surgery (20–22). Several traditional radiological characteristics such as tumor size and CT enhancement patterns have been shown to be correlated with the tumor grade (23). However, it has been challenging to predict the pathological grade of ccRCC with the existing limited information obtained from the traditional radiological characteristics (21, 24). By contrast, radiomic analysis involving the computerized extraction of data not discernable to the human eyes could generate highly detailed imaging features regarding tumor texture, shape, and image intensity (25, 26). Such methods have been successfully used in cancer research (25, 26), presenting the potential for identifying tumor phenotype, pathological grade (27), and biological behavior (28). Therefore, radiomic analysis is a potentially useful method that could be used not only to evaluate tumor heterogeneity but also to assess pathological grade for guiding personalized cancer treatment. However, there has been limited progress in developing non-invasive radiomic machine-learning models to accurately differentiate the low-grade from the high-grade ccRCC.
In this study, we analyzed the traditional radiological characteristics of ccRCC on pre-surgical CT images including the tumor size and CT density values. In addition, we performed predictive modeling combining the features obtained from both the traditional radiological assessment and the radiomic textural analysis. We aimed to develop a radiomic machine learning model to predict the ISUP grade of ccRCC tumors pre-surgically. We hypothesized that integration of radiomic features into the traditional radiological characteristics should improve the model performance in differentiating the low-grade from the high-grade ccRCC than using either the radiomic features or the traditional radiological characteristics alone in building the model.
Methods
Patients
Patients were consecutively identified and retrospectively included into this study through a careful assessment of our medical records from June 1, 2010 to June 1, 2017. All patients in this cohort underwent radical or partial nephrectomy with curative intent in our hospital with a final pathological diagnosis of ccRCC. Those patients with complete medical records including pathological confirmation and pre-surgical CT images were included, and their medical data including CT images was collected by research personnel (QX, FZ, ZL, and CW) for subsequent assessment. To avoid possible observer bias, the researchers were tasked specifically for different aspects of the study. For example, researcher 1 (ZL) with exposure to the original data completed the data anonymization procedure and did not participate in the subsequent analysis. The reminder three researchers including FZ only dealt with anonymized data and they were blinded to all radiological and clinicopathological information of the patients. All enrolled patients were divided into two cohorts, i.e., the training cohort and the validation cohort, at a ratio of 3:1 randomly. Details of the exclusion criteria and the patient recruiting process were shown in Figure 1.
This study was approved by Ethic Committee and Institutional Review Board in Xiangya Hospital of Central South University, P. R. China (IRB#2017121011). Written informed consents were waived due to the retrospective nature of this study.
Re-Analysis of Pathological Slides
For each patient, all pathological slides (including hematoxylin and eosin [HE] and immunohistochemical [IHC] staining) were re-analyzed by two pathologists specialized in urology (GG and HY, with 6 and 25 years of experience in uropathology, respectively). Each ccRCC grading was undertaken according to the criteria of the ISUP grading system (7) (Supplementary Table 1), and the ccRCC tumors were separated into two groups: the low-grade group (Group 1: Grade I and II) and the high-grade group (Group 2: Grade III and IV) (14). Consensus was reached by discussion if differences in opinions existed.
Computed Tomography Imaging
All patients had a routine abdominal CT scan obtained on one of our three CT scanners, i.e., a 16-MDCT (Brilliance 16, Philipps), a 64-MDCT (SOMATOM Definition, Siemens), or a 320-MDCT (Aquilion ONE, Toshiba Medical Systems) scanner. CT imaging included an acquisition of a pre-contrast phase and a contrast-enhanced phase with a power injector (Ulrich CT plus 150, Ulrich Medical, Ulm, Germany). Briefly, 90–100 ml of iodinated contrast material (Ultravist 370, Bayer Schering Pharma, Berlin, Germany) was administered intravenously at a rate of 3.0–3.5 ml per second. Contrast-enhanced images at the nephrographic phase (scan with fixed delay time of 65 s) were obtained for all patients. Since all patients had CT images for both the non-enhanced phase and the nephrographic/portal venous phase, the CT images from these two phases were included in this analysis. All CT images were retrieved from our Picture Archiving and Communication Systems (PACS, Carestrem, Canada), and were downloaded to an external workstation (Leonardo; Siemens Medical Solutions, Forchheim, Germany). All CT images were reconstructed into the voxel size of 1×1×1mm3 for subsequent analysis.
Traditional Radiological Analysis
Two radiologists specialized in abdominal imaging (Reader 1: FZ with 10 years of experience and Reader 2: GL with 25 years of experience) reviewed the CT images independently. They were blinded to all radiological and clinicopathological information of the patients. They recorded the traditional CT imaging findings including the tumor size measurements such as transverse dimension in millimeter (mm), anteroposterior dimension (mm), cranio-caudal dimension (mm), pre-enhanced CT density value (CTpre) in Hounsfield units (HU), enhanced CT density value (CTpost) in HU, and enhancement range in HU.
Radiomic Textural Feature Extraction
We used the pre-contrast non-enhanced CT images for radiomic textural feature extraction due to the following reasons. In this retrospective study with the images already acquired, we were concerned about the potential confounding variables affecting the contrast-enhanced images such as the inconsistent injection speed of contrast medium and varying hemodynamics of each patient after contrast administration. These variables may contribute to varying contrast enhancement of the tumors that did not reflect the true tumor heterogeneity. On the contrary, pre-contrast non-enhanced CT images were easy to acquire and were relatively stable from one patient to another, which may show the inherent tumor heterogeneity. In our study, the radiomic textural feature extraction was performed only on the non-enhanced images.
For each patient’s CT scan, a representative axial image with the largest cross-sectional measurement of the renal tumor was selected. In order to eliminate the potential variance of CT images obtained on the three different scanners, all original CT images underwent normalization using the gray-scale discretization method before textural feature extraction, with a final 256 bins (Analysis Kit software, version V3.0.0.R, GE Healthcare) (29, 30). Subsequently, we used the textural analysis software (MaZda Version 4.6, Institute of Electronics, Technical University of Lodz, Poland) (31) to perform the image analysis. A region of interest (ROI) to outline the tumor boundaries was drawn manually. The corresponding contrast-enhanced CT images were used as references in delineating the precise boundaries of the tumor on pre-enhanced images. All contouring was reviewed and validated by two senior abdominal radiologists (XY and GL) with 15 and 16 years of experience, respectively, in interpreting genitourinary CT images.
For each patient, a total of 340 textural features were extracted with the MaZda software based on corresponding ROI file, including a gray-level histogram, a gradient, a run-length matrix, a co-occurrence matrix, an autoregressive model and a wavelet transform analysis.
Reproducibility of Textural Feature Extraction
To evaluate the reproducibility of the radiomic textural feature extraction, the inter-observer (Reader 1 versus Reader 2) and intra-observer (Reader 1 twice) correlation coefficient (ICC) values were accessed. The reader consistency and reproducibility were determined according to the following criteria based on the ICC value: poor (<0.20), fair (0.21–0.40), moderate (0.40–0.60), good (0.61–0.80), and excellent (0.81–1.00). In general, an ICC exceeding 0.75 indicated good agreement.
The differences of the values for each feature between the two groups, and the differences between the textural features generated by Reader 1 (first time) and those by Reader 2, as well as between the features twice-generated by Reader 1, were analyzed using Mann-Whitney U test, independent samples t-test or Kruskal-Wallis H test, where appropriate.
Inter-observer and intra-observer reproducibility was initially analyzed with 50 randomly chosen patients’ CT images evaluated by two radiologists (Reader 1 and Reader 2). To assess the inter-observer reproducibility, Reader 1 and Reader 2 completed the workflow as described previously (32).
A 0.2–1 cm2 circular ROI was used to measure CT attenuation values of the tumors in HU. ROIs were placed on the solid parts of the tumor for three times, then the average CT attenuation value was recorded. Tumor size measurements including transverse dimension (mm), anteroposterior dimension (mm), and cranio-caudal dimension (mm), were all measured three times, and then the average values were recorded.
Statistical Analysis, Feature Selection, and Prediction Model Building
IBM SPSS version 22.0.0 (IBM Corporation, Armonk, NY, USA) was used to for statistical analyses. The differences about quantitative radiomic features and the qualitative features between the two groups, i.e., the low-grade group and the high-grade group, were tested using the Wilcoxon rank-sum test and the chi-square test respectively.
We used MATLAB 2017a (The Mathworks, Inc., Natick, MA, USA) to perform the data processing, data reduction for feature selection, and model building. The least absolute shrinkage and selection operator (LASSO) method was performed to select the features from the training cohort that possessed the most useful predictive value. Based on these selected features, machine learning methods including the Random Forest (RF) method and the support vector machine (SVM) method were used to generate the differentiation models according to the classification algorithm developed in our previous report (32).
The differentiation models were developed in the training cohort, and were validated in the validation cohort. The classification efficiencies of the models were calculated using the receiver operating characteristic (ROC) curves analysis. A P value < 0.05 was considered statistically significant. The work flow for radiomic feature extraction, feature selection and classification model building was presented in Figure 2. Details of the flow chart depicting the process of predictive modeling was shown in Supplementary Figure 1.
Figure 2 Flow chart for radiomic feature extraction, feature selection and predictive modeling. (I) Representative tumor segmentation computed tomography (CT) images. (II) Radiomic feature extraction. (III) Radiomic feature selection. (IV) Classification algorithm and predictive modeling.
Correlation Test Among Selected Features
A correlation matrix analysis was performed to evaluate associations between the radiomic textural features and the traditional radiological characteristics, including correlations of features within each of these two groups, i.e., the low-grade group and the high-grade group as well as between the two groups.
Results
Patient Characteristics Between the Training and Validation Cohorts
Table 1 summarized the clinicopathological and traditional radiological characteristics of this study cohort. There were no significant differences in the clinical characteristics such as gender and age between the training and the validation cohorts. There were no significant differences in the distribution of the low-grade and the high-grade ccRCC between the two cohorts. No significant differences were noted between the two cohorts regarding the tumor size measurements or CT density values.
Table 1 Comparison of patient characteristics between the training cohort and the validation cohort.
Patient Characteristics Between the Low-Grade and the High-Grade Groups
The tumors size measurements in the low-grade group (Group 1) were significantly smaller than those in Group 2. The CTpre values of the Group 1 tumors were significantly lower than those of the Group 2 tumors. In contrast, tumors in Group 1 showed a higher CTpost value than the tumors in Group 2, although the difference did not reach statistical significance (P=0.052). However, when considering the CTpre values, the degree of enhancement for Group 1 tumors was significantly higher than that of the Group 2 tumors (P=0.001). Details of the corresponding statistical results were presented in Table 2.
Reproducibility of Radiomic Feature Extraction and Traditional Radiological Assessment
Our results demonstrated satisfactory inter- and intra-observer reproducibility of the radiomic feature extraction and the traditional radiological assessment. The inter-observer ICCs of for radiomic features between Reader 1 (first time) and Reader 2 ranged from 0.761 to 0.893. The intra-observer ICC of Reader 1 with two extraction performances ranged from 0.781 to 0.909. As a result, the radiomic features extracted by Reader 1 were used in all subsequent analysis. The inter-reader analysis achieved good to excellent agreement in traditional radiological evaluation (ICC = 0.687–0.936). The ICC values for the traditional radiological features were not high, which could be explained by the following reasons. First, the traditional radiological features such as CT density may vary from one scan to another due to inherent tumor heterogeneity. In addition, the solid components of tumors might not be homogenously enhancing and therefore may result in variations in local delineation of ROIs for CT density measurements. Second, there may be subtle differences in CT density among the three different CT scanners. Third, the renal tumors were generally small and the solid enhancing parts of the renal tumor were even smaller in size. Any minor variations in local delineation of ROI between the readers may result in a large difference in ICC. However, caution was taken in delineation of ROIs and all measurements were performed three times with the average values being recorded.
Model Built With Radiomic Textural Features
A total of 340 features were extracted from pre-enhanced CT images for each patient. Of all the textural features, 19 features were finally selected to build a textural signature (Rad-score) after performing LASSO for feature selection. This process was included in the Supplementary Files (Supplementary Files: Equation 1). The same set of features was also used to build a RF classifier (score 2). A SVM classifier (SMV 1) was built based on the two models. The SVM 1 classifier achieved a classification performance with an AUC value of 0.8170 (95% CI: 0.7353–0.8987) and 0.8017 (95% CI: 0.6878–0.9157) in the training and validation cohorts, respectively (Figure 3).
Figure 3 Classification efficiencies of the three support vector machine (SVM) models. (A) Model built with radiomic textural features. (B) Model built with traditional radiological characteristics. (C) Model built with both radiomic textural features and traditional radiological features. (D) Receiver operating characteristic (ROC) curve analysis for the training cohort. (E) Receiver operating characteristic (ROC) curve analysis for the validation cohort. LASSO, least absolute shrinkage and selection operator; SVM, Support vector machine.
Model Built With Traditional Radiological Characteristics
The traditional radiological characteristics including the transverse dimension, cranio-caudal dimension, CTpre, and enhancement range were selected using the LASSO method (Supplementary Files: Equation 2) (score 4), and a RF model (Score 5) was built through the same modeling process as performed for the radiomic textural features. Based on scores 4 and 5, a new SVM classifier (SVM 2; Score 6) was created. The AUC of SVM2 was 0.9175 (95% CI: 0.8765–0.9585) and 0.8088 (95% CI: 0.7064–0.9113) in the training and validation cohorts, respectively.
Based on SVM1 (Score 3) and SVM2 (Score 6), the final classification model built by the SVM method (SVM3; Score 7) was constructed. This model provided an AUC of 0.9235 (95% CI: 0.8646–0.9824) with a sensitivity of 0.8780 (95% CI: 0.7561–0.9756) and a specificity of 0.9167 (95% CI: 0.8611–0.9722) in the training cohort, and a AUC of 0.9099 (95% CI: 0.8324–0.9873) with a sensitivity of 0.9412 (95% CI: 0.7647–1.0000) and a specificity of 0.8871 (95% CI: 0.7742–0.9839) in the validation cohort (Table 3 and Figure 4).
Table 3 Summary of three machine learning models for predicting pathological grade based on radiomic features, radiological characteristics, and the combination of both, respectively.
Figure 4 Classification efficiency for the training cohort and the validation cohort for the support vector machine (SVM) models.
Correlation Among All Features Used in Modeling
We obtained a correlation matrix to evaluate the correlations among all the features included in the final model. As shown in Figure 5, the correlations were relatively high among the four selected traditional radiological characteristics (0.036–0.883), and were varied among the 19 selected radiomic textural features (0.000–1.000). Interestingly, although a few radiomic textural features (including Mean, Variance, Perc_01, and Perc_99) presented high correlation indices (0.923–0.929), the remaining 15 radiomic textural features had relatively low correlation indices (0.004–0.375), which justified using the features from both the radiomic texture and the traditional radiological assessment to build a more reliable predictive model.
Figure 5 Correlation matrix test among all 19 radiomic textural features and four traditional radiological features (bold font) used in predictive modeling. S-S, indicating the craniocaudal dimension of the tumor; L-R, indicating the transverse dimension of the tumor.
Discussion
In this study, we utilized pre-surgical CT images to develop a radiomic machine learning model for differentiating the low-grade from the high-grade ccRCC. Our machine learning models incorporating optimal radiomic textural features achieved an AUC up to 0. 92 in the training cohort and 0.91 in the validation cohort. Our study provided promising data for potentially using noninvasive imaging-based method to predict pathological grade of ccRCC.
We included several traditional radiological characteristics in the modeling process, including tumor size measurements, T staging information, and CT density values (21). These commonly used radiological characteristics have been used to predict tumor progression and pathological grade with some success (17, 21, 23, 24). However, to the best of our knowledge, our results showed for the first time that the model built with these traditional radiological characteristics was not stable enough to make a reliable prediction of pathological grade for ccRCC. Nevertheless, these radiological characteristics were visible to the human eye and could be conveniently assessed by radiologists and trained imaging personnel. These radiological characteristics have been valuable in clinical practice and we therefore should include them in predictive modeling. On the other hand, our study also showed low correlation index between the traditional radiological characteristics and the radiomic textural features, indicating these two different kinds of features may contribute different rather than redundant tumor information. Our study showed the potential of combining the observed radiological characteristics and the radiomic computational approach to improve model performance.
The mechanism underlying our satisfying radiomic model performance is not clear. Imaging features of tumor heterogeneity may represent the phenotypes of tumor (26, 33, 34). Tumor heterogeneity may potentially be expressed phenotypically in images as intratumoral heterogeneity and could be comprehensively assessed by imaging analysis (17, 33). Therefore, it is reasonable to speculate that radiomic textural features in our study may represent tumor heterogeneity, thus being relevant in predicting pathological grade as indicated in prior literature (26, 33–37). For texture features, we found that the features prompting the model to classify renal tumors as high-grade ccRCC mainly belonged to histogram (such as: variance), run-length matrix (such as: run length nonuniformity, and gray level nonuniformity. with the higher values of these textural features, there were corresponding higher LASSO scores, indicating the higher risk of the tumor being classified as a higher-grade ccRCC. Regarding the traditional radiological features, the LASSO regressors included the tumor size measurements, CTpre density value and the enhanced degree of the tumor. It is understandable that the larger the tumor poses the greater risk of being high-grade because of greater tumor heterogeneity. In addition, higher-grade tumors may have worse pathological differentiation and tumor necrosis, which may lead to a lower degree of enhancement.
Our study was generally in line with prior reports of renal cancer assessed with machine learning radiomics (29, 38–41). Our model performance was comparable to the prior studies which had AUC values reaching 0.86~0.98 for predicting pathological grade of renal cancers. However, our method for radiomic analysis was different from others in that we extracted radiomic features from one representative pre-enhanced axial CT image containing the maximal cross-sectional tumor dimensions while others obtained radiomics from contrast-enhanced images on both CT and magnetic resonance imaging. There were several advantages in our novel approach. First, our method was feasible and could be readily adopted as non-enhanced images were routinely included in CT imaging of renal cancer. It is easier to acquire the non-enhanced images than the contrast-enhanced images, and the image quality for the non-enhanced images could be better controlled than the contrast-enhanced images. In addition, our method could be used in patients who could not have contrast-enhanced imaging due to either contrast allergy or abnormal renal function which is especially relevant in patients with renal cancers. Secondly, the contrast-enhanced images may vary depending on the distribution and amount of contrast agents in the tumor tissue, which could be affected by multiple variables such as the type of contrast agent used, the injection speed, the hemodynamic conditions of the patients, etc (32). Therefore, our approach of using non- enhanced images could alleviate the concerns stemming from the potential image variations due to contrast enhancement. Lastly, our single image strategy could be useful for our planned multicenter clinical trials because of its simplicity to use and its readiness to be standardized among all participating centers. Furthermore, the acceptable efficiency of our method using the single image at the maximal cross-sectional tumor level has been reported in our own publication (32).
It should be noted that our satisfying model performance in predicting the pathological grade of ccRCC could be partly related to the classification algorithm used in the present study. This algorithm was developed by our team, and has been successfully applied in our current and previous studies (32). The basic logic of this classification algorithm was to treat LASSO and RF as weak regressors in the whole algorithm, which respectively reflected the classification attributes of the research object. We then used the SVM algorithm to combine these two to finally achieve the purpose of enhancing the classification effect. In addition, the final regressed scores from this algorithm could be binarized for further prediction. Nevertheless, our modeling algorithm was far from being comprehensive. More work is needed to further improve the classification performance by continually optimizing and improving the structure of the data mining algorithms.
We recognized that there was an apparent contradiction in our feature selection for the final model building. The contradiction was that we used the non-enhanced CT images for radiomic feature extraction but included the degree of CT enhancement as part of the traditional radiological characteristics in the final model. We believe we could resolve this apparent contradiction with the following explanation. First, the degree of CT enhancement was one of the most important radiological characteristics assessed by radiologists. Due to limitations of human visual inspection, the traditional radiological characteristics are usually limited in number including only the tumor size measurement and CT densities on both pre- and post-contrast images as in our study. Therefore, it was important to include it in the model building in our attempt to keep the few commonly reported characteristics which reflects the current clinical radiological practice. Second, while the traditional radiological assessment could only provide descriptive information on tumor characteristics, radiomics could extract a multitude of computational quantitative imaging features about tumor heterogeneity not visible to human eyes. Therefore, these two approaches in our study with one using non-enhanced images and another using enhanced images were complimentary rather than contradictory to each other and the combination of both strengthened the model performance as shown in our study.
This study had several limitations. First, this was a retrospective study conducted at a single institution, and case selection bias seemed inevitably. In addition, although there were 264 patients with ccRCC included in our study, our sample size was still modest for a machine learning study given heterogeneous disease distribution. Second, our validation cohort used to test the model efficiency was from the same institution as the training cohort, therefore making it challenging to generalize our results to other institutions and other disease settings. Future large-scale independent prospective multicenter studies are needed to validate our results. Third, our study was focused on ccRCC which constituted most of the renal cancer. However, it was not sufficient for a complete survey of renal cancer since other renal cancer subtypes could have similar imaging features and therefore should be evaluated in future studies. Moreover, our study was limited in that an accurate imaging-pathological correlation for each patient could not be performed in this retrospective study, which could have been helpful to assess the underlying pathological basis of our model performance. Lastly, the CT images in this study were obtained in three different CT scanners, which may be variable in terms of imaging quality due to inherent differences among the scanners. This may in turn potentially affect the textural features and model performance.
In summary, we developed a radiomic machine learning model with the pre-surgical CT images, achieving a satisfying performance in differentiating the low-grade from the high-grade ccRCC. Our approach integrating the traditional radiological characteristics and the radiomic textural features improved the performance of our prediction models. Our study presented a potentially useful non-invasive imaging-focused method to predict the pathological grade of renal cancers prior to surgery, which should assist in clinical decision making for selecting cancer treatment strategies and for informing prognosis.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.
Ethics Statement
This single-center retrospective study was approved by Ethic Committee and Institutional Review Board in Xiangya Hospital of Central South University, P. R. China (IRB#2017121011), and the written informed consents were waived.
Author Contributions
XY and LL conceived and designed the study. XY, FZ, LL, QXi, CL, and GL analyzed the data. XG, QXi, HY, ZL, CQ, CW, XY, QXu, CQ, ML, and GG contributed analysis tools. XY, XG, LL, CZ, and BC wrote the paper. All authors contributed to the article and approved the submitted version.
Funding
This study was supported in part by a project funded by the China Postdoctoral Science Foundation (2018M632997), the Postdoctoral Science Foundation of Central South University (No.185705), the Natural Science Foundation of Hunan Province (2018JJ2641), the Shenghua Yuying talents program at Central South University, P.R. China and the National Natural Science Foundation of China (81902727).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank Taihao Jin, Ph.D., at the City of Hope National Medical Center, USA, for helpful suggestions regarding radiomics and machine learning methods. We thank Carol Wuenschell, Ph.D., for editing this manuscript. We also wish to thank Drs. Desheng Xiao and Mr Guanghui Gong, from the Department of Pathology, Xiangya Hospital, Central South University, P.R. China, for the helpful assistance in the pathological analysis. Preliminary data with an abstract was presented as an oral report at the European Congress of Radiology conference in Vienna, Austria, February 27–March 3, 2019.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2020.570396/full#supplementary-material
Abbreviations
ccRCC, clear cell renal cell carcinoma; HE, hematoxylin and eosin; IHC, immunohistochemistry; ISUP, International Society of Urological Pathology; LASSO, least absolute shrinkage and selection operator; pRCC, papillary renal cell carcinoma; RCC, renal cell carcinoma; RF, random forest; SVM, support vector machine; WHO, World Health Organization.
References
1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2018. CA Cancer J Clin (2018) 68(1):7–30. doi: 10.3322/caac.21442
2. Wan XM, Peng LB, Ma JA, Li YJ. Economic evaluation of nivolumab as a second-line treatment for advanced renal cell carcinoma from US and Chinese perspectives. Cancer (2017) 123(14):2634–41. doi: 10.1002/cncr.30666
3. Znaor A, Lortet-Tieulent J, Laversanne M, Jemal A, Bray F. International variations and trends in renal cell carcinoma incidence and mortality. Eur Urol (2015) 67(3):519–30. doi: 10.1016/j.eururo.2014.10.002
4. Wang Z, Qi L, Yuan P, Zu X, Chen W, Cao Z, et al. Application of Three-Dimensional Visualization Technology in Laparoscopic Partial Nephrectomy of Renal Tumor: A Comparative Study. J Laparoendosc Adv Surg Tech A (2017) 27(5):516–23. doi: 10.1089/lap.2016.0645
5. Liu L, Qi L, Li Y, Chen M, Li C, Cheng X, et al. Retroperitoneoscopic Partial Nephrectomy for Moderately Complex Ventral Hilar Tumors: Surgical Technique and Trifecta Outcomes from a Single Institution in China. J Laparoendosc Adv Surg Tech A (2017) 27(8):812–7. doi: 10.1089/lap.2016.0194
6. Ljungberg B, Bensalah K, Canfield S, Dabestani S, Hofmann F, Hora M, et al. EAU guidelines on renal cell carcinoma: 2014 update. Eur Urol (2015) 67(5):913–24. doi: 10.1016/j.eururo.2015.01.005
7. Delahunt B, Cheville JC, Martignoni G, Humphrey PA, Magi-Galluzzi C, McKenney J, et al. The International Society of Urological Pathology (ISUP) grading system for renal cell carcinoma and other prognostic parameters. Am J Surg Pathol (2013) 37(10):1490–504. doi: 10.1097/PAS.0b013e318299f0fb
8. Kim H, Inomoto C, Uchida T, Furuya H, Komiyama T, Kajiwara H, et al. Verification of the International Society of Urological Pathology recommendations in Japanese patients with clear cell renal cell carcinoma. Int J Oncol (2018) 52(4):1139–48. doi: 10.3892/ijo.2018.4294
9. Dagher J, Delahunt B, Rioux-Leclercq N, Egevad L, Srigley JR, Coughlin G, et al. Clear cell renal cell carcinoma: validation of World Health Organization/International Society of Urological Pathology grading. Histopathology (2017) 71(6):918–25. doi: 10.1111/his.13311
10. Moch H, Cubilla AL, Humphrey PA, Reuter VE, Ulbright TM. The 2016 WHO Classification of Tumours of the Urinary System and Male Genital Organs-Part A: Renal, Penile, and Testicular Tumours. Eur Urol (2016) 70(1):93–105. doi: 10.1016/j.eururo.2016.02.029
11. Ficarra V, Martignoni G, Galfano A, Novara G, Gobbo S, Brunelli M, et al. Prognostic role of the histologic subtypes of renal cell carcinoma after slide revision. Eur Urol (2006) 50(4):786–93; discussion 793-4. doi: 10.1016/j.eururo.2006.04.009
12. Cheville JC, Lohse CM, Zincke H, Weaver AL, Blute ML. Comparisons of outcome and prognostic features among histologic subtypes of renal cell carcinoma. Am J Surg Pathol (2003) 27(5):612–24. doi: 10.1097/00000478-200305000-00005
13. Amin MB, Amin MB, Tamboli P, Javidan J, Stricker H, de-Peralta Venturina M, et al. Prognostic impact of histologic subtyping of adult renal epithelial neoplasms: an experience of 405 cases. Am J Surg Pathol (2002) 26(3):281–91. doi: 10.1097/00000478-200203000-00001
14. Kuthi L, Jenei A, Hajdu A, Németh I, Varga Z, Bajory Z, et al. Prognostic Factors for Renal Cell Carcinoma Subtypes Diagnosed According to the 2016 WHO Renal Tumor Classification: a Study Involving 928 Patients. Pathol Oncol Res (2017) 23(3):689–98. doi: 10.1007/s12253-016-0179-x
15. Mouracade P, Kara O, Maurice MJ, Dagenais J, Malkoc E, Nelson RJ, et al. Patterns and Predictors of Recurrence after Partial Nephrectomy for Kidney Tumors. J Urol (2017) 197(6):1403–9. doi: 10.1016/j.juro.2016.12.046
16. Chen W, Zheng R, Baade PD, Zhang S, Zeng H, Bray F, et al. Cancer statistics in China, 2015. CA Cancer J Clin (2016) 66(2):115–32. doi: 10.3322/caac.21338
17. Yan Y, Liu L, Zhou J, Li L, Yuan Li Y, Chen M, et al. Clinicopathologic characteristics and prognostic factors of sarcomatoid renal cell carcinoma. J Cancer Res Clin Oncol (2015) 141(2):345–52. doi: 10.1007/s00432-014-1740-1
18. Lane BR, Samplaski MK, Herts BR, Zhou M, Novick AC, Campbell SC. Renal mass biopsy–a renaissance. J Urol (2008) 179(1):20–7. doi: 10.1016/j.juro.2007.08.124
19. Bjurlin MA, Elkin EB, Atoria CL, Russo P, Taneja SS, Huang WC. Influence of renal biopsy results on the management of small kidney cancers in older patients: Results from a population-based cohort. Urol Oncol (2017) 35(10):604.e1–9. doi: 10.1016/j.urolonc.2017.05.025
20. Chen SH, Wu YP, Li XD, Lin T, Guo QY, Chen YH, et al. R.E.N.A.L. Nephrometry Score: A Preoperative Risk Factor Predicting the Fuhrman Grade of Clear-Cell Renal Carcinoma. J Cancer (2017) 8(18):3725–32. doi: 10.7150/jca.21189
21. Oh S, Sung DJ, Yang KS, Sim KC, Han NY, Park BJ, et al. Correlation of CT imaging features and tumor size with Fuhrman grade of clear cell renal cell carcinoma. Acta Radiol (2017) 58:376–84. doi: 10.1177/0284185116649795
22. Cornelis F, Tricaud E, Lasserre AS, Petitpierre F, Bernhard JC, Le Bras Y, et al. Multiparametric magnetic resonance imaging for the differentiation of low and high grade clear cell renal carcinoma. Eur Radiol (2015) 25(1):24–31. doi: 10.1007/s00330-014-3380-x
23. Zhu YH, Wang X, Zhang J, Chen YH, Kong W, Huang YR. Low enhancement on multiphase contrast-enhanced CT images: an independent predictor of the presence of high tumor grade of clear cell renal cell carcinoma. AJR Am J Roentgenol (2014) 203:W295–300. doi: 10.2214/AJR.13.12297
24. Vargas HA, Delaney HG, Delappe EM, Wang Y, Zheng J, Moskowitz CS, et al. Multiphasic contrast-enhanced MRI: single-slice versus volumetric quantification of tumor enhancement for the assessment of renal clear-cell carcinoma fuhrman grade. J Magn Reson Imaging (2013) 37(5):1160–7. doi: 10.1002/jmri.23899
25. Feng Z, Rong P, Cao P, Zhou Q, Zhu W, Yan Z, et al. Machine learning-based quantitative texture analysis of CT images of small renal masses: Differentiation of angiomyolipoma without visible fat from renal cell carcinoma. Eur Radiol (2018) 28(4):1625–33. doi: 10.1007/s00330-017-5118-z
26. Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol (2017) 14(12):749–62. doi: 10.1038/nrclinonc.2017.141
27. Ding J, Xing Z, Jiang Z, Chen J, Pan L, Qiu J, et al. CT-based radiomic model predicts high grade of clear cell renal cell carcinoma. Eur J Radiol (2018) 103:51–6. doi: 10.1016/j.ejrad.2018.04.013
28. Cho H, Park H. Classification of low-grade and high-grade glioma using multi-modal image radiomics features. Conf Proc IEEE Eng Med Biol Soc (2017) 2017:3081–4. doi: 10.1109/EMBC.2017.8037508
29. Shu J, Wen D, Xi Y, Xia Y, Cai Z, Xu W, et al. Clear cell renal cell carcinoma: Machine learning-based computed tomography radiomics analysis for the prediction of WHO/ISUP grade. Eur J Radiol (2019) 121:108738. doi: 10.1016/j.ejrad.2019.108738
30. Ren S, Zhao R, Cui W, Qiu W, Guo K, Cao Y, et al. Computed Tomography-Based Radiomics Signature for the Preoperative Differentiation of Pancreatic Adenosquamous Carcinoma From Pancreatic Ductal Adenocarcinoma. Front Oncol (2020) 10:1618. doi: 10.3389/fonc.2020.01618
31. Szczypiński PM, Strzelecki M, Materka A, Klepaczko A. MaZda–a software package for image texture analysis. Comput Methods Programs Biomed (2009) 94:66–76. doi: 10.1016/j.cmpb.2008.08.005
32. Yi X, Pei Q, Zhang Y, Zhu H, Wang Z, Chen C, et al. MRI-Based Radiomics Predicts Tumor Response to Neoadjuvant Chemoradiotherapy in Locally Advanced Rectal Cancer. Front Oncol (2019) 9:552. doi: 10.3389/fonc.2019.00552
33. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer (2012) 48(4):441–6. doi: 10.1016/j.ejca.2011.11.036
34. Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images Are More than Pictures, They Are Data. Radiology (2016) 278(2):563–77. doi: 10.1148/radiol.2015151169
35. Lubner MG. Radiomics and Artificial Intelligence for Renal Mass Characterization. Radiol Clin North Am (2020) 58:995–1008. doi: 10.1016/j.rcl.2020.06.001
36. Shu J, Tang Y, Cui J, Yang R, Meng X, Cai Z, et al. Clear cell renal cell carcinoma: CT-based radiomics features for the prediction of Fuhrman grade. Eur J Radiol (2018) 109:8–12. doi: 10.1016/j.ejrad.2018.10.005
37. Chaddad A, Kucharczyk MJ, Daniel P, Sabri S, Jean-Claude BJ, Niazi T, et al. Radiomics in Glioblastoma: Current Status and Challenges Facing Clinical Implementation. Front Oncol (2019) 9:374. doi: 10.3389/fonc.2019.00374
38. Cui E, Li Z, Ma C, Li Q, Lei Y, Lan Y, et al. Predicting the ISUP grade of clear cell renal cell carcinoma with multiparametric MR and multiphase CT radiomics. Eur Radiol (2020) 30:2912–21. doi: 10.1007/s00330-019-06601-1
39. Zhao Y, Chang M, Wang R, Xi IL, Chang K, Huang RY, et al. Deep Learning Based on MRI for Differentiation of Low- and High-Grade in Low-Stage Renal Cell Carcinoma. J Magn Reson Imaging (2020). doi: 10.1002/jmri.27153
40. Sun X, Liu L, Xu K, Li W, Huo Z, Liu H, et al. Prediction of ISUP grading of clear cell renal cell carcinoma using support vector machine model based on CT images. Med (Baltimore) (2019) 98:e15022. doi: 10.1097/MD.0000000000015022
Keywords: radiomics, clear cell renal cell carcinoma, computed tomography (CT), machine learning, predictive modeling
Citation: Yi X, Xiao Q, Zeng F, Yin H, Li Z, Qian C, Wang C, Lei G, Xu Q, Li C, Li M, Gong G, Zee C, Guan X, Liu L and Chen BT (2021) Computed Tomography Radiomics for Predicting Pathological Grade of Renal Cell Carcinoma. Front. Oncol. 10:570396. doi: 10.3389/fonc.2020.570396
Received: 07 June 2020; Accepted: 08 December 2020;
Published: 27 January 2021.
Edited by:
Bo Gao, Affiliated Hospital of Guizhou Medical University, ChinaReviewed by:
Rongpin Wang, Guizhou Provincial People’s Hospital, ChinaHaiyi Wang, Chinese People’s Liberation Army General Hospital, China
Copyright © 2021 Yi, Xiao, Zeng, Yin, Li, Qian, Wang, Lei, Xu, Li, Li, Gong, Zee, Guan, Liu and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Longfei Liu, ZG9jdG9ybGxmQDE2My5jb20=; Xiao Guan, Nzk4NDc3OTUyQHFxLmNvbQ==
†These authors have contributed equally to this work