- 1Department of MRI, The First People’s Hospital of Yunnan Province, The Affiliated Hospital of Kunming University of Science and Technology, Kunming, China
- 2Department of Radiology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
- 3State Key Laboratory of Ultrasound in Medicine and Engineering, College of Biomedical Engineering, Chongqing Medical University, Chongqing, China
- 4MR Scientific Marketing, Siemens Healthineers, Shanghai, China
Purpose: The aim of this study was to evaluate the value of different multiparametric MRI-based radiomics models in differentiating stage IA endometrial cancer (EC) from benign endometrial lesions.
Methods: The data of patients with endometrial lesions from two centers were collected. The radiomics features were extracted from T2-weighted imaging (T2WI), diffusion-weighted imaging (DWI), apparent diffusion coefficient (ADC) map, and late contrast-enhanced T1-weighted imaging (LCE-T1WI). After data dimension reduction and feature selection, nine machine learning algorithms were conducted to determine which was the optimal radiomics model for differential diagnosis. The univariate analyses and logistic regression (LR) were performed to reduce valueless clinical parameters and to develop the clinical model. A nomogram using the radscores combined with clinical parameters was developed. Two integrated models were obtained respectively by the ensemble strategy and stacking algorithm based on the clinical model and optimal radiomics model. The area under the curve (AUC), clinical decisive curve (CDC), net reclassification index (NRI), and integrated discrimination index (IDI) were used to evaluate the performance and clinical benefits of the models.
Results: A total of 371 patients were incorporated. The LR model was the optimal radiomics model with the highest average AUC (0.854) and accuracy (0.802) in the internal and external validation groups (AUC = 0.910 and 0.798, respectively), and outperformed the clinical model (AUC = 0.739 and 0.592, respectively) or the radiologist (AUC = 0.768 and 0.628, respectively). The nomogram (AUC = 0.917 and 0.802, respectively) achieved better discrimination performance than the optimal radiomics model in two validation groups. The stacking model (AUC = 0.915) and ensemble model (AUC = 0.918) had a similar performance compared with the nomogram in the internal validation group, whereas the AUCs of the stacking model (AUC = 0.792) and ensemble model (AUC = 0.794) were lower than those of the nomogram and radiomics model in the external validation group. According to the CDC, NRI, and IDI, the optimal radiomics model, nomogram, stacking model, and ensemble model achieved good net benefits.
Conclusions: Multiparametric MRI-based radiomics models can non-invasively differentiate stage IA EC from benign endometrial lesions, and LR is the best machine learning algorithm. The nomogram presents excellent and stable diagnostic efficiency.
Introduction
Endometrial cancer (EC) and endometrial hyperplasia and polyps are the most common malignant and benign uterine endometrial cavity lesions, respectively (1). In order to avoid insufficient curing or excessive treatment and to protect the patient’s fertility, it is necessary to accurately identify benign and malignant endometrial lesions before operation. Although endometrial samplings such as dilatation and curettage, endometrial cytology, and biopsy can preoperatively identify some endometrial lesions (2), they do not always provide a definitive diagnosis. Because these procedures are often performed in a blind manner, they may be subject to sampling error and cannot properly diagnose focal endometrial lesions (3). Furthermore, they are difficult to perform in patients with pelvic organ prolapse and vaginal or cervical stenosis (4). In addition, endometrial sampling procedures are invasive with some complications including pain, discomfort, and bleeding. Hence, it is important to find a noninvasive method to distinguish benign and malignant uterine lesions.
Magnetic resonance imaging (MRI) with excellent soft tissue contrast resolution plays an important role in the preoperative diagnosis and staging of EC in situations where it is difficult to obtain histologic samples, and is more sensitive than transvaginal sonography for diagnosing endometrial lesions (1, 5). Multiparametric MRI, including T2-weighted imaging (T2WI), contrast-enhanced MRI (CE-MRI), diffusion-weighted imaging (DWI), and apparent diffusion coefficient (ADC) are increasingly being applied for diagnosing various endometrial lesions (1). However, conventional imaging evaluation of the uterine cavity lesions may present many challenges to the radiologist. The endometrium structure is susceptible to age, menopausal status, menstrual cycle, and hormonal replacement therapy (6). There are a variety of appearances and overlapping imaging features of early-stage EC and benign mimickers (7). Moreover, the experience of the radiologist usually contributes to high interobserver variation. All of these factors lead to inaccurate diagnoses.
Radiomics is an emerging field of application of artificial intelligence in medical imaging by extracting high-throughput quantitative image features and is a problem-solving tool when there is a dilemma in conventional imaging diagnosis (8). Recently, MRI-based radiomics has been gradually applied in the evaluation of EC including risk stratification (9–11), lymph node metastasis (12–14), myometrial invasion (15–17), prognosis and recurrence (18–20), and histological characteristics (21–23). Chen et al. (24) had confirmed that MRI-based radiomics was a valuable tool for distinguishing EC from benign mimics. However, they included stage IB to IV ECs that were easily distinguishable from benign uterine lesions, and only one machine learning algorithm model was studied. Therefore, this study aims to compare the performance of various multiparametric MRI-based machine learning radiomics models in differentiating stage IA EC from benign endometrial lesions, and further assess the potential utility of diverse integrated models utilizing clinical parameters and radiomics features.
Materials and methods
Study population
Ethical approval was obtained for this retrospective study, and written informed consent was waived. Between January 2017 and June 2021, consecutive patients with endometrial lesions from center A and center B were collected. Inclusion criteria were as follows: (1) patients with stage IA EC, endometrial hyperplasia, or endometrial polyps confirmed by histopathology; (2) underwent MR examination including T2WI, DWI, and dynamic contrast-enhanced (DCE-MRI) within 2 weeks prior to treatment; and (3) complete clinical data. Exclusion criteria were as follows: (1) MRI quality did not meet the requirement of analysis; (2) received treatment before the MR examination; (3) the maximum diameter of the lesion was less than 1 cm; and (4) patients with other pelvic diseases. Patients from center A were randomly allocated into the training group and the internal validation group at a ratio of 3:1. All patients from center B served as the external validation group. Clinical and histological characteristics of all patients, including histological subtypes, age, menopause, clinical manifestation, metabolic syndrome, body mass index (BMI), actual treatment options, and CA125 and CA199 level, and immunohistochemical findings such as estrogen receptor (ER), progesterone receptor (PR), P53, and Ki-67 were collected.
Imaging acquisition and lesion segmentation
All MR examinations were performed using 1.5/3.0-T scanners (GE Signa HDXt, Siemens Prisma, and Siemens Aera) with eight-channel phased-array abdominal coils. Each patient underwent preoperative MR scanning using the standard protocol. In the study, uterus-axial T2WI, DWI (b-value = 1,000 s/mm2), ADC map, and late contrast-enhanced T1-weighted imaging (LCE-T1WI) were acquired for lesion segmentation. Parameter details are shown in Table 1. Some parameters would be adjusted according to the individual differences of patients. The ADC map was automatically reconstructed and generated after scanning DWI by the Siemens MRI scanners, or manually reconstructed on the Functool Software (ADW 4.7 Workstation) by the GE MRI scanner. CE-T1WI was performed immediately after administering a standard dose (0.1 mmol/kg) of gadopentetate dimeglumine (Magnevist; Bayer Healthcare Pharmaceuticals, Germany) at approximately 2 ml/s via the elbow vein. Uterus-axial LCE-T1WI was obtained at 240 s into the examination after the contrast agent injection.
The original MR images of uterus-axial T2WI, DWI, ADC map, and LCE-T1WI in Digital Imaging and Communications in Medicine (DICOM) format were loaded into 3D Slicer 4.11.0 software (https://www.slicer.org/). Region of interest (ROI) of the lesion was manually delineated layer by layer to form three-dimensional (3D) volume of interest (VOI) by two radiologists (reader 1 and reader 2, with 3 years and 7 years of experience in pelvic MRI, respectively), with unknown clinical information and pathological diagnosis. Reader 1 delineated the boundary of all lesions on uterus-axial T2WI, DWI, and LCE-T1WI, respectively. After 2 months, reader 1 and reader 2 randomly selected the same 50 patients to outline. Care was taken to avoid including endometrial cavity fluid and hematocele and nearby normal myometrium, but necrotic, bleeding, and cystic areas inside the tumor can be included.
Feature extraction and selection
The open-source Python package Pyradiomics (https://pypi.org/project/pyradiomics/) was used to extract radiomics features from the VOI of each patient at the 3D Slicer platform. To obtain isotropic voxels, the VOIs were resampled to 3 × 3 × 3 mm, then cubic spline interpolation was performed. In order to reduce the imaging differences among different MRI scanners, image normalization was performed so that all gray-level values in the images were distributed in the range of 0–600. A fixed bin width of 1 was selected to ensure better comparability of MRI gray values as suggested in a previous study (12). Before feature extraction, several built-in filters such as gradient, exponent, logarithm, square, square root, wavelet, and Laplacian of Gaussian (LOG) filters were applied on the normalized MR images, and derived images were achieved. The extracted features were divided into the following categories (25): first-order features, two-dimensional features, gray-level co-occurrence matrix (GLCM), gray-level dependence matrix (GLDM), gray-level size-zone matrix (GLSZM), gray-level run-length matrix (GLRLM), and neighboring gray tone difference matrix (NGTDM). A total of 1,781 radiomics features were extracted from each MRI modality, resulting in 7,124 radiomics features for each patient in total. All the above features were standardized by the Z score.
The datasets of the patients with stage IA EC and benign endometrial lesions were balanced by using the synthetic minority oversampling technique in the training group. To ensure repeatability and avoid the subjective difference in lesion segmentation, the intraclass correlation coefficient (ICC) of each feature was calculated. Only features with ICC values ≥ 0.75 between observers and within observers were retained. Pearson correlation coefficients were calculated for identifying redundant features. If the correlation coefficient of two features was ≥ 0.9, the feature with the largest mean absolute correlation was deleted. Whereafter, least absolute shrinkage and selection operator (LASSO) was used to select the most representative features and 10-fold cross-validation was performed (26).
Model building
On the construction of the clinical model, firstly, univariate analysis was conducted to compare the clinical characteristics of benign and malignant endometrial lesions in the training group, and find out the clinical parameters with statistically significant difference. Secondly, the individual predictors of stage IA EC were chosen according to the univariate logistic regression (LR) analysis. Finally, the clinical model was constructed based on the multivariate LR, and the efficient clinical predictive parameters were selected.
Different radiomics models were developed and tested respectively to predict stage IA endometrial cancer based on the following nine machine learning classification algorithms: LR, support vector machine (SVM), stochastic gradient descent (SGD), K nearest neighbor (KNN), decision tree (DT), random forest (RF), extremely randomized trees (ET), eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM). A fivefold cross-validation strategy was applied to tune and optimize the model parameter, and assess the performance of the models. Referring to a recently published study (27), the machine learning algorithm with the highest average area under the receiver operating characteristic (ROC) curve (AUC) of the internal and external validation group was used to construct the optimal radiomics model. Then, the radiomics score (radscore) was calculated. A nomogram based on the multivariate LR analysis was developed by using the combination of clinical predictive parameters and radscore in the training group.
The stacking model is an integrated learning technology, which can combine the predictions of learned classifiers in order to create prediction of new instances to improve overall performance (28). In the study, a two-tier stacking model was conducted; the first tier was the above clinical model and the optimal radiomics model, and the second tier used the output of the first tier as the input of the multivariate LR. The ensemble algorithm is developed using superlearner (29), and belongs to an integrated strategy. According to the accuracy weight, the predictions obtained from the foregoing clinical model and radiomics model were calculated by the weighted average method and the new output as the final results.
Through the nomogram, stacking model, or ensemble model, the clinical and radiomics features were combined, so as to achieve model fusion. All model building was implemented in Python (https://www.python.org/getit/), and the detailed process of model building is shown in Figure 1. The AUC, accuracy, sensitivity, specificity, and calibration curve were used as metrics to assess the performance and goodness of fit of the models.
Clinical application of the models
One radiologist (reader 3, with 30 years of experience in pelvic MRI) who was blind to the clinicopathological information of the patient independently reviewed the MR images to diagnose stage IA EC and benign uterine disease in the training and validation groups. The AUC, accuracy, sensitivity, and specificity of the radiologist were calculated. Clinical decisive curve (CDC), net reclassification index (NRI), and integrated discrimination index (IDI) were performed to estimate the clinical usefulness and net benefit of different models and the radiologist by comparing the actual treatment options of patients.
Correlations between radiomics features and immunohistochemical findings
In order to explore the correlation between radiomics information and histological characteristics, Spearman correlation coefficients were used to evaluate the correlations between the selected radiomics features and immunohistochemical findings.
Statistical analysis
All statistical tests were performed using SPSS 26.0 (IBM, New York, USA), R software 4.1.2 (https://www.r-project.org/), and Python 3.9.7 (https://www.python.org/). Continuous variables and categorical variables were respectively expressed as mean value ± standard deviation and counts. The Kolmogorov–Smirnov test was used to check the normality of the continuous data distribution. Continuous variables were analyzed using one-way ANOVA, Mann–Whitney U test, or Kruskal–Wallis test. Categorical variables were compared using the Chi-square test or Fisher’s exact test. Univariate and multivariate LR analyses were used to filtrate the clinical predictors and model building. A p-value less than 0.05 was considered statistically significant. Pearson correlation analyses were performed to assess correlations between continuous variables, and Spearman correlation analyses were used to evaluate the correlations between continuous variables and ranked data. If p < 0.05, there were correlations between the variables.
Results
Clinical parameters
A total of 371 patients were divided into the training group (245 patients from center A), the internal validation group (82 patients from center A), and the external validation group (44 patients from centers B). The clinicopathological characteristics of incorporated patients are listed in Table 2. The 371 patients included 234 patients with stage IA EC and 137 patients with benign endometrial lesions. Three hundred and twenty patients were treated following the protocol for EC and 51 patients for benign endometrial disease. A total of 112 (30.2%) patients received inappropriate treatment, including 13 (3.5%) patients with stage IA EC who were undertreated and 99 (26.7%) patients with benign endometrial lesions who were overtreated. Univariate analysis showed that the mean age of patients with stage IA EC (51.65 ± 7.94) was significantly older than that of patients with benign uterine lesions (48.12 ± 8.35) in the training group (p = 0.001). Compared with benign endometrial lesions, there were more patients with irregular vaginal bleeding and menopause in stage IA EC (p < 0.05). No significant differences in metabolic syndrome, BMI, CA125, and CA199 between patients with stage IA EC and benign endometrial lesions were shown (p > 0.05). According to the univariate and multivariate LR analysis, age and irregular vaginal bleeding were the valid predictive parameters.
Feature selection and optimal machine learning algorithm
Among all the extracted features, 3,356 features were excluded because the ICC values between observers or within observers were <0.75. There were 847 features retained after the Pearson correlation analysis. Finally, the LASSO classifier selected 18 features as shown in Figure 2.
Figure 2 Feature selection using the least absolute shrinkage and selection operator (LASSO) regression model. The cross-validation plot (A) and the coefficient profile plot (B).
The AUC and accuracy of radiomics models constructed by nine machine learning algorithms are shown in Table 3, and the broken line graphs of accuracy for different algorithms in the training group, internal validation group, and external validation group are presented in Figures 3A–C. The LR algorithm showed the highest average AUC (0.854) in the validation groups, and also had the highest average accuracy (0.802). Therefore, LR was considered to be the optimal machine learning algorithm for radiomics model building. The radscore was calculated based on the coefficients and intercepts obtained from the LR model. The selected features and weights are shown in Figure 3D. The top four features that contribute most to the radiomics model were CE_original_shape_flatness, T2_exponential_GLSZM_zone percentage, DWI_LOG-sigma-6-0-mm-3D_first order_root mean squared, and ADC_LOG-sigma-2-0-mm-3D_first order_median, respectively.
Figure 3 Different model building. Broken line graphs of accuracy for different machine learning algorithms in the training group (A), the internal validation group (B), and the external validation group (C). Bar chart of feature weight for the logistic regression model (D). Nomogram of the training group (E).
Performance and clinical application of different models
A nomogram was constructed by using the clinical predictive parameters (age and irregular vaginal bleeding) and the radscore (Figure 3E). The diagnostic performance of each model and radiologist is displayed in Table 4. Figure 4 shows ROC curves and calibration curves of different models. In the training group, the AUCs of the clinical model, radiomics model, nomogram, stacking model, ensemble model, and radiologist were 0.760, 0.921, 0.922, 0.925, 0.916, and 0.769, respectively. In the internal validation group, they were 0.739, 0.910, 0.917, 0.915, 0.918, and 0.768, respectively. In the external validation group, they were 0.592, 0.798, 0.802, 0.792, 0.794, and 0.628, respectively. According to the calibration curves, the Brier scores of the clinical model, radiomics model, nomogram, stacking model, and ensemble model were 0.200, 0.114, 0.114, 0.113, and 0.129, respectively in the training group. They were 0.206, 0.123, 0.118, 0.119, and 0.129, respectively, in the internal validation group, and they were 0.274, 0.188, 0.184, 0.182, and 0.184, respectively, in the external validation group. The radiomics model, nomogram, stacking model, and ensemble model demonstrated good goodness of fit due to their Brier scores being <0.25.
Figure 4 Receiver operator characteristic (ROC) curves (A–C) and calibration curves (D–F) of different models in the training group (A, D), the internal validation group (B, E), and the external validation group (C, F).
The CDCs of the different models and the radiologist are presented in Figure 5, and the NRI and IDI are shown in Table 3. The results showed that the radiomics model, nomogram, stacking model, and ensemble model for predicting stage IA EC added benefit and performed better than the actual treatment options in the training and validation groups (p < 0.05). In the training group, the NRI and IDI of the clinical model, radiomics model, nomogram, stacking model, ensemble model, and radiologist were 0.130 and 0.023, 0.414 and 0.393, 0.429 and 0.396, 0.451 and 0.498, 0.410 and 0.397, and 0.319 and 0.242, respectively. In the internal validation group, they were −0.163 and −0.034, 0.307 and 0.341, 0.395 and 0.362, 0.395 and 0.356, 0.395 and 0.366, and 0.216 and 0.137, respectively. In the external validation group, they were −0.068 and −0.004, 0.423 and 0.272, 0.368 and 0.234, 0.423 and 0.255, 0.368 and 0.241, and 0.188 and 0.099, respectively.
Figure 5 Clinical decision curves (CDCs) of different models and the radiologist in the training group (A), the internal validation group (B), and the external validation group (C).
Correlations between radiomics features and immunohistochemical findings
A heatmap (Figure 6) showed that the selected radiomics features were not correlated with immunohistochemical findings (ER, PR, P53, and Ki-67) (all p > 0.05). The selected sequences for lesion segmentation and pathological and immunohistochemical pictures are presented in Figure 7.
Figure 6 Heatmap of the correlations between the selected radiomics features and ER, PR, P53, and Ki-67.
Figure 7 A 46-year-old woman with stage IA endometrial cancer (EC) whose main clinical complication was irregular vaginal bleeding for 3 months. The selected MR images for lesion segmentation included uterus-axial T2-weighted imaging (T2WI) (A), diffusion-weighted imaging (DWI) (B), apparent diffusion coefficient (ADC) map (C), and late contrast-enhanced T1-weighted imaging (LCE-T1WI) (D). The estrogen receptor (ER) (E), progesterone receptor (PR) (F), P53 (G), and Ki-67 (H) immunohistochemical staining (40×) showed 90%, 90%, focal, and 80% positive cells, respectively.
Discussion
In the study, age and irregular vaginal bleeding were the valid predictive parameters in the clinical model. On the basis of several common machine learning algorithms, the diverse multiparametric MRI-based radiomics models were developed to differentiate stage IA EC from benign endometrial lesions, and the LR algorithm model was selected as the optimal radiomics model with the highest AUC and accuracy. Compared with the clinical model and the radiologist, the optimal radiomics model and the compositive models combining clinical parameters with radiomics features, like the nomogram, stacking model, and ensemble model, showed better diagnostic performance and achieved good clinical net benefits. The nomogram had a higher AUC than the optimal radiomics model, and revealed more stable discrimination efficiency and better generalization ability than stacking and ensemble models.
The standard surgery of early-stage EC is total hysterectomy with bilateral salpingo-oophorectomy with or without lymphadenectomy/radiotherapy/chemotherapy (30), while the treatment for benign endometrial lesions is a minimally invasive approach compared to hysterectomy, such as hysteroscopic resection or conservative treatment (31, 32). In this study, 3.5% of patients with stage IA EC had undergone inadequate surgery and 26.7% of patients with benign endometrial lesions had undergone overtreatment. As a consequence, the rationalization of treatment options is crucial for patients with stage IA EC and benign mimickers. The most common symptom of EC is irregular vaginal bleeding, which often occurs in the early stage, and the American Cancer Society recommended that all women older than 65 years should be advised to seek risk evaluation of EC if bleeding occurs (33). Therefore, age and irregular vaginal bleeding could be used as effective clinical predictors of stage IA EC. Benign endometrial lesions, such as endometrial hyperplasia and polyps, are highly prevalent in postmenopausal women; symptoms include abnormal uterine bleeding (31, 32). Due to the overlapping clinical features of benign endometrial lesions and EC, the AUC and the diagnostic accuracy of clinical model on the training group and validation groups were low.
In radiomics, the digital medical images that hold information of tumor pathophysiology are transformed into quantitative high-dimensional data to improve medical decision-making, and are gaining importance in cancer research (8). In this study, the radiomics models had high diagnostic performance, which was consistent with the research of Chen et al. (24). The models with high efficiency and reliability are fundamental factors driving the success of radiomics (34), and the recognition of optimal machine learning methods for radiomics models is crucial (35); thus, multiple machine learning algorithms should be employed. We trained nine common classification algorithms, namely, LR, SVM, SGD, KNN, DT, RF, ET, XGBoost, and LightGBM, in model establishment. LR performed best among all classifiers, and the reason might be that complex models required more training samples (36). The optimal radiomics model modeled by LR had higher AUCs and diagnostic accuracies than those of the clinical model and the radiologist in this study. This result further confirmed that radiomics could be a problem-solving tool when there is a dilemma in clinical diagnosis and the observation of conventional imaging (8).
Unlike the study of Chen et al. (24), CE-MRI was included and extracted features in our study. The top four vital features in the optimal radiomics model were from CE-MRI, T2WI, DWI, and the ADC map, respectively. Due to the differences in vascular permeability and microvessel density between EC and benign lesions, most ECs showed early maximal enhancement and late gradual washout, and frequently showed lower signal intensity than the myometrium on LCE-MRI. In contrast, benign lesions showed delayed persistent enhancement pattern, and tended to show higher signal intensity than the myometrium on LCE-MRI (37). The shape of benign and malignant endometrial lesions might be more clearly shown on LCE-MRI. Flatness shows the relationship between the largest and smallest principal components in the ROI shape (8). In consequence, CE_original_shape_flatness was the most contributing feature. Endometrial polyp and hyperplasia are rich in fibrous stromal structures and endometrial glands (1). Specific MRI findings such as a fibrous tissue (hypointensity on T2WI) and intratumoral cysts (hyperintensity on T2WI) might be useful to differentiate benign endometrial lesions from EC (1). Additionally, the zone percentage of GLSZM features represents the coarseness of the texture, and can better reflect the heterogeneity of different tumors (8). Thus T2_exponential_GLSZM_zone percentage was also an important feature. A previous study had suggested that DWI with ADC values were a potential quantitative and qualitative tool for differentiating between early-stage EC and benign mimickers (38). On DWI, benign endometrial lesions showed low signal intensity, which was an important point in differentiating them from EC that showed high signal intensity due to relatively high cellularity (39). Nevertheless, the top two important features were not derived from DWI and the ADC map in this study. The possible reason was that DWI was acquired in different scanners, which might lead to inconsistency in image quality and ADC estimation across vendors, although the models remained effective after cross-validation in datasets from scanners with different manufacturers or with different Tesla. Another possible reason was that benign uterine lesions rich in cystic areas and mucus might increase DWI signal intensity due to the influence of the T2-penetration effect, and hemorrhagic areas and mucous components could reduce the signal intensity of the ADC map, which would lead to a slight difference between DWI and ADC maps of benign and malignant endometrial lesions, thus resulting in the reduction of the weight of their features.
Gatenby et al. (40) believed that radiomics features could offer information on the phenotype and microenvironment of tumors, which was complementary to other data like clinical parameters. Radiomics features combined with clinical parameters and other pertinent data can produce accurate robust evidence-based clinical-decision support systems (8). In this study, according to ROCs, CDCs, NRI, and IDI, the compositive models modeled by clinical parameters and radiomics features, such as the nomogram, stacking, and ensemble models, showed better diagnostic performance and achieved better clinical net benefits than the clinical model and the radiologist. Compared with the radiomics model, the nomogram had a higher AUC. Yan et al. (11) developed an MRI- and clinical-based radiomics nomogram to preoperatively assess high-risk EC, and obtained a similar result to this study, which was the prediction efficiency of nomogram was better than that of the radiomics model. The advantage of the ensemble strategy was that it can reduce the variance and bias of the model by a powerful process of majority vote or group averaging, and it improves the robustness and generalizability of the model in prediction and classification (27). A recent study had confirmed that the two-tier stacking model could further improve the generalization ability of the radiomics model compared with the single model (41). In the present study, the diagnostic performance of the stacking model and ensemble model was similar with that of the nomogram and better than that of the radiomics model in the internal validation group, whereas the AUCs of the stacking model and ensemble model were lower than those of the nomogram and radiomics model in the external validation group. Therefore, the nomogram presented more excellent and stable differential diagnostic efficiency than stacking and ensemble models with good reproducibility and reliability.
There were some limitations in the study. First, this study only collected patients from two centers. Patients from more centers need to be included to improve the universality of the model in clinical application. Second, the MRI systems and scanning parameters were not uniform, and it may influence the models’ results, especially in the external validation group. Third, only traditional radiomics features were extracted; the deep-learning-based features were not investigated. In the future, we will conduct in-depth learning combined with traditional radiomics to build models. Last, manual lesion segmentation is time-consuming and is easily affected by the experience of readers; automatic or semiautomatic methods that delineate lesions more accurately need to be explored in the future.
Conclusions
The multiparametric MRI-based radiomics models can be conveniently used for preoperative identification of patients with stage IA EC and benign endometrial lesions, and the model established by the LR algorithm has the highest accuracy. Incorporating radiomics and clinical parameters (age and irregular vaginal bleeding) into a combined model to estimate patients was more accurate than the clinical model and the radiologist. This study is beneficial in noninvasively identifying benign and malignant endometrial lesions that are difficult to determine by clinicians and radiologists before surgery, avoiding misdiagnosis and missed diagnosis, and providing a basis for the patient protocol of individualized diagnosis and treatment.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving human participants were reviewed and approved by Medical Ethics review committee of the First People’s Hospital of Yunnan Province. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.
Author contributions
QB designed the study, performed the statistical analysis, and wrote the manuscript. YXW collected the data and performed the statistical analysis. YD, YL, and YP collected the data. YS and YZW revised the manuscript. KW guaranteed the integrity of the entire study. All authors approved the submitted version of the manuscript.
Acknowledgments
We sincerely thank Programmer Guanghong Yang and Platform Onekey AI for Code consultation of the study.
Conflict of interest
Authors YS and YW were employed by Siemens Healthineers.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Lee Y, Kim KA, Song MJ, Park YS, Lee J, Choi JW, et al. Multiparametric magnetic resonance imaging of endometrial polypoid lesions. Abdom Radiol (NY) (2020) 45(11):3869–81. doi: 10.1007/s00261-020-02567-7
2. Koh WJ, Abu-Rustum NR, Bean S, Bradley K, Campos SM, Cho KR, et al. Uterine neoplasms, version 1.2018, NCCN clinical practice guidelines in oncology. J Natl Compr Canc Netw (2018) 16(2):170–99. doi: 10.6004/jnccn.2018.0006
3. Svirsky R, Smorgick N, Rozowski U, Sagiv R, Feingold M, Halperin R, et al. Can we rely on blind endometrial biopsy for detection of focal intrauterine pathology? Am J Obstet Gynecol (2008) 199(2):111–5. doi: 10.1016/j.ajog.2008.02.015
4. Narice BF, Delaney B, Dickson JM. Endometrial sampling in low-risk patients with abnormal uterine bleeding: A systematic review and meta-synthesis. BMC Fam Pract (2018) 19(1):135. doi: 10.1186/s12875-018-0817-3
5. Bi Q, Chen Y, Wu K, Wang J, Zhao Y, Wang B, et al. The diagnostic value of MRI for preoperative staging in patients with endometrial cancer: A meta-analysis. Acad Radiol (2020) 27(7):960–8. doi: 10.1016/j.acra.2019.09.018
6. Nalaboff KM, Pellerito JS, Ben-Levi E. Imaging the endometrium: Disease and normal variants. Radiographics (2001) 21(6):1409–24. doi: 10.1148/radiographics.21.6.g01nv211409
7. Kierans AS, Bennett GL, Haghighi M, Rosenkrantz AB. Utility of conventional and diffusion-weighted MRI features in distinguishing benign from malignant endometrial lesions. Eur J Radiol (2014) 83(4):726–32. doi: 10.1016/j.ejrad.2013.11.030
8. Lambin P, Leijenaar R, Deist TM, Peerlings J, de Jong E, van Timmeren J, et al. Radiomics: The bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol (2017) 14(12):749–62. doi: 10.1038/nrclinonc.2017.141
9. Zhang K, Zhang Y, Fang X, Fang M, Shi B, Dong J, et al. Nomograms of combining apparent diffusion coefficient value and radiomics for preoperative risk evaluation in endometrial carcinoma. Front Oncol (2021) 11:705456. doi: 10.3389/fonc.2021.705456
10. Chen J, Gu H, Fan W, Wang Y, Chen S, Chen X, et al. MRI-Based radiomic model for preoperative risk stratification in stage I endometrial cancer. J Cancer (2021) 12(3):726–34. doi: 10.7150/jca.50872
11. Yan BC, Li Y, Ma FH, Feng F, Sun MH, Lin GW, et al. Preoperative assessment for high-risk endometrial cancer by developing an MRI- and clinical-based radiomics nomogram: A multicenter study. J Magn Reson Imaging (2020) 52(6):1872–82. doi: 10.1002/jmri.27289
12. Yan BC, Li Y, Ma FH, Zhang GF, Feng F, Sun MH, et al. Radiologists with MRI-based radiomics aids to predict the pelvic lymph node metastasis in endometrial cancer: A multicenter study. Eur Radiol (2021) 31(1):411–22. doi: 10.1007/s00330-020-07099-8
13. Yang LY, Siow TY, Lin YC, Wu RC, Lu HY, Chiang HJ, et al. Computer-aided segmentation and machine learning of integrated clinical and diffusion-weighted imaging parameters for predicting lymph node metastasis in endometrial cancer. Cancers (Basel) (2021) 13(6):1406. doi: 10.3390/cancers13061406
14. Xu X, Li H, Wang S, Fang M, Zhong L, Fan W, et al. Multiplanar MRI-based predictive model for preoperative assessment of lymph node metastasis in endometrial cancer. Front Oncol (2019) 9:1007. doi: 10.3389/fonc.2019.01007
15. Zhu X, Ying J, Yang H, Fu L, Li B, Jiang B. Detection of deep myometrial invasion in endometrial cancer MR imaging based on multi-feature fusion and probabilistic support vector machine ensemble. Comput Biol Med (2021) 134:104487. doi: 10.1016/j.compbiomed.2021.104487
16. Rodriguez-Ortega A, Alegre A, Lago V, Carot-Sierra JM, Ten-Esteve A, Montoliu G, et al. Machine learning-based integration of prognostic magnetic resonance imaging biomarkers for myometrial invasion stratification in endometrial cancer. J Magn Reson Imaging (2021) 54(3):987–95. doi: 10.1002/jmri.27625
17. Stanzione A, Cuocolo R, Del GR, Nardiello A, Romeo V, Travaglino A, et al. Deep myometrial infiltration of endometrial cancer on MRI: A radiomics-powered machine learning pilot study. Acad Radiol (2021) 28(5):737–44. doi: 10.1016/j.acra.2020.02.028
18. Jacob H, Dybvik JA, Ytre-Hauge S, Fasmer KE, Hoivik EA, Trovik J, et al. An MRI-based radiomic prognostic index predicts poor outcome and specific genetic alterations in endometrial cancer. J Clin Med (2021) 10(3):538. doi: 10.3390/jcm10030538
19. Hoivik EA, Hodneland E, Dybvik JA, Wagner-Larsen KS, Fasmer KE, Berg HF, et al. A radiogenomics application for prognostic profiling of endometrial cancer. Commun Biol (2021) 4(1):1363. doi: 10.1038/s42003-021-02894-5
20. Zhang K, Zhang Y, Fang X, Dong J, Qian L. MRI-Based radiomics and ADC values are related to recurrence of endometrial carcinoma: A preliminary analysis. BMC Cancer (2021) 21(1):1266. doi: 10.1186/s12885-021-08988-x
21. Zheng T, Yang L, Du J, Dong Y, Wu S, Shi Q, et al. Combination analysis of a radiomics-based predictive model with clinical indicators for the preoperative assessment of histological grade in endometrial carcinoma. Front Oncol (2021) 11:582495. doi: 10.3389/fonc.2021.582495
22. Yan BC, Ma XL, Li Y, Duan SF, Zhang GF, Qiang JW. MRI-Based radiomics nomogram for selecting ovarian preservation treatment in patients with early-stage endometrial cancer. Front Oncol (2021) 11:730281. doi: 10.3389/fonc.2021.730281
23. Luo Y, Mei D, Gong J, Zuo M, Guo X. Multiparametric MRI-based radiomics nomogram for predicting lymphovascular space invasion in endometrial carcinoma. J Magn Reson Imaging (2020) 52(4):1257–62. doi: 10.1002/jmri.27142
24. Chen X, Wang X, Gan M, Li L, Chen F, Pan J, et al. MRI-Based radiomics model for distinguishing endometrial carcinoma from benign mimics: A multicenter study. Eur J Radiol (2022) 146:110072. doi: 10.1016/j.ejrad.2021.110072
25. van Griethuysen J, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res (2017) 77(21):e104–7. doi: 10.1158/0008-5472.CAN-17-0339
26. Sauerbrei W, Royston P, Binder H. Selection of important variables and determination of functional form for continuous predictors in multivariable model building. Stat Med (2007) 26(30):5512–28. doi: 10.1002/sim.3148
27. Rui W, Qiao N, Wu Y, Zhang Y, Aili A, Zhang Z, et al. Radiomics analysis allows for precise prediction of silent corticotroph adenoma among non-functioning pituitary adenomas. Eur Radiol (2022) 32(3):1570–8. doi: 10.1007/s00330-021-08361-3
28. Naimi AI, Balzer LB. Stacked generalization: An introduction to super learning. Eur J Epidemiol (2018) 33(5):459–64. doi: 10.1007/s10654-018-0390-z
29. Schuler MS, Rose S. Targeted maximum likelihood estimation for causal inference in observational studies. Am J Epidemiol (2017) 185(1):65–73. doi: 10.1093/aje/kww165
30. Concin N, Matias-Guiu X, Vergote I, Cibula D, Mirza MR, Marnitz S, et al. ESGO/ESTRO/ESP guidelines for the management of patients with endometrial carcinoma. Radiother Oncol (2021) 154:327–53. doi: 10.1016/j.radonc.2020.11.018
31. Auclair MH, Yong PJ, Salvador S, Thurston J, Colgan T, Sebastianelli A. Guideline no. 390-classification and management of endometrial hyperplasia. J Obstet Gynaecol Can (2019) 41(12):1789–800. doi: 10.1016/j.jogc.2019.03.025
32. Wolfman W. No. 249-asymptomatic endometrial thickening. J Obstet Gynaecol Can (2018) 40(5):e367–77. doi: 10.1016/j.jogc.2018.03.005
33. Braun MM, Overbeek-Wager EA, Grumbo RJ. Diagnosis and management of endometrial cancer. Am Fam Physician (2016) 93(6):468–74. Available at: https://www.aafp.org/pubs/afp/issues/2016/0315/p468.html
34. Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images are more than pictures, they are data. Radiology (2016) 278(2):563–77. doi: 10.1148/radiol.2015151169
35. Parmar C, Grossmann P, Bussink J, Lambin P, Aerts H. Machine learning methods for quantitative radiomic biomarkers. Sci Rep (2015) 5:13087. doi: 10.1038/srep13087
36. Mao B, Ma J, Duan S, Xia Y, Tao Y, Zhang L. Preoperative classification of primary and metastatic liver cancer Via machine learning-based ultrasound radiomics. Eur Radiol (2021) 31(7):4576–86. doi: 10.1007/s00330-020-07562-6
37. Park BK, Kim B, Park JM, Ryu JA, Kim MS, Bae DS, et al. Differentiation of the various lesions causing an abnormality of the endometrial cavity using MR imaging: Emphasis on enhancement patterns on dynamic studies and late contrast-enhanced T1-weighted images. Eur Radiol (2006) 16(7):1591–8. doi: 10.1007/s00330-005-0085-1
38. Wang X, Zhao Y, Hu Y, Zhou Y, Ye X, Liu K, et al. Evaluation and validation of the diagnostic value of the apparent diffusion coefficient for differentiating early-stage endometrial carcinomas from benign mimickers at 3T MRI. Oncotarget (2017) 8(28):46390–7. doi: 10.18632/oncotarget.18553
39. Bakir B, Sanli S, Bakir VL, Ayas S, Yildiz SO, Iyibozkurt AC, et al. Role of diffusion weighted MRI in the differential diagnosis of endometrial cancer, polyp, hyperplasia, and physiological thickening. Clin Imaging (2017) 41:86–94. doi: 10.1016/j.clinimag.2016.10.016
40. Gatenby RA, Grove O, Gillies RJ. Quantitative imaging in cancer evolution and ecology. Radiology (2013) 269(1):8–15. doi: 10.1148/radiol.13122697
Keywords: endometrial cancer, magnetic resonance imaging, radiomics, nomogram, benign endometrial lesions
Citation: Bi Q, Wang Y, Deng Y, Liu Y, Pan Y, Song Y, Wu Y and Wu K (2022) Different multiparametric MRI-based radiomics models for differentiating stage IA endometrial cancer from benign endometrial lesions: A multicenter study. Front. Oncol. 12:939930. doi: 10.3389/fonc.2022.939930
Received: 09 May 2022; Accepted: 12 July 2022;
Published: 05 August 2022.
Edited by:
Alessio G. Morganti, University of Bologna, ItalyReviewed by:
Tao Yu, China Medical University, ChinaKomsun Suwannarurk, Thammasat University, Thailand
Copyright © 2022 Bi, Wang, Deng, Liu, Pan, Song, Wu and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kunhua Wu, a2hjZ3pAc2luYS5jb20=
†These authors have contributed equally to this work and share first authorship