Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 11 July 2024
Sec. Breast Cancer

Predicting Ki-67 expression levels in breast cancer using radiomics-based approaches on digital breast tomosynthesis and ultrasound

Jie Liu&#x;Jie Liu1†Caiying Yan&#x;Caiying Yan1†Chenlu LiuChenlu Liu1Yanxiao WangYanxiao Wang2Qian ChenQian Chen1Ying ChenYing Chen1Jianfeng Guo*Jianfeng Guo3*Shuangqing Chen*Shuangqing Chen1*
  • 1Department of Radiology, Nanjing Medical University Affiliated Suzhou Hospital, Suzhou, China
  • 2Department of Ultrasound, Sir Run Run Hospital Nanjing Medical University, Nanjing, China
  • 3Department of Ultrasound, Nanjing Medical University Affiliated Suzhou Hospital, Suzhou, China

Purpose: To construct and validate radiomics models that utilize ultrasound (US) and digital breast tomosynthesis (DBT) images independently and in combination to non-invasively predict the Ki-67 status in breast cancer.

Materials and methods: 149 breast cancer women who underwent DBT and US scans were retrospectively enrolled from June 2018 to August 2023 in total. Radiomics features were acquired from both the DBT and US images, then selected and reduced in dimensionality using several screening approaches. Establish radiomics models based on DBT, and US separately and combined. The area under the receiver operating characteristic curve (AUC), accuracy, specificity, and sensitivity were utilized to validate the predictive ability of the models. The decision curve analysis (DCA) was used to evaluate the clinical applicability of the models. The output of the classifier with the best AUC performance was converted into Rad-score and was regarded as Rad-Score model. A nomogram was constructed using the logistic regression method, integrating the Rad-Score and clinical factors. The model’s stability was assessed through AUC, calibration curves, and DCA.

Results: Support vector machine (SVM), logistic regression (LR), and random forest (RF) were trained to establish radiomics models with the selected features, with SVM showing optimal results. The AUC values for three models (US_SVM, DBT_SVM, and merge_SVM) were 0.668, 0.704, and 0.800 respectively. The DeLong test indicated a notable disparity in the area under the curve (AUC) between merge_SVM and US_SVM (p = 0.048), while there was no substantial variability between merge_SVM and DBT_SVM (p = 0.149). The DCA curve indicates that merge_SVM is superior to unimodal models in predicting high Ki-67 level, showing more clinical values. The nomogram integrating Rad-Score with tumor size obtained the better performance in test set (AUC: 0.818) and had more clinical net.

Conclusion: The fusion radiomics model performed better in predicting the Ki-67 expression level of breast carcinoma, but the gain effect is limited; thus, DBT is preferred as a preoperative diagnosis mode when resources are limited. Nomogram offers predictive advantages over other methods and can be a valuable tool for predicting Ki-67 levels in BC.

1 Introduction

Breast Cancer (BC) is a malignant tumor with the greatest morbidity and mortality rate all around the world, which is a severe risk to women’s health (1). Ki-67 is a nuclear antigen that is closely connected with the invasiveness and proliferative activity of breast cancer (2). It serves as a significant marker for breast cancer category (3), prognosis, as well as predicting the effectiveness of preoperative neoadjuvant chemotherapy and endocrine therapy (4, 5). Currently, Ki-67 is mainly detected by immunohistochemistry (IHC), which requires tissue specimens to be obtained by core-needle biopsy or surgery. However, these procedures are invasive, time-consuming, not repeatable, and the limited number of samples obtained cannot thoroughly represent the tumor’s heterogeneity (6). Moreover, Ki-67 expression levels can change dynamically during the course of treatment (7), and IHC cannot be used as a routine means of dynamic monitoring. Therefore, it is crucial to find a non-invasive and accurate technique to assess Ki-67 expression before surgery.

During mammography, dense and inhomogeneous mammary glands might cause normal breast tissue to overlap with lesions, leading to decreased sensitivity and specificity in detection (8, 9). Digital breast tomosynthesis (DBT) is an advanced digital mammography technique that utilizes three-dimensional imaging technology to reduce breast tissue overlap, improving lesion visibility (10, 11). It provides increased sensitivity and specificity compared to traditional mammography, enabling the identification of initial, low-grade breast cancer (12). It is gradually becoming the current standard for breast screening and diagnosis.

Radiomics uses advanced data analysis techniques to assess biological indicators of breast cancer non-invasively before surgery (13), it offers significant potential in distinguishing between benign and malignant breast lesions, categorizing and grading breast cancer, and forecasting treatment response and risk of recurrence (14, 15), it also has major potential in evaluating the tumor microenvironment (16). Most current radiomics research to predict high Ki-67 expression relies on single-mode imaging or Magnetic Resonance Imaging (MRI). Nevertheless, the high cost, long examination time, and limited availability have impeded its practical application. Women diagnosed with breast cancer are more inclined to have DBT and US due to their efficiency, affordability, and ease of operation. A recent meta-analysis indicated that combining DBT and US can enhance the diagnosis accuracy of dense breasts (17), potentially serving as an alternative to MRI. Furthermore, a few studies have demonstrated that radiomics based on DBT and US is feasible and repeatable (18), and it has the potential to facilitate precision medicine.

Therefore, the present study attempted to construct DBT, US, and fusion models utilizing the quantitative radiomics features extracted from DBT and US images, and to investigate whether the three models could enhance the diagnostic efficacy of preoperative noninvasive prediction for Ki-67 status. Figure 1 displays the workflow.

Figure 1
www.frontiersin.org

Figure 1 Workflow of predicting the Ki-67 level based on radiomics.

2 Materials and methods

2.1 Patient selection

This retrospective research received approval from the Ethics Committee of the Suzhou Municipal Hospital (No. 2024320), and the need for written informed permission was waived. Patients pathologically diagnosed with breast tumors in the breast cancer screening center from June 2018 to August 2023 were enrolled.

The inclusion standards were (a) patients diagnosed with breast cancer via biopsy or surgical pathology; (b) DBT and US examinations were conducted within 1 week before surgery; (c) immunohistochemical indexes such as Ki-67 were perfected after operation.

The exclusion standards were (a) inadequate quality of DBT and US images for radiomics analysis owing to artifact, calcification, cystic degeneration, etc.; (b) patients had received any form of treatment (including neoadjuvant chemotherapy, radiotherapy, hormone therapy, surgery, and core-needle biopsy) before DBT and US examination; (c) lesions larger than 50 mm (not completely shown on a single plane); (d) patients with other primary tumors.

2.2 Clinical and pathological assessment

Clinical data including age, mass location, and menstrual status were acquired from the hospitalization system of our center, tumor size, calcifications, and burr edges were evaluated from medical images. ER, PR, HER2, and Ki-67 were detected by IHC of the surgical specimens, conducted by two senior attending physicians in the Department of Pathology. ER and PR status were taken for positive when ≥1% of tumor cell nucleus showed ER and PR staining. The diagnostic criterion for positive expression of Ki-67 was the percentage of tumor nuclei in the field of view of the hot spot in the section. According to the expert consensus of the 2020 International Breast Cancer Ki-67 Working Group (19), a cut-off value of 30% was used to classify high and low Ki-67 expression (≤30% as low expression and >30% as high expression). HER2 expression levels of 0 and 1+ were classified as negative, 3+ as positive, and 2+ required additional FISH testing: amplification was deemed positive, while non-amplification was deemed negative.

2.3 Image acquisition

All DBT images were obtained using a Selenia Dimensions system (Hologic, Bedford, MA, USA) on both the craniocaudal (CC) and mediolateral oblique (MLO) views. The DBT volumes were rebuilt with a slice interval of 1 mm and an in-plane pixel size of roughly 100 μm using the filtered back-projection reconstruction method. The scanning angle was 15° ± 7.5°. The US inspections were conducted by two sonographers with over 10 years of experience in breast diagnosis for each patient. Each patient was placed in the supine position, and the Resona R7 or Resona R9 ultrasonic diagnostic apparatus (Mindray, Shenzhen, China) was utilized, with the probe model L14-5WU. The probe frequency was set at 7-10 MHz, and the measurement standard for lesions was based on the 2013 American Co1lege of Radiology (ACR) Breast Imaging Reporting and Data System (BI-RADS) (20), the multifocal mass selected the largest mass for measurement, the gray-scale ultrasound imaging of the largest plane of each breast lesion was selected in all patients. Subsequently, all collected images were stored in digital imaging and communications in medicine (DICOM) format.

2.4 ROIs segmentation

The Regions of Interest (ROIs) were manually drawn layer by layer along the edge of the lumps by two imaging physicians with over 5 and 15 years of experience in breast diagnosis utilizing an open-source imaging platform ITK-SNAP (version 4.0.1, http://www.itk-snap.org), avoiding cystic and necrotic areas as much as possible. Subsequently, 40 image cases were chosen randomly for a consistency assessment.

2.5 Feature extraction

All images were standardized and normalized to reduce variability caused by different machines, with the intensities of the images adjusted to a range of 0-1. Then the open-source pyRadiomics package of Python (version 3.7.6) was used to extract the radiomics features from the CC, MLO images of DBT, and US images respectively. These features mainly encompass first-order features, shape features, texture features, and higher-order texture features (features extracted through transformations such as wavelet and Laplace filter (LoG)). The specific parameters can be referred to https://pyradiomics.readthedocs.io/. Overall, 1427 features were separately acquired from the CC and MLO views of DBT images, whereas 1239 features were obtained from the US images. All features were merged as features of the fusion model.

2.6 Feature selection

The participants were divided into the training and test sets randomly with a ratio of 7:3. The feature screening procedure was applied in the training set to screen the optimal radiomics features in 4 steps. Firstly, the extracted features were standardized by z-score normalization in both the training and the test set separately to provide a uniform standard of feature values and enhance comparability between features, achieving proportional scaling of the original data. Secondly, The Student’s T test was performed on all radiomics features and only features with a p-value <0.05 were considered potentially predictive and retained. Thirdly, RF was used to rank features from high to low. Then, the Spearman correlation coefficient was used to examine the relevance between features. Features with a coefficient > 0.9 were considered highly associated, and the one with a lower RF score was discarded. Finally, The Least Absolute Shrinkage and Selection Operator Method (LASSO) was performed for feature dimensionality reduction to select features further, the optimal tuning parameter λ was selected using 10-fold cross-validation, and features with non-zero regression coefficients were selected from these candidate features (Figure 2).

Figure 2
www.frontiersin.org

Figure 2 LASSO algorithm for radiomics features selection in the training set.

2.7 Model construction

Radiomics models: The features of the three models were input into three commonly used machine learning methods separately after the LASSO algorithm: Random Forest (RF), Logistic Regression (LR), Support Vector Machine (SVM), and the three prediction models (DBT_, US_, and merge_) were constructed using the “scikit-learn” package in Python (version 3.7.6). The predictive performance of the 9 models was presented in a table, and the optimal model was chosen by comparing the AUC values of the training and test sets. For the optimal model selected, the ROC curves of the training and validation sets were summarized to compare the differences in the AUC values of DBT, US, and fusion models. The Decision Curve Analysis (DCA) was used to quantify the net gains at different threshold probabilities in the validation set to evaluate the clinical utility of the three models.

Nomogram model: The output of the classifier with the best AUC value was converted into Rad-Score and was regarded as a Rad-Score model, and then we utilize the univariate and multivariate logistic regression to find the best clinical predictors, p < 0.05 were considered as the risk factors. Subsequently, the Rad-Score and risk predictors were integrated to construct the nomogram model. The model’s stability was evaluated using AUC, calibration curve, and DCA.

2.8 Statistical analysis

Statistical analyses of the data were conducted using the “SciPy. stats” package of Python (version 3.7.6) and graphing were performed using the “matplotlib. pyplot” package. The AUC, 95% confidence interval (95% CI), accuracy, sensitivity, and specificity were utilized to validate the predictive performance of the models. DeLong test was applied to evaluate the differences in AUC values between the models, p < 0.05 (two-tailed) was considered statistically significant.

3 Results

3.1 Clinical characteristics

A total of 149 cases were finally enrolled, consisting of 82 individuals with low Ki-67 level and 67 with high Ki-67 level. All participants were female, aged 30 to 86 years old, with an average of 59.01 ± 11.78 years old. The patients were randomly divided into a training set (n = 104) and a test set (n = 45). Table 1 displays the statistics of clinical features. There were significant differences in tumor size, ER and HER2 status between the high Ki-67 group and the low Ki-67 group in the training set and the test set (p < 0.05). Additionally, PR status differed significantly in the training set (p < 0.05). However, no notable differences were found in age, location of mass, menstrual status, and lymph node metastasis in either the training set or the test set (p > 0.05).

Table 1
www.frontiersin.org

Table 1 Statistical results of the clinical characteristics.

3.2 Radiomics feature selection

Following feature dimensionality reduction and screening based on LASSO, 9 optimal features were selected for the DBT model and 3 for the US model (detailed in Supplementary Tables 1, 2). The fusion model incorporated 8 optimal features, including 1 feature from the US and 7 features from DBT (4 from the MLO view and 3 from the CC perspective) as detailed in Table 2. All of them were texture features after wavelet, LoG, and square transformations, mainly including gray-level co-occurrence matrix (GLCM) and gray-level dependence matrix (GLDM), gray-level size zone matrix (GLSZM), gray-level run length matrix (GLRLM), and neighboring gray-tone difference matrix (NGTDM), with AUC values vary from 0.556 to 0.783. The weight map of the fusion modal features is displayed in Figure 3, of which, wavelet-LHH_glcm_ClusterShade_CC, log-sigma-3-0-mm-3D_glcm_Contrast_MLO, square_gldm_SmallDependenceLowGrayLevelEmphasis_MLO and wavelet-HLL_glszm_SizeZoneNonUniformityNormalized_US were more weighted. In the high Ki-67 group, the mean values of the four features were -0.36 ± 0.59, -0.18 ± 0.79, -0.27 ± 0.75, 0.19 ± 1.22, and 0.11 ± 0.89, 0.19 ± 1.12, 0.43 ± 1.31, -0.38 ± 1.02 in low Ki-67 group (detailed in Supplementary Figure 1).

Table 2
www.frontiersin.org

Table 2 Features for the prediction of the Ki-67 level in merge_SVM.

Figure 3
www.frontiersin.org

Figure 3 Weights of non-zero coefficient features after the LASSO algorithm.

3.3 Assessment of model efficacy

The prediction performance evaluation results for the three models are displayed in Table 3. After comparing the performance of the three classifiers, it was found that the RF classifier was generally less effective, LR had the best prediction results in the test set but poor classification results in the training set, and SVM showed stable performance in both sets. Therefore, SVM was chosen as the preferred method for modeling due to its comprehensive performance. Figure 4 displays the classification outcomes of the three models constructed by the SVM classifier. In the training set, the AUC values for US_SVM and DBT_SVM were 0.726 (95% CI: 0.627-0.825) and 0.856 (95% CI: 0.780-0.931) respectively. In the test set, the AUC values were 0.668 (95% CI: 0.505-0.832) for US_SVM and 0.704 (95% CI: 0.546-0.863) for DBT_SVM. Overall, DBT_SVM had a slightly higher prediction accuracy than US_SVM. The AUC of the merge_SVM model achieved 0.805 (95% CI: 0.719-0.891) in the training group and 0.800 (95% CI: 0.662-0.937) in the validation group. It had the highest AUC value in the test set and the highest accuracy, specificity, and sensitivity in both training and test sets, except for a slightly lower specificity in the training set. The DeLong test indicated that the AUC value of the merge_SVM was significantly different from the US_SVM (p = 0.048) and not substantially different from the DBT_SVM (p = 0.149). The DCA of merge_SVM expressed the best among the three models, indicating that this model had a higher net clinical benefit than the other two models for predicting high Ki-67 expression (Figure 5).

Table 3
www.frontiersin.org

Table 3 Prediction performance of three classifiers.

Figure 4
www.frontiersin.org

Figure 4 Classification results of three SVM models for predicting Ki-67 expression in breast cancer. (A) Comparison results of accuracy, specificity, and sensitivity of the three models. ACC: accuracy, SPE: specificity, and SEN: sensitivity. (B) Summary of ROC curves of the three SVM models for predicting Ki-67 expression in breast cancer.

Figure 5
www.frontiersin.org

Figure 5 Decision curve analysis for the three models. Neither treatment as the baseline (black dashed line), merge_SVM had the highest net benefit compared to DBT_SVM, US_SVM, and both treatments (black solid line). The net benefit refers to the proportion of patients who can be treated more who are truly Ki-67 high-expressing without increasing overtreatment, compared to neither treatment, using a model or criteria to guide whether to treat or not.

3.4 Assessment of the nomogram

According to the performance of the three radiomics models above, merge_SVM was selected as the Rad-Score model (Fusion-Rad model), then established a rad score formula to calculated each patient’s Rad-Score using non-zero features coefficient. Rad-score and clinical characteristics were analyzed utilizing the univariate and multivariate logistic regression, as shown in Table 4. Among them, tumor size and Rad-Score were relevant to Ki-67 expression level (p < 0.001). In multivariate logistic regression, tumor size (OR 0.925, CI 0.849 - 1.002, p < 0.05), and Rad-Score (OR 0.006, CI -3.174-3.186, p < 0.05) were independent risk factors for Ki-67 status in BC. A nomogram was developed by integrating Rad-Score, tumor size using the logistic regression method (Figure 6). The AUC values of the nomogram models were 0.779 in the training set and 0.818 in the test set (Figure 7, Table 5). The calibration curves revealed good predictive accuracy between model-predicted values and actual outcomes (Figure 8). The DCA of the nomogram model performed better than the Fusion-Rad model (Figure 9).

Table 4
www.frontiersin.org

Table 4 Univariate and multivariate logistic regression results of the risk factors of high Ki-67 level in BC.

Figure 6
www.frontiersin.org

Figure 6 The radiomics-clinical nomogram was developed by logistic regression, which includes tumor size and Rad-Score.

Figure 7
www.frontiersin.org

Figure 7 Summary of ROC curves of the Fusion-Rad and Nomogram models for predicting Ki-67 expression in breast cancer. (A) training set; (B) test set.

Table 5
www.frontiersin.org

Table 5 Performance of predicting Ki-67 levels in different models of training groups and validation groups.

Figure 8
www.frontiersin.org

Figure 8 Calibration curves of the nomogram model in the training (A) and test sets (B).

Figure 9
www.frontiersin.org

Figure 9 Decision curve analysis for the Fusion-Rad and Nomogram models in predicting Ki-67 status for breast cancer patients.

4 Discussion

Early prediction of Ki-67 expression level in breast cancer has important clinical significance for breast cancer treatment planning and prognosis (21). Currently, the detection of Ki-67 is mainly based on IHC, which is an invasive procedure that is time-consuming and non-repeatable. A non-invasive preoperative detection method is needed in clinical practice. Multiple studies have proved that radiomics techniques can be used for the prediction of therapy response and prognosis (22), and tumor biomarkers such as Ki-67 (23). In this research, we extracted an amount quantity of radiomics features from DBT and US images, and then constructed and validated the radiomics predictive models of Ki-67 expression levels in unimodal and fusion modalities. Compared with the DBT and US models, the fusion model achieved better classification results in the test set (AUC=0.800). The DCA indicated that the net clinical benefit of merge_SVM was higher than that of DBT_SVM and US_SVM, indicating that the fusion model has higher clinical applicability and can provide a non-invasive prediction method for Ki-67 level in breast cancer.

In this study, we first established prediction models for Ki-67 expression levels in BC based on two clinically common imaging modalities, DBT and US. Our DBT model has an AUC value of 0.704, which is similar to the performance of Tagliafico’s model (AUC = 0.698) (24). Similarly, our US model has an AUC value of 0.668, which is not much different from the results of the US model constructed by Liu et al. (AUC = 0.713) (25). It is essential to mention that the AUC value of the DBT model in this study was 0.036 higher than that of the US. Additionally, the accuracy, specificity, and sensitivity were also higher. This difference may be related to the different imaging modalities used in DBT and US. DBT utilizes three-dimensional imaging, allowing for the capture of crucial details like microcalcification clusters, burr edges, and architectural distortion of the tumor (9). In contrast, 2D US images provide limited information due to the limitation of ultrasound scanning probes. However, the predictive ability of unimodal models is always limited, and it has been suggested that combining multiple imaging modalities can improve the accuracy of Ki-67 prediction (26, 27). The results of this study showed that the fusion model constructed based on US and DBT has an AUC value of 0.800 (accuracy of 80.0%), surpassing the unimodal models in classification. This indicates that the combination of DBT and US can compensate for the shortcomings of the imaging modalities, provide more heterogeneous tumor information, improving the prediction level. It is worth mentioning that the prediction performance of the fusion model constructed in this study for Ki-67 expression level is comparable to that of the MRI multimodal models. Jiang et al. (26) combined the radiomics features of two MRI modalities, dynamic contrast-enhanced (DCE) and diffusion-weighted (DW), and the predicted AUC for Ki-67 status was 0.818. Fan et al. (28) combined precontrast and apparent diffusion coefficient images, the predicted AUC for Ki-67 achieved 0.811. Our results are consistent with the conclusions of a recent meta-analysis (17), that combining DBT and US can enhance the diagnosis accuracy of dense breasts and potentially serve as an alternative to MRI.

The fusion model was effective in enhancing the estimated performance of Ki-67, however, the gain was limited. When comparing the prediction performance of the fusion model with unimodal models, the AUC value of the fusion model improves significantly compared to the US model (DeLong p < 0.05), but shows only a slight improvement compared to the DBT model (DeLong p > 0.05). The reason for such a result may be due to the deviation in the proportion of the two unimodal feature numbers incorporated in the fusion model, with 7 DBT image features but only 1 US image feature. Additionally, DBT contains images at both CC and MLO views, doubling the number of features and providing more tumor information. Therefore, this study concluded that the fusion model could enhance the accuracy of predicting Ki-67. However, if resources are limited and patients cannot take both imaging examinations at the same time, it is preferable to choose DBT as a preoperative assessment modality.

In this study, the eight radiomics features incorporated in the fusion model were all transformed higher-order texture features, including 2 GLCM, 3 GLDM, 1 GLRLM, 1 GLSZM, and 1 NGTDM features. These features can characterize and quantify the texture attributes and complexity in the whole region, suggesting that texture complexity is associated with the Ki-67 status of breast cancer, which can serve as an important predictor of Ki-67 expression levels. This discovery is similar to previous research that has emphasized the importance of texture features (24, 2931). The GLCM features quantify the complexity of voxel intensities in the tumor region (32). Among GLCM features, Cluster Shade is a measure of skewness and homogeneity. Li et al. (33) demonstrated that the Cluster Shade value was higher in the cervical cancer Ki-67 high expression group compared to the low expression group. In this study, the absolute value of GLCM_ClusterShade was higher in the Ki-67 high expression group than the low expression group, indicating that the breast cancer Ki-67 high expression group may have rougher image texture, higher tumor heterogeneity, and more invasiveness. The GLDM features indicate the degree of roughness of the image texture (32). Petrillo et al. (34) found that these features are effective in distinguishing between benign and malignant breast lesions and in identifying the HER2 status. In the present study, three GLDM features showed better performance in classifying Ki-67 levels (with accuracies of 62.2%, 62.2%, and 66.7%, respectively), indicating that GLDM also can identify Ki-67 status. Texture features in US images can be applied to predict Ki-67 status (25, 31) and lymph node metastasis in breast carcinoma (35, 36). Previous research focused on the importance of GLCM, while the present study found that the GLSZM features in US images also have significant predictive value (accuracy of 66.0%). The GLSZM features quantify the grey-scale changes of the connected regions at the edge of the image, reflecting the clarity of the tumor edge. Niu et al. (37) found that the GLSZM values of benign lesions were higher than that of malignant lesions, which proved that the edges of benign lesions were clearer than those of malignant ones. This study found that the absolute value of the GLSZM features was higher in the low Ki-67 group compared to the high Ki-67 group, suggesting that the tumor margins may be more distinct in BC patients with low Ki-67 expression level.

In this study, ER and HER2 status had intergroup differences (p < 0.05), which suggests that Ki-67 values may be higher in patients with ER and HER2 positivity. According to the results of the univariate and multivariate logistic regression, the size of tumors and Rad-Score could be considered as independent risk factors to predict high Ki-67 level in BC. In order to facilitate the potential utilization of radiomics methods by clinical doctors, we developed a nomogram model incorporating Rad-Score and clinical independent predictors (tumor size), which achieved the best predictive performance (AUC=0.818). The DCA shows that the nomogram has better clinical applicability as a predictor of Ki-67 expression. To utilize our nomogram, clinicians should delineate the ROIs in DBT and US images to acquire Rad-Score, then calculate the probability of high Ki-67 status via the Rad-Score and the value of tumor size. Afterwards, clinicians can combine these probabilities with the patient’s other clinical characteristics to make a comprehensive assessment.

There exist some limitations. (a) This research is a single-center retrospective study with a small sample capacity, which makes it difficult to avoid selective bias. To confirm the accuracy of the models, it is necessary to have more cases of illness and multi-center data. (b) The US features were extracted from s single image of the largest cross-section of the tumor, which may result in the missing of crucial information about tumor heterogeneity. (c) The features we obtained after the screening were all texture-related features, ignoring features such as shape features and first-order features, probably because the difference in those features between the two groups was small or the weight of the features was low. In future research, we will improve the feature selection method to explore the clinical significance of these features. (d) Manual outlining of the ROI not only increases the time needed but also the inconsistency between different radiologists. In future research, deep learning will be explored as a means to segment the ROIs to substitute manual outlining automatically. Additionally, the potential of deep learning to enhance the accuracy of classification will be investigated.

In conclusion, the fusion model developed based on radiomics features of DBT and US images is superior to the unimodal models, which might assist in predicting the Ki-67 expression level of BC patients and provide individualized precision treatment. However, the gain effect of the fusion model is limited, so it is recommended that DBT be preferred as a preoperative diagnostic modality when resources are restricted. Otherwise, the nomogram offers predictive advantages over other methods and can be a valuable tool for predicting Ki-67 levels in breast cancer.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Ethics Committee of the Nanjing Medical University Affiliated Suzhou Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

JL: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Writing – original draft, Writing – review & editing. CY: Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Writing – original draft, Writing – review & editing. JG: Conceptualization, Resources, Supervision, Writing – review & editing. CL: Data curation, Formal analysis, Supervision, Validation, Writing – original draft. YW: Data curation, Formal analysis, Validation, Writing – original draft. QC: Data curation, Formal analysis, Validation, Writing – original draft. YC: Funding acquisition, Supervision, Writing – review & editing. SC: Conceptualization, Funding acquisition, Resources, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported by National Nature Science Foundation of China under Grants (62371449), Suzhou Medical Application Basic Research Project (SKYD2023038, SKY2023198).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1403522/full#supplementary-material

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA-Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Yerushalmi R, Woods R, Ravdin PM, Hayes MM, Gelmon KA. Ki67 in breast cancer: prognostic and predictive potential. Lancet Oncol. (2010) 11:174–83. doi: 10.1016/S1470-2045(09)70262-1

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Soliman NA, Yussif SM. Ki-67 as a prognostic marker according to breast cancer molecular subtype. Cancer Biol Med. (2016) 13:496–504. doi: 10.20892/j.issn.2095-3941.2016.0066

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Healey MA, Hirko KA, Beck AH, Collins LC, Schnitt SJ, Eliassen AH, et al. Assessment of Ki67 expression for breast cancer subtype classification and prognosis in the Nurses' Health Study. Breast Cancer Res Treat. (2017) 166:613–22. doi: 10.1007/s10549-017-4421-3

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Zhang AL, Wang XJ, Fan CF, Mao XY. The role of Ki67 in evaluating neoadjuvant endocrine therapy of hormone receptor-positive breast cancer. Front Endocrinol. (2021) 12:687244. doi: 10.3389/fendo.2021.687244

CrossRef Full Text | Google Scholar

6. Kim HS, Park S, Koo JS, Kim S, Kim JY, Nam S, et al. Risk factors associated with discordant Ki-67 levels between preoperative biopsy and postoperative surgical specimens in breast cancers. PLoS One. (2016) 11:10. doi: 10.1371/journal.pone.0151054

CrossRef Full Text | Google Scholar

7. Ellis MJ, Suman VJ, Hoog J, Goncalves R, Sanati S, Creighton CJ, et al. Ki67 proliferation index as a tool for chemotherapy decisions during and after neoadjuvant aromatase inhibitor treatment of breast cancer: Results from the American college of surgeons oncology group Z1031 trial (Alliance). J Clin Oncol. (2017) 35:1061–9. doi: 10.1200/JCO.2016.69.4406

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Michell MJ. Breast screening review-a radiologist's perspective. Br J Radiol. (2012) 85:845–7. doi: 10.1259/bjr/21332901

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Levesque PH, Hooley RJ. Malignant findings. In: Philpotts LE, Hooley RJ, editors. Breast Tomosynthesis. Philadelphia, Pennsylvania: Elsevier Health Sciences (2018). p. 88–99.

Google Scholar

10. Chong A, Weinstein SP, McDonald ES, Conant EF. Digital breast tomosynthesis: concepts and clinical practice. Radiology. (2019) 292:1–14. doi: 10.1148/radiol.2019180760

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Mandoul C, Verheyden C, Millet I, Orliac C, Pages E, Thomassin I, et al. Breast tomosynthesis: What do we know and where do we stand? Diagn Interv Imaging. (2019) 100:537–51. doi: 10.1016/j.diii.2019.07.012

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Skaane P, Bandos AI, Niklason LT, Sebuødegård S, Østerås BH, Gullien R, et al. Digital mammography versus digital mammography plus tomosynthesis in breast cancer screening: the oslo tomosynthesis screening trial. Radiology. (2019) 291:23–30. doi: 10.1148/radiol.2019182394

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout R, Granton P, et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur J Cancer. (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Conti A, Duggento A, Indovina I, Guerrisi M, Toschi N. Radiomics in breast cancer classification and prediction. Semin Cancer Biol. (2021) 72:238–50. doi: 10.1016/j.semcancer.2020.04.002

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Tian J, Dong D, Liu Z, Wei J. Precision diagnosis based on radiomics. In: Tian J, Dong D, Liu Z, Wei J, editors. Radiomics and Its Clinical Application. Academic Press (2021). p. 99–174.

Google Scholar

16. Xue C, Zhou Q, Xi H, Zhou J. Radiomics: A review of current applications and possibilities in the assessment of tumor microenvironment. Diagn Interv Imaging. (2023) 104:113–22. doi: 10.1016/j.diii.2022.10.008

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Hussein H, Abbas E, Keshavarzi S, Fazelzad R, Bukhanov K, Kulkarni S, et al. Supplemental breast cancer screening in women with dense breasts and negative mammography: A systematic review and meta-analysis. Radiology. (2023) 306:e221785. doi: 10.1148/radiol.221785

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Duron L, Savatovsky J, Fournier L, Lecler A. Can we use radiomics in ultrasound imaging? Impact of preprocessing on feature repeatability. Diagn Interv Imaging. (2021) 102:659–67. doi: 10.1016/j.diii.2021.10.004

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Liu YP. Interpretation of Ki-67 assessment update of International Ki-67 in Breast Cancer Working Group. Chin J Pathol. (2021) 50:704–9. doi: 10.3760/cma.j.cn112151-20210303-00179

CrossRef Full Text | Google Scholar

20. Spak DA, Plaxco JS, Santiago L, Dryden MJ, Dogan BE. BI-RADS® fifth edition: A summary of changes. Diagn Interv Imaging. (2017) 98:179–90. doi: 10.1016/j.diii.2017.01.001

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Lashen AG, Toss MS, Ghannam SF, Makhlouf S, Green A, Mongan NP, et al. Expression, assessment and significance of Ki67 expression in breast cancer: an update. J Clin Pathol. (2023) 76:357–64. doi: 10.1136/jcp-2022-208731

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Chen Q, Zhang L, Liu S, You J, Chen L, Jin Z, et al. Radiomics in precision medicine for gastric cancer: opportunities and challenges. Eur Radiol. (2022) 32:5852–68. doi: 10.1007/s00330-022-08704-8

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Tagliafico AS, Piana M, Schenone D, Lai R, Massone AM, Houssami N. Overview of radiomics in breast cancer diagnosis and prognostication. Breast. (2020) 49:74–80. doi: 10.1016/j.breast.2019.10.018

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Tagliafico AS, Bignotti B, Rossi F, Matos J, Calabrese M, Valdora F, et al. Breast cancer Ki-67 expression prediction by digital breast tomosynthesis radiomics features. Eur Radiol Exp. (2019) 3:36. doi: 10.1186/s41747-019-0117-2

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Liu JJ, Wang XC, Hu MS, Zheng Y, Zhu L, Wang W, et al. Development of an ultrasound-based radiomics nomogram to preoperatively predict Ki-67 expression level in patients with breast cancer. Front Oncol. (2022) 12:963925. doi: 10.3389/fonc.2022.963925

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Jiang T, Song JD, Wang XY, Niu SX, Zhao NN, Dong Y, et al. Intratumoral and peritumoral analysis of mammography, tomosynthesis, and multiparametric MRI for predicting Ki-67 level in breast cancer: a radiomics-based study. Mol Imaging Biol. (2022) 24:550–9. doi: 10.1007/s11307-021-01695-w

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Wang FQ, Wang S, Mao XJ, Ding XM, Xue LY, Zhang J, et al. Study on the prediction of Ki-67 expression of breast carcinoma using intratumoral and peritumoral radiomics based on multi-modal MRI by machine learning. J Pract Radiol. (2023) 39:1606–10. doi: 10.3969/j.issn.1002-1671.2023.10.010

CrossRef Full Text | Google Scholar

28. Fan M, Yuan W, Zhao W, Xu M, Wang S, Gao X, et al. Joint prediction of breast cancer histological grade and Ki-67 expression level based on DCE-MRI and DWI radiomics. IEEE J BioMed Health Inform. (2020) 24:1632–42. doi: 10.1109/JBHI.6221020

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Jiang T, Jiang WY, Chang SJ, Wang HB, Niu SX, Yue ZB, et al. Intratumoral analysis of digital breast tomosynthesis for predicting the Ki-67 level in breast cancer: A multi-center radiomics study. Med Phys. (2022) 49:219–30. doi: 10.1002/mp.15392

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Juan MW, Yu J, Peng GX, Jun LJ, Feng SP, Fang LP. Correlation between DCE-MRI radiomics features and Ki-67 expression in invasive breast cancer. Oncol Lett. (2018) 16:5084–90. doi: 10.3892/ol.2018.9271

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Wu JF, Fang QQ, Yao JC, Ge LF, Hu LY, Wang ZP, et al. Integration of ultrasound radiomics features and clinical factors: A nomogram model for identifying the Ki-67 status in patients with breast carcinoma. Front Oncol. (2022) 12:979358. doi: 10.3389/fonc.2022.979358

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Zwanenburg A, Vallières M, Abdalah MA, Aerts H, Andrearczyk V, Apte A, et al. The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology. (2020) 295:328–38. doi: 10.1148/radiol.2020191145

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Li CP, Zheng MX, Dong JN, Wang CB, Wang HF, Wei C, et al. Prediction of Ki-67 proliferation index in cervical squamous cell carcinoma based on MR T2 WI texture analysis:a preliminary study. J Pract Radiol. (2022) 38:786–90. doi: 10.3969/j.issn.1002-1671.2022.05.021

CrossRef Full Text | Google Scholar

34. Petrillo A, Fusco R, Di Bernardo E, Petrosino T, Barretta ML, Porto A, et al. Prediction of breast cancer histological outcome by radiomics and artificial intelligence analysis in contrast-enhanced mammography. Cancers (Basel). (2022) 14:2132. doi: 10.3390/cancers14092132

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Qiu X, Fu Y, Ye Y, Wang Z, Cao C. A nomogram based on molecular biomarkers and radiomics to predict lymph node metastasis in breast cancer. Front Oncol. (2022) 12:790076. doi: 10.3389/fonc.2022.790076

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Zhou WJ, Zhang YD, Kong WT, Zhang CX, Zhang B. Preoperative prediction of axillary lymph node metastasis in patients with breast cancer based on radiomics of gray-scale ultrasonography. Gland Surg. (2021) 10:1989–2001. doi: 10.21037/gs-21-315

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Niu SX, Yu T, Cao Y, Dong Y, Luo YH, Jiang XR. Digital breast tomosynthesis-based peritumoral radiomics approaches in the differentiation of benign and Malignant breast lesions. Diagn Interv Radiol. (2022) 28:217–25. doi: 10.5152/dir.2022.20664

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: breast cancer, digital breast tomosynthesis, Ki-67 level, radiomics, ultrasound

Citation: Liu J, Yan C, Liu C, Wang Y, Chen Q, Chen Y, Guo J and Chen S (2024) Predicting Ki-67 expression levels in breast cancer using radiomics-based approaches on digital breast tomosynthesis and ultrasound. Front. Oncol. 14:1403522. doi: 10.3389/fonc.2024.1403522

Received: 19 March 2024; Accepted: 26 June 2024;
Published: 11 July 2024.

Edited by:

Sharon R. Pine, University of Colorado, United States

Reviewed by:

Roxana Pintican, University of Medicine and Pharmacy Iuliu Hatieganu, Romania
Zilong He, Southern Medical University, China

Copyright © 2024 Liu, Yan, Liu, Wang, Chen, Chen, Guo and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shuangqing Chen, shuangqingchen@163.com; Jianfeng Guo, jfguo@126.com

These authors contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.