- 1Third Hospital of Shanxi Medical University, Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Tongji Shanxi Hospital, Taiyuan, China
- 2Department of Urology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
- 3Department of Urology, Xinzhou People’s Hospital, Xinzhou, China
- 4Department of Urology, Datong Fifth People’s Hospital, Datong, China
- 5Department of Urology, Handan First Hospital, Handan, China
- 6Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Tongji Shanxi Hospital, Third Hospital of Shanxi Medical University, Taiyuan, China
- 7Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
Objective: To explore the value of combined radiomics and deep learning models using different machine learning algorithms based on mammography (MG) and magnetic resonance imaging (MRI) for predicting axillary lymph node metastasis (ALNM) in breast cancer (BC). The objective is to provide guidance for developing scientifically individualized treatment plans, assessing prognosis, and planning preoperative interventions.
Methods: A retrospective analysis was conducted on clinical and imaging data from 270 patients with BC confirmed by surgical pathology at the Third Hospital of Shanxi Medical University between November 2022 and April 2024. Multiple sequence images from MG and MRI were selected, and regions of interest in the lesions were delineated. Radiomics and deep learning (3D-Resnet18) features were extracted and fused. The samples were randomly divided into training and test sets in a 7:3 ratio. Dimensionality reduction and feature selection were performed using the least absolute shrinkage and selection operator (LASSO) regression model, and other methods. Various machine learning algorithms were used to construct radiomics, deep learning, and combined models. These models were visualized and evaluated for performance using receiver operating characteristic curves, area under the curve (AUC), calibration curves, and decision curves.
Results: The highest AUCs in the test set were achieved using radiomics-logistic regression (AUC = 0.759), deep learning-multilayer perceptron (MLP) (AUC = 0.712), and combined-MLP models (AUC = 0.846). The MLP model demonstrated strong classification performance, with the combined model (AUC = 0.846) outperforming both the radiomics (AUC = 0.756) and deep learning (AUC = 0.712) models.
Conclusion: The multimodal radiomics and deep learning models developed in this study, incorporating various machine learning algorithms, offer significant value for the preoperative prediction of ALNM in BC.
1 Introduction
Breast cancer (BC) has surpassed lung cancer as the most common cancer among women worldwide and is the leading cause of cancer-related deaths in women globally (1). Prognosis and treatment strategies for BC vary depending on the molecular subtype, clinical stage, and histological grading (2). Axillary lymph node metastasis (ALNM) plays a critical role in BC clinical staging. Studies have shown that patients with BC and ALNM have a 14% lower 5-year survival rate compared to those without metastasis (3, 4). Preoperative adjuvant therapy or axillary lymph node dissection (ALND) can improve patient survival (5, 6). Early prediction of ALNM can help clinicians assess prognosis and develop individualized treatment plans for patients with BC. However, imaging assessments in clinical practice are often subjective, requiring fine-needle aspiration or postoperative histopathological biopsy for accurate results. Therefore, there is a need for accurate and noninvasive methods to evaluate ALNM.
Radiomics can be used to noninvasively extract high-dimensional data from images, revealing tumor-rated characteristics that are invisible to the naked eye. This approach can potentially provide more accurate interpretations of results, prognoses, and treatment predictions (7–9). Deep learning, a new field in artificial intelligence (AI) and machine learning, involves multilayer neural networks that mimic the architecture of the human brain. Convolutional neural networks, a highly effective deep learning method, are widely used in medical image analysis (10). BC imaging typically involves mammography (MG) and magnetic resonance imaging (MRI), each with strengths and weaknesses in distinguishing calcifications from soft tissues. Research indicates that combining both methods provides more comprehensive information (11, 12).
Previous studies have mostly relied on single MG or MRI images and individual machine-learning algorithms to construct radiomics or deep-learning models, which limits tumor information and hinders the identification of the most effective models. This study investigated the performance of multimodal radiomics combined with deep learning models, utilizing various machine learning algorithms based on MG and MRI, for the preoperative prediction of ALNM in patients with BC. The objective was to guide preoperative intervention, prognosis assessment, and the development of scientifically individualized treatment plans.
2 Materials and methods
2.1 Study cohort
A retrospective analysis was conducted on the clinical and radiological data of patients with BC confirmed by surgical pathology at the Third Hospital of Shanxi Medical University between November 2022 and April 2024. The inclusion criteria were as follows:
1. Underwent breast MRI and MG examinations with clear and complete images performed before biopsy, surgical treatment, or chemoradiotherapy.
2. An interval of ≤ 1 week between MG and MRI examinations.
3. BC confirmed through pathological results, with the axillary lymph node (ALN) status determined by ALND, fine-needle aspiration, or sentinel lymph node biopsy (SLNB), some of which were conducted prior to neoadjuvant therapy.
The exclusion criteria were as follows:
1. Primary breast tumor lesions that were too small lesions or had unclear boundaries, making it difficult to delineate the region of interest (ROI).
2. Occult, recurrent, or distant metastatic BC.
3. A history of other malignancies.
4. Receiving neoadjuvant therapy before the pathological evaluation of lymph nodes and imaging assessment.
The final analysis included 270 female patients. All samples were randomly divided into training and testing sets in a 7:3 ratio. Statistical analysis was conducted to verify that the baseline data were balanced between the groups. The ethics committee of the Third Hospital of Shanxi Medical University approved the study and waived the requirement for informed consent. The patient inclusion flowchart is presented in Figure 1.
2.2 Equipment and methods
2.2.1 Equipment and image acquisition
MG was performed using the Hologic Selenia Dimensions digital mammography system for craniocaudal (CC) and mediolateral oblique (MLO) views of both breasts. The scanning sequence included the CC view of the affected side, the CC view unaffected side, the MLO view of the affected side, and the MLO view of the unaffected. Additional lateral views, spot compression views, and tomosynthesis were performed as needed.
MRI examinations were performed using a 3.0T MR scanner (Skyra, Siemens, Germany) equipped with an 18-channel dedicated coil. An apparent diffusion coefficient sequence was generated from two different b-value diffusion-weighted imaging (DWI) sequences. After mask scanning, Gd-DTPA (Gadopentetate Dimeglumine, Bayer, Germany) was injected into the antecubital vein at a rate of 2.5 mL/s using a power injector, at a dose of 0.2 mL/kg body weight, followed by a 20 mL saline flush for dynamic contrast-enhanced (DCE)-MRI. Enhanced images were continuously acquired over six phases after contrast injection. Detailed parameters are listed in Supplementary Table 1.
2.2.2 Image preprocessing, segmentation, and feature extraction
For image preprocessing, the “N4BiasFieldCorrection” function from the “ANTsPy” package was first used to correct bias in MRI images, eliminating signal distortions caused by inhomogeneous magnetic fields and enhancing image uniformity and contrast. After that, the normalization function from the “nibabel” package was applied to standardize the MRI grayscale values, ensuring consistency across different images. All images were then resampled to a fixed resolution of 1×1×1 mm voxel size to standardize voxel spacing for spatial normalization.
After preprocessing, two experienced radiologists, blinded to the patients’ pathological results, used ITK-SNAP 4.2.0 software to delineate ROIs of the same lesions in the CC and MLO views of MG images as well as in T2-weighted imaging, DWI (b value = 800), apparent diffusion coefficient, and the second phase of DCE-MRI sequences. In cases with multiple lesions, the largest single lesion was selected. Radiologist A initially delineated the lesions for all patients, while Radiologist B randomly selected and delineated the lesions of 30 patients. Radiologist A repeated the delineation for these 30 patients 2 weeks later to test feature consistency. Any discrepancies were resolved by consensus.
Using the “Pyradiomics” package in Python 3.7, radiomics features were extracted from the delineated ROIs, including shape features, first-order statistics, texture features (second-order statistics) [Gray-Level Co-occurrence Matrix, Gray-Level Run Length Matrix, Gray-Level Dependence Matrix], higher-order statistics, frequency-domain features, filter-based features, and Local Binary Patterns features. Radiomics features from the different sequences were combined for subsequent screening. Deep learning features were extracted using a pretrained 3D ResNet model, with the input layer modified to accept a single-channel input and the final fully connected layer removed. The extracted deep-learning features were then combined across sequences. The combined model features were formed by integrating the deep learning features with the radiomic features that passed consistency testing. Radiomic, deep learning, and combined model features were independently analyzed in subsequent steps.
2.2.3 Feature selection, model building, and evaluation
Radiomics features extracted from ROIs outlined by different doctors were assessed using a bidirectional consistency model to calculate the intraclass correlation coefficient (ICC), retaining features with an ICC > 0.9. Since deep-learning features are high-dimensional numerical vectors rather than manually defined, they did not require consistency testing.
A synthetic minority oversampling technique was used to balance the class distribution in the dataset by interpolating new minority class samples in the feature space (k = 5).
Z-score normalization was applied to the features to ensure that they were analyzed on the same scale. The mean and standard deviation of each feature were calculated from the training set, and these values were used to standardize both the training and validation sets to prevent data leakage.
After that, univariate analysis was performed to filter features. For normally distributed features with equal variance, independent sample t-tests were used; for normally distributed features with unequal variance, corrected t-test methods such as the Cochran & Cox, Satterthwaite, and Welch methods were applied; and for non-normally distributed features, the Mann-Whitney U test was employed. Features with p-values < 0.05 were retained.
Based on the univariate analysis results, Spearman correlation analysis was performed, and features with a correlation > 0.9 were randomly removed, repeating this process until all feature correlations were ≤ 0.9.
The Max-Relevance and min-redundancy method was applied to identify the features most relevant to the final output while minimizing redundancy among them.
Finally, LASSO regression was used for feature dimensionality reduction, with 20 repetitions of 10-fold cross-validation using different random seeds employed to select the feature subset corresponding to the optimal regularization parameter λ with the minimum mean squared error (MSE).
Using 20 repetitions of 5-fold cross-validation using different random seeds, the final selected radiomics and deep learning features were used to construct models with eight machine learning algorithms: logistic regression (LR), naive Bayes, support vector machine (SVM), k-nearest neighbors (KNN), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), adaptive boosting (AdaBoost), and multilayer perceptron (MLP).
Receiver operating characteristic (ROC) curves were plotted for all the models, and metrics such as area under the curve (AUC), diagnostic accuracy, sensitivity, specificity, positive predictive value, negative predictive value, precision, recall, F1 score, and threshold were calculated. The best-performing algorithm model was selected, and waterfall plots were used to display the distribution of the model’s prediction probabilities. The DeLong test, net reclassification improvement index (NRI), and integrated discrimination improvement index (IDI) were used to compare model performance. Calibration curves (CAL) and decision curve analysis (DCA) were used to assess model fit and clinical benefits. The research workflow is illustrated in Figure 2.
2.2.4 Statistical analysis
Statistical analyses were performed using R version 4.4.1. Continuous variables were presented as “mean ± standard deviation.” Independent-sample t-tests were used for normally distributed continuous variables; in contrast, the Mann–Whitney U test was used for non-normally distributed continuous variables. Categorical variables are expressed as frequencies and percentages, and comparisons were made using the chi-square test, continuity correction test, and Fisher’s exact test. Statistical significance was set at p < 0.05.
3 Results
3.1 Clinical characteristics
This study included 270 patients with BC: 117 (43.3%) with ALNM and 153 (56.7%) without ALNM. All patients were randomly divided into training (n = 189) and test (n = 81) sets in a 7:3 ratio. The training set comprised 80 (29.6%) patients with ALNM and 109 (40.4%) patients without ALNM; in contrast, the test set included 39 (14.4%) patients with ALNM and 42 (15.6%) patients without ALNM. Patient characteristics and comparisons between and within groups are presented in Table 1.
3.2 Feature selection and machine learning model construction
Table 2 lists the number of features retained at each step of the feature selection process and the corresponding figure numbers for visualizations. The feature selection process using LASSO regression is shown in Figure 3, while the final selected features and their coefficients are listed in Supplementary Table 2.
Figure 3. (A, D, G) Coefficients of LASSO 10 fold cross validation. (B, E, H) MSE of LASSO 10 fold cross validation.The feature subset corresponding to the λ value with the minimum MSE was retained. (C, F, I) Weights of features in the final subset. (A–C) Radiomics features. (D–F) Deep learning features. (G–I) Combined features.
3.3 Model performance evaluation and visualization
The diagnostic performance is presented in Supplementary Table 3, and the ROC curves are shown in Figure 4, indicating that all models performed well in both the test and training cohorts. Among all machine learning algorithms, the AUCs of the combined models were improved compared to those of the radiomics or deep learning models alone. The classifers with the highest perfomance in the test set for the three feature subset models were as follows:
Figure 4. ROC for all models. (A, B) Radiomics models. (C, D) Deep learning models. (E, F) Combined models. (A, C, E) Training set. (B, D, F) Test set.
Accuracy: Radiomics-MLP (0.753), Deep Learning-MLP (0.691), Combined-LR (0.778);
AUC: Radiomics-LR (0.759), Deep Learning-MLP (0.712), and Combined-MLP (0.846);
F1 score: Radiomics-MLP (0.730), Deep Learning-MLP (0.684), Combined-MLP (0.782);
Sensitivity: Radiomics-XGBoost (0.861), Deep Learning-SVM (0.833), Combined-MLP (0.944);
Specificity: Radiomics-KNN (0.867), Deep Learning-LightGBM (0.889), Combined-KNN (0.800);
PPV: Radiomics-SVM (0.727), Deep Learning-KNN (0.800), Combined-KNN (0.700);
NPV: Radiomics-AdaBoost (0.811), Deep Learning-MLP (0.763), Combined-SVM (0.963).
The MLP model demonstrated superior overall performance across the three different feature subset models, as detailed below:
Radiomics-MLP: AUC = 0.756, Accuracy = 0.753, F1 Score = 0.730;
Deep Learning-MLP: AUC = 0.712, Accuracy = 0.691, F1 Score = 0.684;
Combined-MLP: AUC = 0.846, Accuracy = 0.765, F1 Score = 0.782.
Therefore, this algorithm was selected for further interpretation of the study results.
The ROC, DCA, and CAL of the MLP model are shown in Supplementary Figure 5, the waterfall plot in Supplementary Figure 6, and the DeLong test, NRI, and IDI plots in Supplementary Figure 7. The test set results indicate that the combined model showed a statistically significant difference over single-modality models according to the DeLong test (P=0.043 for Combined-MLP vs. Radiomics-MLP; P=0.038 for Combined-MLP vs. Deep Learning-MLP). Additionally, NRI and IDI metrics showed improvements (NRI=0.074 and IDI=0.126 for Combined-MLP vs. Radiomics-MLP; NRI=0.213 and IDI=0.147 for Combined-MLP vs. Deep Learning-MLP). The ROC curve of the MLP model demonstrated excellent classification performance, with the AUC of the combined model in the test set (AUC = 0.846) showing an improvement compared with the radiomics (AUC = 0.756) and deep learning models (AUC = 0.712).
4 Discussion
The ALN is the most common site of metastasis in patients with BC. Determining ALNM is crucial for clinical staging and prognosis assessment because it helps in planning surgical and adjuvant treatment strategies (13). For example, depending on the metastasis, patients may need to receive neoadjuvant chemotherapy to reduce the size of the tumors and metastatic lymph nodes, increasing the success rate of surgery and the possibility of breast-conserving surgery. It can also assist radiologists in precisely targeting the radiation therapy area, thereby improving treatment efficacy and minimizing damage to normal tissues (14, 15). Imaging assessments of ALNM rely heavily on the subjective experience of physicians. SLNB and ALND can improve patient survival. However, these two methods have drawbacks. SLNB shows a high negative rate of 70–80%, indicating unnecessary procedures (16). ALND can lead to complications such as arm numbness and upper limb lymphedema (17). Therefore, we aimed to predict ALNM preoperatively to guide clinicians in preoperative intervention, prognostic assessment, and formulation of scientifically individualized treatment plans for BC.
In recent years, the field of AI has experienced unprecedented growth, significantly affecting various domains, including healthcare. Among the numerous applications, AI has shown remarkable potential for the early detection of cancer, prediction of treatment response, and prognosis assessment through advanced techniques, such as radiomics and deep learning (9, 10). This new technology offers accurate and noninvasive methods for predicting the pathological indicators of BC mentioned above. MG is a commonly used screening method for BC, with a sensitivity of approximately 75%, and is capable of detecting microcalcifications. However, its sensitivity decreases in women with dense breast tissue. In two-dimensional imaging, the overlap of normal fibroglandular tissue can mimic malignant features, leading to a higher false positive rate (18). MRI is suitable for screening high-risk patients with BC, with a sensitivity as high as 92%, and provides more detailed anatomical information that helps assess the extent and invasiveness of tumors. However, calcifications may not produce strong signals, making it difficult to distinguish them from the surrounding tissues. Additionally, MRI is sensitive to patient movement and metal implants during imaging, which can lead to artifacts that affect the image quality (19). While radiologists are highly skilled in interpreting medical images, their assessments are inherently subjective and can be influenced by factors such as experience, fatigue, and cognitive bias. This subjectivity can sometimes lead to variability in diagnosis and assessment. In contrast, radiomics and deep learning models offer a more objective and standardized approach to image analysis (20). Numerous studies have demonstrated that radiomics or deep learning models based on MG or MRI perform well in predicting BC risk, ALNM, histological grading, and response to neoadjuvant therapy, as well as in differentiating between benign and malignant breast masses and molecular subtypes of BC (21–27). Recent studies by Yuchen and Ma et al. found that radiomics models constructed by combining MG and MRI outperformed those based on a single sequence (11, 12). Additionally, research by Manon and Xue et al. demonstrated that models combining radiomics with deep learning outperform models using either method alone (28, 29). Inspired by the study mentioned above, we combined MG with MRI, radiomics, and deep learning to construct models using various machine-learning algorithms.
ResNet, proposed by He et al. in 2015, addresses the common issues of vanishing and exploding gradients in deep neural network training (30). This network architecture introduces a residual learning framework that enables the training of considerably deeper networks (31, 32). We utilize ResNet-18, a shallower version in the ResNet family. It has fewer parameters and lower computational complexity but still provides efficient feature extraction capabilities. It is widely used in various computer vision tasks.
Previous studies have established radiomics models to predict ALNM by delineating ROIs in the ALN region (3, 33). Our study demonstrated that models predicting ALNM based on ROIs drawn around breast tumors exhibit strong performance. This suggests that the imaging features of the primary tumor not only reflect the internal structure of the tumor but may also reveal insights into the tumor’s biological behavior, aggressiveness, and interactions with lymph nodes.
In the final selected feature subset, deep learning features did not include those from the MG sequence because MG sequence features were excluded owing to their high correlation with features from other sequences. Several combined features overlapped with the selected radiomics and deep learning features. We also attempted other methods to construct combined models; however, their performances did not match the approach used in this study. The specific methods and possible reasons are as follows: when combining independently selected radiomics and deep learning features, the interactions between features were not considered, leading to the exclusion of weak features that could play an important role in the combined feature set, ultimately resulting in a combined feature set that fails to capture the complementary information between the two types of features. Using the predicted probability values from radiomics and deep learning models as new features or taking their mean while simplifying the model complexity may result in information loss. This is because probability values typically reflect a model’s overall confidence in a specific sample rather than fully representing the complex relationships among all potential features (34, 35).
When comparing baseline data between the ALNM and non-ALNM groups, we found significant statistical differences in features such as MG/MRI ALNM and MG/MRI BI-RADS. However, after further selection of these clinical features and combining them with radiomics or deep learning models, the diagnostic performance in the test set did not show significant improvement and even decreased. We speculate that this may be due to the high correlation between clinical features and radiomics or deep learning features, leading to information redundancy when combined, which did not provide additional useful information to the model but instead increased its complexity. Additionally, our relatively small sample size may not represent the characteristics of the entire patient population, leading to a higher risk of overfitting (36, 37).
The MLP is a type of feedforward artificial neural network, composed of multiple layers of perceptrons, capable of handling highly nonlinear and complex problems through multiple hidden layers and nonlinear activation functions. Rosenblatt first proposed the perceptron model in the 1960s, and MLP is a multilayer extension. In this study, the combined model built with the MLP algorithm (AUC=0.846, Accuracy=0.765) outperformed the radiomics model (AUC=0.756, Accuracy=0.753) and the deep learning model (AUC=0.712, Accuracy=0.691). It was also more accurate than subjective assessments by radiologists (Accuracy for MG=0.637, Accuracy for MRI=0.689) and exceeded the performance of the model constructed by Hua, Y. et al., which combined MG and MRI but extracted only radiomics features (AUC=0.793, Accuracy=0.750) (11). The DCA indicates that all MLP models have higher net benefits than traditional decision strategies. CAL showed a good fit between the model’s predicted probabilities and actual outcomes. The waterfall plot visually illustrates the changes in individual samples within the model predictions. Most samples exhibited significant classification effects, with changes in the predicted probabilities showing a clear gradient, further validating the strong predictive ability of the model at the individual level. The waterfall plot also highlights a few misclassified samples, indicating the areas in which the model requires improvement. The DeLong test showed a significant statistical difference, and both the NRI and IDI were not vanishing, indicating that the MLP model can effectively integrate radiomics and deep learning features (38).
This study has a few limitations:
1. It was a single-center study with a small sample size and lacked external validation from other institutions, which may introduce selection bias.
2. Only the second-phase features of DCE-MRI were extracted; future research could include other phases or ultrasound images.
5 Conclusion
Multimodal radiomics, deep learning, and combined models based on MG and MRI constructed using various machine learning algorithms demonstrated good performance in the preoperative prediction of ALNM. These models may provide valuable guidance for physicians in preoperative intervention, prognostic assessment, and the development of scientifically individualized treatment plans for BC.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Materials, further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by Ethics Committee of the Third Hospital of Shanxi Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements. Written informed consent was not obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
FG: Conceptualization, Data curation, Project administration, Writing – original draft. SS: Conceptualization, Data curation, Project administration, Writing – original draft. XD: Conceptualization, Data curation, Project administration, Writing – original draft. YW: Conceptualization, Data curation, Project administration, Writing – original draft. WY: Conceptualization, Data curation, Project administration, Writing – original draft. PY: Conceptualization, Data curation, Project administration, Writing – original draft. SW: Conceptualization, Data curation, Project administration, Writing – original draft. JY: Conceptualization, Data curation, Project administration, Writing – original draft. XZ: Conceptualization, Data curation, Project administration, Writing – review & editing. YZ: Conceptualization, Data curation, Project administration, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Acknowledgments
We would like to thank Editage (www.editage.cn) for English language editing.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2024.1482020/full#supplementary-material
References
1. Zhou J, Yu X, Wu Q, Wu Y, Fu C, Wang Y, et al. Radiomics analysis of intratumoral and different peritumoral regions from multiparametric MRI for evaluating HER2 status of breast cancer: A comparative study. Heliyon. (2024) 10:e28722. doi: 10.1016/j.heliyon.2024.e28722
2. Xie X, Zhou H, Ma M, Nie J, Gao W, Zhong J, et al. A deep learning model for predicting molecular subtype of breast cancer by fusing multiple sequences of DCE-MRI from two institutes. Acad Radiol. (2024). doi: 10.1016/j.acra.2024.03.002
3. Samiei S, Granzier RWY, Ibrahim A, Primakov S, Lobbes MBI, Beets-Tan RGH, et al. Dedicated axillary MRI-based radiomics analysis for the prediction of axillary lymph node metastasis in breast cancer. Cancers. (2021) 13:757. doi: 10.3390/cancers13040757
4. Qu L, Mei X, Yi Z, Zou Q, Zhou Q, Zhang D, et al. An unsupervised learning model based on CT radiomics features accurately predicts axillary lymph node metastasis in breast cancer patients—diagnostic study. Int J Surg. (2024). doi: 10.1097/JS9.0000000000001778
5. Zeidan AM, Stahl M, Sekeres MA, Steensma DP, Komrokji RS. A call for action: Increasing enrollment of untreated patients with higher-risk myelodysplastic syndromes in first-line clinical trials. Cancer. (2017) 123:3662–72. doi: 10.1002/cncr.v123.19
6. Peto R, Davies C, Godwin J, Gray R, Pan H C, Clarke M, et al. Comparisons between different polychemotherapy regimens for early breast cancer: meta-analyses of long-term outcome among 100 000 women in 123 randomised trials. Lancet (British edition). (2012) 379:432–44.
7. Li H, Hou Y, Xue L, Fan W, Gao B, Yin X. Use of MRI radiomics models in evaluating the low HER2 expression in breast cancer. Curr Med Imaging Formerly Curr Med Imaging Rev. (2024) 20. doi: 10.2174/0115734056234429240426181002
8. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036
9. Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, they are data. Radiology. (2016) 278:563–77. doi: 10.1148/radiol.2015151169
10. Carriero A, Groenhoff L, Vologina E, Basile P, Albera M. Deep learning in breast cancer imaging: state of the art and recent advancements in early 2024. Diagnostics (Basel). (2024) 14. doi: 10.3390/diagnostics14080848
11. Hua Y, Peng Q, Han J, Fei J, Sun A. A two-center study of a combined nomogram based on mammography and MRI to predict ALN metastasis in breast cancer. Magnetic Resonance Imaging. (2024) 110:128–37. doi: 10.1016/j.mri.2024.04.019
12. Ma X, Zhang L, Xiao Q, Huang Y, Lin L, Peng W, et al. Predicting prognosis of phyllodes tumors using a mammography- and magnetic resonance imaging-based radiomics model: A preliminary study. Clin Breast Cancer. (2024). doi: 10.1016/j.clbc.2024.05.006
13. Zhou LQ, Wu XL, Huang SY, Wu GG, Ye HR, Wei Q, et al. Lymph node metastasis prediction from primary breast cancer US images using deep learning. Radiology. (2020) 294:19–28. doi: 10.1148/radiol.2019190372
14. Donker M, van Tienhoven G, Straver ME, Meijnen P, van de Velde CJH, Mansel RE, et al. Radiotherapy or surgery of the axilla after a positive sentinel node in breast cancer (EORTC 10981-22023 AMAROS): a randomised, multicentre, open-label, phase 3 non-inferiority trial. Lancet Oncol. (2014) 15:1303–10. doi: 10.1016/S1470-2045(14)70460-7
15. Garcia-Tejedor A, Ortega-Exposito C, Salinas S, Luzardo-González A, Falo C, Martinez-Pérez E, et al. Axillary lymph node dissection versus radiotherapy in breast cancer with positive sentinel nodes after neoadjuvant therapy (ADARNAT trial). Front Oncol. (2023) 13:1184021. doi: 10.3389/fonc.2023.1184021
16. Boughey JC, Moriarty JP, Degnim AC, Gregg MS, Egginton JS, et al. Cost modeling of preoperative axillary ultrasound and fine-needle aspiration to guide surgery for invasive breast cancer. Ann Surg Oncol. (2010) 17:953–8. doi: 10.1245/s10434-010-0919-1
17. Langer I, Guller U, Berclaz G, Koechli OR, Schaer G, Fehr MK, et al. Morbidity of sentinel lymph node biopsy (SLN) alone versus SLN and completion axillary lymph node dissection after breast cancer surgery. Ann Surg. (2007) 245:452–61. doi: 10.1097/01.sla.0000245472.47748.ec
18. Iranmakani S, Mortezazadeh T, Sajadian F, Ghaziani M F, Ghafari A, Khezerloo D, et al. A review of various modalities in breast imaging: technical aspects and clinical outcomes. Egyptian J Radiol Nucl Med. (2020) 51:57–22. doi: 10.1186/s43055-020-00175-5
19. Zhang T, Tan T, Samperna R, Li Z, Gao Y, Wang X, et al. Radiomics and artificial intelligence in breast imaging: a survey. Artif Intell Rev. (2023) 56:S857–92. doi: 10.1007/s10462-023-10543-y
20. Pesapane F, Trentin C, Ferrari F, Signorelli G, Tantrige P, Montesano M, et al. Deep learning performance for detection and classification of microcalcifications on mammography. Eur Radiol Exp. (2023) 7:69. doi: 10.1186/s41747-023-00384-3
21. Kontos D, Winham SJ, Oustimov A, Pantalone L, Hsieh M, Gastounioti A, et al. Radiomic phenotypes of mammographic parenchymal complexity: toward augmenting breast density in breast cancer risk assessment. Radiology. (2019) 290:41–9. doi: 10.1148/radiol.2018180179
22. Fusco R, Piccirillo A, Sansone M, Granata V, Rubulotta MR, Petrosino T, et al. Radiomics and artificial intelligence analysis with textural metrics extracted by contrast-enhanced mammography in the breast lesions classification. Diagnostics (Basel). (2021) 11. doi: 10.3390/diagnostics11050815
23. Wang S, Sun Y, Li R, Mao N, Li Q, Jiang T, et al. Diagnostic performance of perilesional radiomics analysis of contrast-enhanced mammography for the differentiation of benign and Malignant breast lesions. Eur Radiol. (2022) 32:639–49. doi: 10.1007/s00330-021-08134-y
24. Ma W, Zhao Y, Ji Y, Guo X, Jian X, Liu P, et al. Breast cancer molecular subtype prediction by mammographic radiomic features. Acad Radiol. (2019) 26:196–201. doi: 10.1016/j.acra.2018.01.023
25. Lee JY, Lee KS, Seo BK, Cho KR, Woo OH, Song SE, et al. Radiomic machine learning for predicting prognostic biomarkers and molecular subtypes of breast cancer using tumor heterogeneity and angiogenesis properties on MRI. Eur Radiol. (2022) 32:650–60. doi: 10.1007/s00330-021-08146-8
26. Yu Y, He Z, Ouyang J, Tan Y, Chen Y, Gu Y, et al. Magnetic resonance imaging radiomics predicts preoperative axillary lymph node metastasis to support surgical decisions and is associated with tumor microenvironment in invasive breast cancer: A machine learning, multicenter study. eBioMedicine. (2021) 69:103460. doi: 10.1016/j.ebiom.2021.103460
27. Adam R, Dell'Aquila K, Hodges L, Maldjian T, Duong TQ. Deep learning applications to breast cancer detection by magnetic resonance imaging: a literature review. Breast Cancer Res. (2023) 25:87. doi: 10.1186/s13058-023-01687-4
28. Li X, Yang L, Jiao X. Comparison of traditional radiomics, deep learning radiomics and fusion methods for axillary lymph node metastasis prediction in breast cancer. Acad Radiol. (2023) 30:1281–7. doi: 10.1016/j.acra.2022.10.015
29. Beuque MPL, Lobbes MBI, van Wijk Y, Widaatalla Y, Primakov S, Majer M, et al. Combining deep learning and handcrafted radiomics for classification of suspicious lesions on contrast-enhanced mammograms. Radiology. (2023) 307:e221843. doi: 10.1148/radiol.221843
30. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, Piscataway, NJ (2016).
31. Szegedy C, Ioffe S, Vanhoucke V, Alemi A. Inception-v4, inception-ResNet and the impact of residual connections on learning. (2016). doi: 10.1609/aaai.v31i1.11231
32. Huang G, Liu Z, Laurens VDM, Weinberger KQ. Densely connected convolutional networks. IEEE Comput Soc. (2016). doi: 10.1109/CVPR35066.2017
33. Yu Y, Tan Y, Xie C, Hu Q, Ouyang J, Chen Y. Development and validation of a preoperative magnetic resonance imaging radiomics–based signature to predict axillary lymph node metastasis and disease-free survival in patients with early-stage breast cancer. JAMA Network Open. (2020) 3:e2028086. doi: 10.1001/jamanetworkopen.2020.28086
34. Zwanenburg A, Abdalah MA, Apte A, Ashrafinia S, Beukinga J, Bogowicz M, et al. PO-0981: results from the image biomarker standardisation initiative. Radiotherapy Oncol. (2018) 127:S543–4. doi: 10.1016/S0167-8140(18)31291-X
35. Wang Y, Ma X, Zhang Y, Wang M, Yatabe Y, Lam S, et al. ITPKA gene body methylation regulates gene expression and serves as an early diagnostic marker in lung and other cancers. J Thorac Oncol. (2016) 11:1469–81. doi: 10.1016/j.jtho.2016.05.010
36. Aerts HJWL, Velazquez ER, Leijenaar RTH, Parmar C, Lambin P. Corrigendum: Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. (2014) 5. doi: 10.1038/ncomms5644
37. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. (2015) 350:g7594. doi: 10.1161/CIRCULATIONAHA.114.014508
Keywords: mammography, magnetic resonance imaging, machine learning, radiomics, deep learning, breast cancer, axillary lymph node metastasis
Citation: Guo F, Sun S, Deng X, Wang Y, Yao W, Yue P, Wu S, Yan J, Zhang X and Zhang Y (2024) Predicting axillary lymph node metastasis in breast cancer using a multimodal radiomics and deep learning model. Front. Immunol. 15:1482020. doi: 10.3389/fimmu.2024.1482020
Received: 17 August 2024; Accepted: 27 November 2024;
Published: 13 December 2024.
Edited by:
Issam El Naqa, University of Michigan, United StatesReviewed by:
Omar Hamdy, Mansoura University, EgyptDomenico Pomarico, University of Bari Aldo Moro, Italy
Copyright © 2024 Guo, Sun, Deng, Wang, Yao, Yue, Wu, Yan, Zhang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yangang Zhang, dXJvenlnQDE2My5jb20=; Xiaojun Zhang, a3l6eV8xMTA2QDE2My5jb20=
†These authors have contributed equally to this work