Deep learning model for the early prediction of pathologic response following neoadjuvant chemotherapy in breast cancer patients using dynamic contrast-enhanced MRI

Lv, Meng; Zhao, BinXin; Mao, Yan; Wang, Yongmei; Su, Xiaohui; Zhang, Zaixian; Wu, Jie; Gao, Xueqiang; Wang, Qi

doi:10.3389/fonc.2025.1491843

ORIGINAL RESEARCH article

Front. Oncol., 25 February 2025

Sec. Cancer Imaging and Image-directed Interventions

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1491843

Deep learning model for the early prediction of pathologic response following neoadjuvant chemotherapy in breast cancer patients using dynamic contrast-enhanced MRI

Meng Lv¹

BinXin Zhao²

Yan Mao¹

Yongmei Wang¹

Xiaohui Su³

Zaixian Zhang³

Jie Wu⁴

Xueqiang Gao¹

Qi Wang^2*

¹Breast Disease Diagnosis and Treatment Center, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China
²Department of Radiation Oncology, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China
³Department of Radiology, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China
⁴Department of Pathology, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China

Purpose: This study aims to investigate the diagnostic accuracy of various deep learning methods on DCE-MRI, in order to provide a simple and accessible tool for predicting pathologic response of NAC in breast cancer patients.

Methods: In this study, we enrolled 313 breast cancer patients who had complete DCE-MRI data and underwent NAC followed by breast surgery. According to Miller-Payne criteria, the efficacy of NAC was categorized into two groups: the patients achieved grade 1-3 of Miller-Payne criteria were classified as the non-responders, while patients achieved grade 4-5 of Miller-Payne criteria were classified as responders. Multiple deep learning frameworks, including ViT, VGG16, ShuffleNet_v2, ResNet18, MobileNet_v2, MnasNet-0.5, GoogleNet, DenseNet121, and AlexNet, were used for transfer learning of the classification model. The deep learning features were obtained from the final fully connected layer of the deep learning models, with 256 features extracted based on DCE-MRI data for each patient of each deep learning model. Various machine-learning techniques, including support vector machine (SVM), K-nearest neighbor (KNN), RandomForest, ExtraTrees, XGBoost, LightGBM, and multiple-layer perceptron (MLP), were employed to construct classification models.

Results: We utilized various deep learning models to extract features and subsequently constructed machine learning models. Based on the performance of different machine learning models’ AUC values, we selected the classifiers with the best performance. ResNet18 exhibited superior performance, with an AUC of 0.87 (95% CI: 0.82 - 0.91) and 0.87 (95% CI: 0.78 - 0.96) in the train and test cohorts, respectively.

Conclusions: Using pre-treatment DCE-MRI images, our study trained multiple deep models and developed the best-performing DLR model for predicting pathologic response of NAC in breast cancer patients. This prognostic tool provides a dependable and impartial basis for effectively identifying breast cancer patients who are most likely to benefit from NAC before its initiation. At the same time, it can also identify those patients who are insensitive to NAC, allowing them to proceed directly to surgical treatment and prevent the risk of losing the opportunity for surgery due to disease progression after NAC.

Introduction

Breast cancer has become the most common prevalent malignancy worldwide and the first leading cause of cancer death in women (1). Neoadjuvant chemotherapy (NAC) is recommended as the standard treatment for both locally advanced and early invasive breast cancer patients with an intent to perform breast-conserving surgeries (2). The evaluation of NAC also provided prognosis prediction and in vivo drug susceptibility test. Research has indicated that a significant proportion of patients may experience beneficial effects from NAC, potentially achieving a complete pathologic response (pCR). Nevertheless, a subset of 10-35% of breast cancer cases have been identified as unresponsive to NAC, with approximately 5% of patients exhibiting tumor growth following treatment (3). In such cases, NAC has been shown to be ineffective and may even delay surgical intervention. Therefore, early prediction of response to NAC is critical for optimizing and adjusting therapeutic strategies, which may mitigate toxicity without impacting efficacy. The Miller-Payne grading criteria serves as a suitable pathological assessment method, utilizing tumor cell density and morphology to classify residual tumors as Grade 1-5. Grade 4-5 tumors are characterized by no evidence of residual tumor or microscopic foci of invasive carcinoma, and are indicative of chemotherapy-sensitive breast cancers with a optimistic long-term prognosis.

Dynamic contrast enhancement MRI (DCE-MRI) is the most common and effective imaging test for clinical breast MRI examinations. It has shown superiority of identifying small breast cancer lesions, and evaluating blood perfusion and distribution of tumor vessels. Therefore, DCE-MRI is recommended to evaluate the efficacy of NAC in breast cancer patients following an early treatment period (4). Previous studies investigated the role of quantitative DCE-MRI parameters in the therapeutic evaluation of NAC. A retrospective study enrolled 37 breast cancer patients and found changes in DCE-MRI kinetic parameters were correlated with pathologic response after NAC (5). Li and colleagues (6) discovered the signal enhancement ratio washout volume and K_ep might prognosticate pathologic response in breast cancer patients. The diagnostic efficacy of quantitative parameters ranged from 0.73 to 0.78. The efficacy of DCE-MRI in prediction of pathologic response was limited and relied on dynamic changes of radiologic parameters. Importantly, the reliable volumetric and kinetic parameters in the prediction of therapeutic efficacy cannot require prior to NAC treatment (7).

Radiomics, an emerging field in cancer treatment, involves the automated analysis of quantitative data extracted from medical images to correlate with malignant biological properties, therapeutic efficacy, and clinical prognosis. This approach offers the potential for individualized precision therapy in a non-invasive manner, allowing for the characterization of tumor properties solely through imaging data rather than invasive sampling procedures. Advancements in deep learning radiomics (DLR) and data processing tools have facilitated the interpretation and utilization of data in clinical settings. Unlike traditional radiomics methods, deep learning-based radiomics techniques exploit the inherent non-linearity of deep neural networks to extract relevant features automatically without manual feature extraction. On the other hand, deep learning has the capability to leverage comprehensive feature data, particularly with respect to the spatial arrangement of pixels, in order to extract information pertaining to the textures and shapes. Consequently, even when employing basic digital images, deep learning is anticipated to excel in the precise and detailed identification. Verma (8) et al. investigated a multimodal spatiotemporal DLR to predict pCR of NAC among breast cancer patients. The AUC of 3D-VGGNet and 3D-ResNet signatures were 0.68, and 0.50, respectively. Due to the limited prognostic efficacy, many studies focused on the fusion of different DLR models with multimodal images, which complicated the development of predictive signature. In a retrospective study, 536 breast cancer patients were enrolled to provide a DLR signature for predicting pCR to NAC (9). The fusion of different DLR signatures with multiple MR images yielded an AUC of 0.745. Although DLR has been proposed for predicting pathologic response following NAC, these studies have been hindered by small sample sizes and limited predictive accuracy. Meanwhile, most of these studies focused on the prediction of pCR, instead of pathologic response, following NAC in breast cancer patients. Hence, this study aims to investigate the diagnostic accuracy of various deep learning methods on DCE-MRI, in order to provide a simple and accessible tool for predicting pathologic response of NAC in breast cancer patients.

Materials and methods

Patients

A total of 313 newly diagnosed breast cancer patients treated at the Affiliated Hospital of Qingdao University between 2016 and 2020 were included in this retrospective study, for which informed consent was waived. The study was approved by the ethics committee of the Affiliated Hospital of Qingdao University and adhered to the principles outlined in the Declaration of Helsinki. The study established specific inclusion criteria, including (1) primary invasive breast cancer confirmed by histology; (2) complete medical records; (3) qualified dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) images before neoadjuvant chemotherapy (NAC); (4) receipt of preoperative systemic chemotherapy; (5) adherence to NAC protocols based on either the National Comprehensive Cancer Network or Chinese Society of Clinical Oncology guidelines; (6) confirmation of surgical outcomes through pathologic examination of Miller-Payne grading criteria. Concurrently, the exclusion criteria included (1) advanced cancer patients with distant metastases; (2) a prior history of other malignancy, incomplete neoadjuvant chemotherapy (NAC) treatment prior to surgery; (3) incomplete essential clinical data (molecular subtype).

Pathological evaluation

The patients who underwent surgery following NAC were assessed using the Miller-Payne criteria. The efficacy of NAC was categorized into five levels: G1 denoting some changes in cancer cells without a decrease in total numbers, G2 indicating a reduction rate of <30% with high total numbers, G3 representing a moderate decrease of ≥30% but <90% in cancer cells, G4 showing a significant reduction of ≥90% with only scattered cell clusters remaining, and G5 indicating the absence of cancer cells at the original tumor site. The patients achieved G1, G2, and G3 were classified as the non-responders, while patients achieved G4 and G5 were classified as responders.

Magnetic resonance acquisition protocol

Pre-treatment dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) was carried out for each patient prior to biopsy, within a timeframe of 1-2 weeks before NAC. The MRI scan was performed using a 3.0 T scanner equipped with either an 8-channel or 16-channel breast coil (Signa HDxt, GE Healthcare), with patients positioned in a prone manner. The DCE-MRI protocol included one pre-contrast and eight post-contrast T1-weighted images with fat saturation. Following the intravenous administration of gadolinium-DTPA contrast agent (0.2ml/kg), a subsequent flush of 20 ml of saline solution was administered at a flow rate of approximately 2 ml/s. The initial post-contrast images were acquired 60 seconds after the start of the gadolinium-DTPA injection, followed by seven additional scans. The configurations used to obtain MR images were described previously (10).

Tumor segmentation and deep learning features extraction

The identification and delineation of regions of interest (ROI) were conducted manually on individual slices of DCE-MRI, focusing on the peak enhanced phase determined by the time-intensity curve, utilizing the itk-SNAP software (www.itksnap.org). This task was executed by two radiologists possessing five years of experience each. In the peak enhanced phase of the time-intensity curve, the breast carcinoma exhibited significant enhancement, whereas the surrounding stroma displayed slight enhancement. Subsequent to the completion of tumor masking contouring by the junior radiologist, the senior radiologist boasting 10 years of experience reviewed the ROI for accuracy and implemented any necessary modifications.

Multiple deep learning frameworks, including Vision Transformer (ViT), VGG16, ShuffleNet_v2, ResNet18, MobileNet_v2, MnasNet-0.5, GoogleNet, DenseNet121, and AlexNet, were used for transfer learning of the classification model. In deep learning analysis, a ROI images measuring 448 × 448 pixels was utilized to crop the largest cross-section of breast tumor as input. In order to standardize image signal intensity across patients, image intensity was normalized to a consistent range of 0–1000. The detailed description of the model architectures used in our study was shown in Supplementary Table S1. The deep learning process involved the development of independent inputs for each image. Following the completion of training for the deep learning model, features were downscaled from the final fully connected layer to 256 and use them as input for the machine learning model. In the test cohort, ROI images were inputted into the trained deep learning model. The deep learning features from the fully connected layer were also extracted for further analysis.

Deep learning models construction and validation

The dataset was randomly partitioned into train and test cohorts at an 8:2 ratio. The train cohort was employed for the development of deep learning models utilizing the extracted deep learning features. The radiomic features underwent an initial screening using the Mann-Whitney U test with a significance level set at P < 0.05. Following this, the Pearson correlation coefficient was utilized to assess the correlation between each pair of radiomic features, with features exhibiting a correlation coefficient |r| greater than 0.9 being removed. Feature selection in the train cohort was conducted using the least absolute shrinkage and selection operator (LASSO) method. The settings for the Lasso model are as follows: alpha = 1, and the maximum number of iterations for the optimization algorithm is set to 1000. Various machine-learning techniques, including support vector machine (SVM), K-nearest neighbor (KNN), RandomForest, ExtraTrees, XGBoost, LightGBM, and multiple-layer perceptron (MLP), were employed to construct classification models. A 5-fold cross-validation was performed using the StratifiedKFold function from scikit-learn, which divided the train cohort into five non-overlapping subsets. In each iteration, one partition was used as the test set, while the remaining partitions served as the train set. This approach ensures that each class is represented proportionally across both the training and testing folds, helping to determine the optimal model hyperparameters. The performance of these models was assessed through ROC analysis, as well as the calculation of sensitivity, specificity, positive predictive value(PPV) and negative predictive value(NPV). A calibration curve was utilized to plot prediction probabilities against measured rates. The evaluation of model adequacy was carried out using the Hosmer-Lemeshow test.

Statistical analyses

Statistical analyses were performed using R Studio (version:2023.12.1) and Python 3.12.2, with the Fisher’s test, χ² test, or Mann-Whitney U test utilized to assess the association between the effectiveness of NAC and clinical variables. “pROC”, “rms”, “rmda”, and “generalhoslem” were used to generate the ROC curve, calibration curve, and Hosmer-Lemeshow test. We used a two-tailed P value of 0.05 for the statistical analysis.

Results

Study population characteristics

This study enrolled 313 patients with breast cancer from 2016 to 2021. The flowchart of the screening process is summarized in Figure 1. We randomly divided 313 breast patients into train and test sets at an 8:2 ratio. Based on pathological analysis of the surgical specimens, the Miller-Payne grading results were as follows: 16, 67, 86, 54, and 90 patients achieved G1, G2, G3, G4, and G5, respectively. 144 patients were classified as responders, and 169 patients were classified as non-responders.

Figure 1

Figure 1. Flow chart of patient enrollment.

Table 1 lists the clinical characteristics of all patients. The ER status, PR status, Her-2 status, Ki-67 index, and clinical T stage showed a significant association with pathologic response after NAC in breast cancer patients. There was no significant difference between responders and non-responders in terms of age, menopausal status, and clinical N stage. Meanwhile, no statistically significant disparities were found in clinical parameters, including age, menopausal status, ER status, PR status, Her-2 status, Ki-67 index, and clinical T/N stages, between the train and test cohorts (shown in Supplementary Table S2).

Table 1

Table 1. The clinical characteristics between Non-responders and responders.

Deep learning features extraction and selection

The flowchart of building the DLR signatures is summarized in Figure 2. Multiple deep learning frameworks, including ViT, VGG16, ShuffleNet_v2, ResNet18, MobileNet_v2, MnasNet-0.5, GoogleNet, DenseNet121, and AlexNet, were used for transfer learning of the classification model. The deep learning features were obtained from the final fully connected layer of the deep learning models, with 256 features extracted based on DCE-MRI data for each patient of each deep learning model. The Pearson correlation coefficient analysis and subsequent LASSO regression analysis were conducted to eliminate redundant and irrelevant features. As an example, 10 features and 11 features were chosen to construct classification model in ViT model and VGG16 model, respectively. The screened features were used for subsequent construction of classification model. The detailed selection features of deep learning models were shown in Supplementary Table S3.

Figure 2

Figure 2. The flowchart of building deep learning radiomic models.

Deep learning models construction and validation

We analyzed the performance of SVM, KNN, RandomForest, ExtraTress, XGBoost, LightGBM, and MLP to construct classification models for predicting pathologic response following NAC in breast cancer patients. The detailed results of the models are shown in Table 2.

Table 2

Table 2. The detailed results of different classifiers among various deep models for predicting pathologic response following NAC in breast cancer patients.

Taking the ViT deep learning model as an example, in the train cohort, the AUC for SVM, KNN, RandomForest, ExtraTress, XGBoost, LightGBM, and MLP were recorded at 0.90, 0.77, 1.00, 1.00, 0.99, 0.93, and 0.80, respectively. Within the test cohort, these values were observed as 0.73 for SVM, 0.63 for KNN, 0.74 for RandomForest, 0.59 for ExtraTrees, 0.72 for XGBoost, 0.74 for LightGBM, and 0.78 for MLP, respectively. The MLP classification model exhibited good performance with an AUC of 0.80 (95% CI, 0.74 - 0.85) and 0.78 (95% CI, 0.67 - 0.89) in train and test groups, respectively. The Delong’s test was utilized to access the disparities in predictive performance between MLP and other alternative models in the test cohort. The predictive capacity of MLP model is better than that of the KNN (p < 0.01) and ExtraTrees (p =0.02) models; however, it exhibits no statistically significant differences when compared with other models.

Sequentially, we utilized various deep learning models to extract features and subsequently constructed machine learning models. Based on the performance of different machine learning models, we selected the classifiers with the best performance. The specific results of the best-performing classifiers among various deep models are presented in Table 3. The comparative performance of diverse deep learning models exhibits substantial equivalence, although ResNet18 and AlexNet demonstrates marginally superior outcomes. The ROC curves of different deep learning models are shown in detail in Figures 3 and 4. In the training set, the sensitivity, specificity, PPV, and NPV of the ResNet18 model are 0.77, 0.81, 0.77, and 0.80, respectively. In the test set, the sensitivity, specificity, PPV, and NPV of the ResNet18 model are 0.83, 0.74, 0.73, and 0.83, respectively. In the internal validation set, the DeLong test revealed that the predictive performance of ResNet18 was significantly superior to MobileNet_v2 (p= 0.04), MnasNet-0.5 (p=0.04), and DenseNet121 (p=0.04), with statistical significance. The calibration curves for ResNet18 consistently showed agreement both in the train set (illustrated in Figure 5A) and the test set (illustrated in Figure 5B).

Table 3

Table 3. The best-performing classifiers among various deep models.

Figure 3

Figure 3. The ROC curves of different deep learning models for predicting pathological response of breast cancer patients after NAC in train cohort. (A) ROC curve for ViT model; (B) ROC curve for VGG16 model; (C) ROC curve for ShuffleNet_v2 model; (D) ROC curve for ResNet18 model; (E) ROC curve for MobileNet_v2 model; (F) ROC curve for MnasNet-0.5 model; (G) ROC curve for GoogleNet model; (H) ROC curve for DenseNet121 model; (I) ROC curve for AlexNet model.

Figure 4

Figure 4. The ROC curves of different deep learning models for predicting pathological response of breast cancer patients after NAC in test cohort. (A) ROC curve for ViT model; (B) ROC curve for VGG16 model; (C) ROC curve for ShuffleNet_v2 model; (D) ROC curve for ResNet18 model; (E) ROC curve for MobileNet_v2 model; (F) ROC curve for MnasNet-0.5 model; (G) ROC curve for GoogleNet model; (H) ROC curve for DenseNet121 model; (I) ROC curve for AlexNet model.

Figure 5

Figure 5. The calibration curves of radiomic signature based on different classification models. (A) Calibration curve of ResNet18 model in the train set; (B) Calibration curve of ResNet18 model in the test set.

Discussion

NAC stands as the established therapeutic approach for both locally advanced and early invasive breast cancer patients who aimed at facilitating breast-conserving surgeries. The identification of a reliable method for predicting sensitivity to NAC before surgical intervention holds significant importance in treatment planning. This assessment profoundly influences the choice between initiating NAC followed by surgery or proceeding directly to surgery without prior NAC administration. Radiomics has emerged as a burgeoning domain within cancer treatment. Typically, quantitative data extracted from images is automatically analyzed to correlate with malignant biological properties, therapeutic efficacy, and clinical prognosis. This approach offers a promising avenue for delivering tailored precision therapy in a non-invasive manner. With advancements in deep learning radiomics and associated data processing tools, the interpretation and utilization of data in clinical contexts have become more accessible.

In this study, we enrolled 313 breast cancer patients who had complete DCE-MRI data and underwent NAC followed by breast surgery. Various deep learning frameworks, such as ViT, VGG16, ShuffleNet_v2, ResNet18, MobileNet_v2, MnasNet-0.5, GoogleNet, DenseNet121, and AlexNet, were utilized for transfer learning to develop the classification model. Deep learning features were extracted from the fully connected layer and used to construct classification models. ResNet18 exhibited superior performance, with an AUC of 0.87 (95% CI: 0.82 - 0.91) and 0.87 (95% CI: 0.78 - 0.96) in the train and test cohorts, respectively.

In the realm of medical image analysis, deep learning-driven radiomic features have demonstrated superior performance. Li (11) et al. recruited 95 breast cancer patients to construct a DLR model that integrates pre-treatment and early-treatment DCE-MRI data for predicting pCR to NAC. The AUC of DLR was 0.64 for pre-treatment, 0.88 for early-treatment, and 0.90 for combined data. In a multicenter retrospective study, 1262 patients were included in order to develop a novel tool for predicting pCR of breast cancer to NAC (12). The stacking model, which integrates pre-, post-, and delta-models based on traditional radiomic features and DLR features, achieved AUC values of 0.89, 0.92, and 0.89 in the external validation cohorts, respectively. Traditional methods are simple, conventional, and not black-box models. However, models based solely on traditional radiomic features do not show ideal predictive performance, while models based on multi-omics, multi-temporal data, traditional radiomic features, and deep learning features demonstrate better predictive power, albeit with a more complex process.

We conducted experiments to explore the base DLR model’s performance without machine learning step. The results are included in the Supplementary Table S4, where we provide a detailed analysis of the performance metrics for each approach. The findings indicate that while the base DLR models showed suboptimal performance, we further conducted a two-stage system with deep learning (for feature generation), and machine learning (for feature transformation followed by classifiers). The integration of machine learning with feature transformation significantly enhances the overall performance, justifying the need for the proposed approach.

Our study has the following advantages. At first, it focused on identifying patients with Miller-Payne grade 4-5 who responded better to NAC, rather than solely predicting pCR status. This approach allowed us to select patients who may benefit from NAC. Additionally, it enabled patients who were insensitive to NAC to proceed directly to surgical therapy, avoiding excessive therapy and potentially losing the opportunity for surgery due to disease progression after NAC. Simultaneously, pCR indicated complete pathological remission of both the metastatic axillary lymph nodes and the primary breast tumor. Predicting pCR through radiomics requires delineation and feature extraction of ROI separately for the metastatic lymph nodes and the primary breast tumor, which undoubtedly increased the complexity of the radiomics model, affecting its reproducibility and practical application. Finally, previous studies have mostly focused on building models for predicting NAC response in breast cancer patients using MRI parameters and traditional radiomics. In contrast, we explored multiple deep learning models for predicting NAC response and found ResNet18 demonstrated excellent performance, achieving an AUC of 0.87 and 0.87 in the train and test cohorts, respectively. Despite the strengths of our study, there is a lot of room for enhancement. Initially, a singular imaging protocol is utilized for pre-treatment NAC. Although DCE-MRI of the breast stood out as the most distinctive, the multiparametric MRI, including T2WI and DWI, might provide more comprehensive and unbiased data. Furthermore, this study was limited by its retrospective and single-center nature. A prospective, multicenter investigation could help in creating a universal prognostic model applicable to various clinical scenarios.

Using pre-treatment DCE-MRI images, our study trained multiple deep models and developed the best-performing DLR model for predicting pathologic response of NAC in breast cancer patients. The model demonstrated excellent performance in both the train and test cohorts. As a result, this prognostic tool provides a dependable and impartial basis for effectively identifying breast cancer patients who are most likely to benefit from NAC before its initiation. At the same time, it can also identify those patients who are insensitive to NAC, allowing them to proceed directly to surgical treatment and prevent the risk of losing the opportunity for surgery due to disease progression after NAC.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by The Ethics Committee of Affiliated Hospital of Qingdao University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this is a retrospective study, informed consent was waived.

Author contributions

ML: Conceptualization, Data curation, Writing – original draft. BZ: Writing – original draft, Methodology. YM: Data curation, Methodology, Resources, Writing – original draft. YW: Formal Analysis, Investigation, Software, Writing – original draft. XS: Data curation, Investigation, Methodology, Writing – original draft. ZZ: Investigation, Resources, Validation, Writing – review & editing. JW: Data curation, Resources, Writing – review & editing. XG: Investigation, Methodology, Writing – original draft. QW: Conceptualization, Formal Analysis, Funding acquisition, Project administration, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Qingdao Postdoctoral Sustentation Fund (RZ2100001380) and National Natural Science Foundation of China (82003224). The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Acknowledgments

We thank Home for Researchers editorial team (www.home-for-researchers.com) for language editing service.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1491843/full#supplementary-material

Supplementary Table 3 | The detailed selection features of deep learning models.

Abbreviations

AUC, area under the curve; DCE-MRI, dynamic contrast-enhanced magnetic resonance imaging; DLR, deep learning radiomic; DWI, diffusion-weighted imaging; ER, Estrogen receptor; KNN, K-nearest neighbor; LASSO, least absolute shrinkage and selection operator; MLP, multiple-layer perceptron; NAC, neoadjuvant chemotherapy; pCR, complete pathological response; PR, Progesterone receptor; RECIST, Response Evaluation Criteria in Solid Tumors; ROI, regions of interest; SVM, support vector machine; T2WI, T2-weighted imaging; ViT, Vision Transformer.

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660

PubMed Abstract | Crossref Full Text | Google Scholar

2. Navarro-Cecilia J, Duenas-Rodriguez B, Luque-Lopez C, Ramirez-Exposito MJ, Martinez-Ferrol J, Ruiz-Mateas A, et al. Intraoperative sentinel node biopsy by one-step nucleic acid amplification (OSNA) avoids axillary lymphadenectomy in women with breast cancer treated with neoadjuvant chemotherapy. Eur J Surg Oncol. (2013) 39:873–9. doi: 10.1016/j.ejso.2013.05.002

PubMed Abstract | Crossref Full Text | Google Scholar

3. Rastogi P, Anderson SJ, Bear HD, Geyer CE, Kahlenberg MS, Robidoux A, et al. Preoperative chemotherapy: updates of National Surgical Adjuvant Breast and Bowel Project Protocols B-18 and B-27. J Clin Oncol. (2008) 26:778–85. doi: 10.1200/JCO.2007.15.0235

PubMed Abstract | Crossref Full Text | Google Scholar

4. Mann RM, Balleyguier C, Baltzer PA, Bick U, Colin C, Cornford E, et al. Breast MRI: EUSOBI recommendations for women's information. Eur Radiol. (2015) 25:3669–78. doi: 10.1007/s00330-015-3807-z

PubMed Abstract | Crossref Full Text | Google Scholar

5. Ah-See ML, Makris A, Taylor NJ, Harrison M, Richman PI, Burcombe RJ, et al. Early changes in functional dynamic magnetic resonance imaging predict for pathologic response to neoadjuvant chemotherapy in primary breast cancer. Clin Cancer Res. (2008) 14:6580–9. doi: 10.1158/1078-0432.CCR-07-4310

PubMed Abstract | Crossref Full Text | Google Scholar

6. Li X, Arlinghaus LR, Ayers GD, Chakravarthy AB, Abramson RG, Abramson VG, et al. DCE-MRI analysis methods for predicting the response of breast cancer to neoadjuvant chemotherapy: pilot study findings. Magnetic Resonance Med. (2014) 71:1592–602. doi: 10.1002/mrm.24782

PubMed Abstract | Crossref Full Text | Google Scholar

7. Ogston KN, Miller ID, Payne S, Hutcheon AW, Sarkar TK, Smith I, et al. A new histological grading system to assess response of breast cancers to primary chemotherapy: prognostic significance and survival. Breast. (2003) 12:320–7. doi: 10.1016/S0960-9776(03)00106-1

PubMed Abstract | Crossref Full Text | Google Scholar

8. Verma M, Abdelrahman L, Collado-Mesa F, Abdel-Mottaleb M. Multimodal spatiotemporal deep learning framework to predict response of breast cancer to neoadjuvant systemic therapy. Diagnostics (Basel). (2023) 13. doi: 10.3390/diagnostics13132251

PubMed Abstract | Crossref Full Text | Google Scholar

9. Joo S, Ko ES, Kwon S, Jeon E, Jung H, Kim JY, et al. Multimodal deep learning models for the prediction of pathologic response to neoadjuvant chemotherapy in breast cancer. Sci Rep. (2021) 11:18800. doi: 10.1038/s41598-021-98408-8

PubMed Abstract | Crossref Full Text | Google Scholar

10. Zhang B, Yu Y, Mao Y, Wang H, Lv M, Su X, et al. Development of MRI-based deep learning signature for prediction of axillary response after NAC in breast cancer. Acad Radiol. (2023). doi: 10.1016/j.acra.2023.10.004

PubMed Abstract | Crossref Full Text | Google Scholar

11. Li Y, Fan Y, Xu D, Li Y, Zhong Z, Pan H, et al. Deep learning radiomic analysis of DCE-MRI combined with clinical characteristics predicts pathological complete response to neoadjuvant chemotherapy in breast cancer. Front Oncol. (2022) 12:1041142. doi: 10.3389/fonc.2022.1041142

PubMed Abstract | Crossref Full Text | Google Scholar

12. Huang Y, Zhu T, Zhang X, Li W, Zheng X, Cheng M, et al. Longitudinal MRI-based fusion novel model predicts pathological complete response in breast cancer treated with neoadjuvant chemotherapy: a multicenter, retrospective study. EClinicalMedicine. (2023) 58:101899. doi: 10.1016/j.eclinm.2023.101899

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: breast cancer, neoadjuvant chemotherapy, Miller-Payne grading criteria, dynamic contrast enhancement MRI, deep learning model

Citation: Lv M, Zhao B, Mao Y, Wang Y, Su X, Zhang Z, Wu J, Gao X and Wang Q (2025) Deep learning model for the early prediction of pathologic response following neoadjuvant chemotherapy in breast cancer patients using dynamic contrast-enhanced MRI. Front. Oncol. 15:1491843. doi: 10.3389/fonc.2025.1491843

Received: 05 September 2024; Accepted: 05 February 2025;
Published: 25 February 2025.

Edited by:

Redhwan Ahmed Al-Naggar, National University of Malaysia, Malaysia

Reviewed by:

Yuanpin Zhou, Zhejiang University, China
Gopichandh Danala, University of Oklahoma, United States

Copyright © 2025 Lv, Zhao, Mao, Wang, Su, Zhang, Wu, Gao and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qi Wang, cWRmeV93cUBxZHUuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Deep learning model for the early prediction of pathologic response following neoadjuvant chemotherapy in breast cancer patients using dynamic contrast-enhanced MRI

Introduction

Materials and methods

Patients

Pathological evaluation

Magnetic resonance acquisition protocol

Tumor segmentation and deep learning features extraction

Deep learning models construction and validation

Statistical analyses

Results

Study population characteristics

Deep learning features extraction and selection

Deep learning models construction and validation

Discussion

Data availability statement

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

Abbreviations

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good