Pretreatment DCE-MRI-Based Deep Learning Outperforms Radiomics Analysis in Predicting Pathologic Complete Response to Neoadjuvant Chemotherapy in Breast Cancer

Peng, Yunsong; Cheng, Ziliang; Gong, Chang; Zheng, Chushan; Zhang, Xiang; Wu, Zhuo; Yang, Yaping; Yang, Xiaodong; Zheng, Jian; Shen, Jun

doi:10.3389/fonc.2022.846775

ORIGINAL RESEARCH article

Front. Oncol., 10 March 2022

Sec. Breast Cancer

Volume 12 - 2022 | https://doi.org/10.3389/fonc.2022.846775

This article is part of the Research Topic Quantitative Imaging and Artificial Intelligence in Breast Tumor Diagnosis

Yunsong Peng¹,²†

Ziliang Cheng³†

Chang Gong⁴

Chushan Zheng³

Xiang Zhang³

Zhuo Wu³

Yaping Yang⁴

Xiaodong Yang¹,²

Jian Zheng¹,²*

Jun Shen³*

¹Division of Life Sciences and Medicine, School of Biomedical Engineering (Suzhou), University of Science and Technology of China, Hefei, China
²Medical Imaging Department, Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, China
³Department of Radiology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China
⁴Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Breast Tumor Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China

Purpose: To compare the performance of deep learning (DL) with that of radiomics analysis (RA) in predicting pathological complete response (pCR) to neoadjuvant chemotherapy (NAC) based on pretreatment dynamic contrast-enhanced MRI (DCE-MRI) in breast cancer.

Materials and Methods: This retrospective study included 356 breast cancer patients who underwent DCE-MRI before NAC and underwent surgery after NAC. Image features and kinetic parameters of tumors were derived from DCE-MRI. Molecular information was assessed based on immunohistochemistry results. The image-based RA and DL models were constructed by adding kinetic parameters or molecular information to image-only linear discriminant analysis (LDA) and convolutional neural network (CNN) models. The predictive performances of developed models were assessed by receiver operating characteristic (ROC) curve analysis and compared with the DeLong method.

Results: The overall pCR rate was 23.3% (83/356). The area under the ROC curve (AUROC) of the image-kinetic-molecular RA model was 0.781 [95% confidence interval (CI): 0.735, 0.828], which was higher than that of the image-kinetic RA model (0.629, 95% CI: 0.595, 0.663; P < 0.001) and comparable to that of the image-molecular RA model (0.755, 95% CI: 0.708, 0.802; P = 0.133). The AUROC of the image-kinetic-molecular DL model was 0.83 (95% CI: 0.816, 0.847), which was higher than those of the image-kinetic and image-molecular DL models (0.707, 95% CI: 0.654, 0.761 and 0.79, 95% CI: 0.768, 0.812, respectively; both P < 0.001) and higher than that of the image-kinetic-molecular RA model (0.781, 95% CI: 0.735, 0.828; P < 0.001).

Conclusions: The pretreatment DCE-MRI-based DL model is superior to the RA model in predicting pCR to NAC in breast cancer patients. The image-kinetic-molecular DL model has the best prediction performance.

Introduction

Breast cancer is the most commonly diagnosed cancer and the most common cause of cancer death among women worldwide (1). Neoadjuvant chemotherapy (NAC) is well established in the management of locally advanced breast cancer and of early-stage operable breast cancers of specific molecular subtypes (2). Pathologic complete response (pCR) is mainly used to evaluate the degree of regression after NAC, as pCR has been demonstrated to be associated with better survival (3). However, only 7%–38% of breast cancers achieve pCR (4). Thus, predicting pCR before NAC is imperative: it allows a timely switch to a personalized treatment strategy and spares patients with a low probability of pCR from unnecessary chemotherapy toxicity.

MRI has been proven to be the most accurate modality for measuring treatment response based on the change in tumor size or volume (5). Beyond morphologic criteria, kinetic parameters can be derived from dynamic contrast-enhanced MRI (DCE-MRI). These include quantitative parameters, e.g., K^trans (volume transfer constant), K_ep (reverse reflux rate constant), V_e (volume fraction of extravascular extracellular space), and V_p (volume fraction of plasma), and semiquantitative parameters, e.g., TTP (time to peak), MaxConc (maximum concentration), MaxSlope (maximal slope), and AUC (area under the curve). These parameters can reflect tumor microvascular function such as vascular density and permeability (6). It has been reported that a reduction in K^trans or K_ep after two cycles of NAC is associated with the response to NAC (7, 8). However, only a few studies with small sample sizes have evaluated the power of pretreatment kinetic parameters in predicting pCR, with a reported moderate predictive performance [area under the receiver operating characteristic curve (AUROC) = 0.56–0.66] (9, 10).

Recently, imaging-based machine learning approaches have been used to predict therapeutic response by quantifying the tumor heterogeneity and irregularity of tissue components (11). Radiomics analysis (RA) and deep learning (DL) are the two most popular machine learning approaches, which have immense capability to obtain minable data by evaluating tumor features of images (11–14). RA relies on a pipeline including extraction of numerous handcrafted imaging features, followed by feature selection and then machine learning-based classification (11). However, the performance of radiomics models derived from pretreatment DCE-MRI is limited in predicting pCR with an AUROC ranging from 0.568 to 0.79 (12, 15, 16). DL can automatically learn discriminative features directly from images without the necessity of feature predefinition (17). The AUROC of DL models developed from pretreatment DCE-MRI alone ranged from 0.553 to 0.7969 (13, 14). In addition, a recent study has shown that the convolutional neural network (CNN) model based on pretreatment DCE-MRI (AUROC = 0.7969) had better prediction performance than the CNN model based on posttreatment DCE-MRI (AUROC = 0.7737) (13). So far, there is a lack of head-to-head comparison of predictive performance between RA and DL models based on pretreatment DCE-MRI in predicting pCR to NAC. Furthermore, whether an integrative model, which incorporates tumor image features, kinetic parameters, and molecular biomarkers, could improve predictive performance remains to be determined.

In this study, women with breast cancer who received NAC were retrospectively included. The image features and kinetic parameters of tumors derived from pretreatment DCE-MRI and molecular information determined by immunohistochemistry (IHC) were used to develop prediction models. The purpose of our study was to determine whether the DL model is better than the RA model in predicting pCR to NAC in breast cancer patients based on pretreatment DCE-MRI and whether incorporating molecular biomarkers and kinetic parameters into image features can improve the predictive performance.

Materials and Methods

Study Population

This retrospective study was approved by the Ethics Committee of Sun Yat-sen Memorial Hospital, with a waiver of informed consent for all participants. In our institution, a total of 1,757 patients were diagnosed with primary invasive breast cancer between April 16, 2016, and April 30, 2020. The inclusion criteria were as follows: 1) an initial diagnosis of primary invasive breast cancer; 2) DCE-MRI performed before biopsy and within 1 week before NAC; 3) surgical excision of the tumor after NAC, whether or not pCR was achieved. The exclusion criteria were distant metastasis (n = 150), another malignant tumor (n = 16), surgery without NAC (n = 1,187), no treatment (n = 33), non-standard NAC treatment (n = 8), or tumor progression during NAC (n = 7). The patient enrollment pathway is shown in the CONSORT diagram (Figure 1). Finally, 356 patients were included for analysis. The entire cohort was split into independent training and validation datasets by 5-fold cross-validation (18). In each round, four folds (80% of the tumors) were used as the training dataset, and the remaining fold (20% of the tumors) was used as the validation dataset. The prediction probabilities from the five independent validation sets were pooled into a single set and used to evaluate model performance. The 5-fold cross-validation procedure is illustrated in Supplementary E1 and Supplementary Figure S1.
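
The pooled cross-validation scheme can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the shuffling seed, the helper names (`k_fold_indices`, `pooled_out_of_fold_predictions`), and the placeholder base-rate "model" are all hypothetical, and the source does not say whether the folds were stratified by pCR status.

```python
import random

def k_fold_indices(n_samples, n_folds=5, seed=0):
    """Shuffle sample indices and deal them into n_folds disjoint folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::n_folds] for i in range(n_folds)]

def pooled_out_of_fold_predictions(features, labels, fit, predict, n_folds=5, seed=0):
    """Train on four folds, predict the held-out fold, and pool all predictions."""
    folds = k_fold_indices(len(labels), n_folds, seed)
    pooled = [None] * len(labels)
    for held_out in folds:
        held_out_set = set(held_out)
        train = [i for i in range(len(labels)) if i not in held_out_set]
        model = fit([features[i] for i in train], [labels[i] for i in train])
        for i in held_out:
            pooled[i] = predict(model, features[i])
    return pooled

# Placeholder "model" (hypothetical): predicts the training-set pCR rate for every case.
def fit_base_rate(train_x, train_y):
    return sum(train_y) / len(train_y)

def predict_base_rate(model, x):
    return model

labels = [1] * 83 + [0] * 273        # 83 pCR and 273 non-pCR cases, as in the cohort
features = [[0.0] for _ in labels]   # dummy features, for illustration only
pooled = pooled_out_of_fold_predictions(features, labels, fit_base_rate, predict_base_rate)
fold_sizes = sorted(len(fold) for fold in k_fold_indices(len(labels)))
```

Each case receives exactly one prediction, made by a model that never saw it during training, so a single ROC curve can be computed over all 356 pooled probabilities.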

Figure 1 Flowchart of patient enrollment in the study. *Seven patients did not complete the established neoadjuvant chemotherapy program because of tumor progression, three patients did not have an operation, five HER2-positive patients did not receive trastuzumab plus pertuzumab treatment.

MRI Protocol

Breast MRI was performed on a 1.5T unit (Magnetom Avanto; Siemens Medical Solutions, Erlangen, Germany) with patients in the head-first prone position. The body coil was used as the transmitter, and a dedicated 8-channel phased-array breast coil (Siemens Medical Solutions, Erlangen, Germany) was used as the receiver. MRI sequences consisted of axial T2-weighted turbo spin-echo (TSE) with short tau inversion recovery (STIR) sequence; axial T1-weighted volume interpolated body examination (T1W-VIBE) with Dixon sequence, and axial diffusion-weighted imaging (DWI) with spectral attenuated inversion recovery (SPAIR) fat saturation with 2 b values (b = 0, 800 s/mm²) and axial DCE imaging. DCE images were acquired by using a 3D fat-suppressed T1W-VIBE sequence. The DCE acquisition consisted of 40–70 measurements with a temporal resolution of 8 s and a total of 5–7 min of imaging time. After two consecutive measurements, gadodiamide (Gd-DTPA-BMA) (Omniscan; GE Healthcare, Ireland) was administered via intravenous bolus injection at a dosage of 0.1 mmol/kg and a flow rate of 3.5 ml/s, followed by a 20-ml saline flush. Before DCE acquisition, multiple flip angle images (2°, 4°, 6°, 8°, 10°, and 12°) were obtained for the calculation of T1 maps using the same sequence and parameters except for the flip angle. The details of acquisition parameters of MRI pulse sequence are provided in Supplementary Table S1.

Neoadjuvant Chemotherapy Programs and Outcome

The diagnosis of all patients was established by a core needle biopsy of the primary tumor before NAC. The regimens of NAC, provided in Supplementary E2, were defined according to the National Comprehensive Cancer Network (NCCN) guideline (19). According to the Food and Drug Administration criteria (20), all patients underwent surgical resection of the tumors and sentinel lymph node biopsy (SLNB) or axillary lymph node dissection (ALND) after NAC. The resected tumors and lymph nodes were sampled for histologic examination to evaluate the chemotherapeutic response. pCR (ypT0/Tis-ypN0) was defined as the absence of residual invasive tumor in the breast and axillary lymph nodes on the operative specimen following NAC. In contrast, non-pCR was defined as residual invasive cancer in the breast or axillary nodes.

Kinetic Parameters and Prediction Model Building

DCE-MRI data were analyzed independently by two radiologists (ZC and CZ, with 10 and 8 years of experience in breast MRI, respectively) using specialized quantitative analysis software (Omni Kinetics, GE Healthcare). The kinetic parameters were calculated using the extended Tofts model. During measurement, the regions of interest (ROIs) were carefully drawn to cover the whole tumor. Necrotic or cystic areas of the lesions, if present, were excluded from the evaluation. The intraclass correlation coefficient (ICC) of kinetic parameters between the two readers was 0.834–0.977. Data from the two readers were averaged for analysis. Least absolute shrinkage and selection operator (LASSO) regression analysis was applied to select independent predictive kinetic parameters. The selected kinetic parameters were used to construct the kinetic-only RA model using a robust supervised classifier, linear discriminant analysis (LDA) (21), which classifies the response to NAC by searching for a linear combination of the independent predictive kinetic parameters. A multilayer perceptron (MLP) neural network (22) was employed to construct the kinetic-only DL model. The structure of the MLP neural network is shown in Supplementary Figure 2A.
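
For reference, the extended Tofts model is conventionally written as below. The notation is the standard textbook form rather than anything taken from the analysis software: C_t(t) is the contrast agent concentration in tissue, C_p(t) is the plasma concentration (arterial input function), and the rate constant k_ep corresponds to the K_ep reported in this study.

```latex
C_t(t) = v_p\, C_p(t) + K^{\mathrm{trans}} \int_0^t C_p(\tau)\, e^{-k_{ep}\,(t-\tau)}\, d\tau ,
\qquad k_{ep} = \frac{K^{\mathrm{trans}}}{v_e}
```

The first term is the intravascular contribution weighted by the plasma volume fraction, and the second describes leakage into, and reflux from, the extravascular extracellular space.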

Molecular Information and Prediction Model Building

Molecular information, including the status of hormone receptor [estrogen receptor (ER), progesterone receptor (PR)], human epidermal growth factor receptor 2 (HER2), and Ki67 expression, was recorded from IHC results. ER/PR negative was defined as <1% of tumor cells with positive nuclear staining and ER/PR positive as ≥1% of tumor cells with positive nuclear staining; the cutoff for Ki67 was 14%; tumors with IHC staining of 0 or 1 were defined as HER2 negative, whereas tumors that either showed 3+ IHC staining or had gene copy number >2.0 were considered HER2 positive (23). The molecular-only LDA and MLP models were constructed by using the molecular information as input. The structure of the MLP neural network is shown in Supplementary Figure 2B.
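
The thresholds above can be expressed as a small encoding step, sketched here under assumptions. The function names are hypothetical, counting a Ki67 value exactly at 14% as high is an assumption (the text gives only the cutoff), and the handling of IHC 2+ tumors without gene amplification is not stated in the text, so it is returned as undetermined.

```python
def hormone_receptor_positive(percent_stained):
    """ER or PR is positive when >=1% of tumor cells show nuclear staining."""
    return percent_stained >= 1.0

def ki67_high(percent_stained, cutoff=14.0):
    """Ki67 dichotomized at the 14% cutoff (treating >= as high is an assumption)."""
    return percent_stained >= cutoff

def her2_status(ihc_score, gene_copy_number=None):
    """Return True (positive), False (negative), or None (not covered by the stated rules).

    ihc_score: 0, 1, 2, or 3. gene_copy_number: amplification result, if available.
    """
    if ihc_score == 3:
        return True
    if gene_copy_number is not None and gene_copy_number > 2.0:
        return True
    if ihc_score in (0, 1):
        return False
    return None  # e.g. IHC 2+ without amplification: rule not given in the text

def encode_molecular(er_percent, pr_percent, her2_ihc, ki67_percent, gene_copy_number=None):
    """Binary feature vector [ER, PR, HER2, Ki67]; HER2 is None when undetermined."""
    her2 = her2_status(her2_ihc, gene_copy_number)
    return [
        int(hormone_receptor_positive(er_percent)),
        int(hormone_receptor_positive(pr_percent)),
        None if her2 is None else int(her2),
        int(ki67_high(ki67_percent)),
    ]

example = encode_molecular(er_percent=0.0, pr_percent=0.5, her2_ihc=3, ki67_percent=30.0)
```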

Radiomics Analysis and Image-Based Radiomics Analysis Prediction Model Building

For RA, the tumors were segmented on DCE-MRI images obtained 88 s after the beginning of the contrast agent injection, as the clinical breast DCE-MRI guideline indicates peak enhancement and obvious conspicuity at this time point in most breast cancers (24). Tumor segmentation was performed using ITK-SNAP software (https://www.itksnap.org) by one radiologist (ZC, with 10 years of experience in breast MRI) who was blinded to the clinical and histopathologic results. Tumors were segmented on a section-by-section basis until the whole tumor volume was captured and a three-dimensional ROI was acquired. A second radiologist (JS, with 21 years of experience in breast MRI) reviewed all the delineations to ensure correct segmentation. The segmented images were processed by using open-source Python 3.7 (https://www.python.org) and the PyRadiomics toolkit to extract 851 radiomics features, including image intensity statistical, shape, texture, and wavelet features (Supplementary Table S2). A coarse-to-fine feature selection strategy was applied to reduce the dimensionality and avoid overfitting. Redundant features were first removed according to the Spearman correlation coefficient, and then the optimal feature subsets (Supplementary Table S3) were selected using LASSO regression. Prediction models based on the optimal image features were built with five machine learning classifiers [i.e., LDA, support vector machine (SVM), random forest (RF), AdaBoost, and Naive Bayes] to compare their performance in predicting pCR. The best-performing classifier was then used to build the image-only and integrative image-based RA models.
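
The coarse step of this feature selection, removing redundant features by Spearman correlation, might look like the sketch below. It is illustrative only: the 0.9 threshold, the greedy keep-first order, and the feature names are assumptions, since the text does not report them, and the subsequent LASSO step is not shown.

```python
def ranks(values):
    """Average ranks (1-based); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5

def spearman(x, y):
    """Spearman's rho is the Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def drop_redundant(columns, threshold=0.9):
    """Greedy filter: keep a feature only if |rho| with every kept feature is below threshold."""
    kept = []
    for name, values in columns.items():
        if all(abs(spearman(values, columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept

# Toy feature table with hypothetical names; one column per feature, one value per tumor.
features = {
    "volume": [1.0, 2.0, 3.0, 4.0, 5.0],
    "volume_cubed": [1.0, 8.0, 27.0, 64.0, 125.0],  # monotone in volume, so rho = 1
    "entropy": [3.0, 1.0, 4.0, 1.5, 2.0],
}
selected = drop_redundant(features)
```

Because Spearman correlation works on ranks, it flags any monotone relationship as redundant, not only linear ones, which suits radiomics features that are often nonlinear transforms of one another.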

The integrative image-based RA model was further developed by incorporating kinetic parameters (image-kinetic RA model), molecular information (image-molecular RA model), or both (image-kinetic-molecular RA model) into the image-only model. The optimal feature subsets of integrative image-based RA models are shown in Supplementary Tables S4–S6. The workflow for building RA predictive models is shown in Figure 2. All the RA models were constructed by using Matlab R2018b (MathWorks, Natick, MA, USA).

Figure 2 The workflow for building radiomics analysis-based predictive models.

Deep Learning Analysis and Image-Based Deep Learning Prediction Model Building

For DL analysis, a rectangular box of 128 × 128 × 3 pixels in size was used to crop three consecutive slices showing the maximum cross-sectional area of the tumor as input. To ensure comparability of the image signal intensity across patients, image intensity was normalized to a fixed range of 0–1. Random rotation, flip, and translation were used for data augmentation to alleviate the possible overfitting in the training procedure of model development. The image features were extracted by using a deep residual neural network, ResNeXt50 (25), pretrained on a large-scale, well-annotated ImageNet dataset to automatically learn discriminative image features, as illustrated in Supplementary E3 and Supplementary Figure S3. The whole DL structure contained a ResNeXt50 CNN and three fully connected layers, with the probability of pCR as output to build the image-only CNN model. Adam optimizer was used to train all DL models with a learning rate of 0.0001 and a batch size of 32. The triplet loss procedure was introduced to extract more discriminative features using the output of ResNeXt50, and the cross-entropy was introduced as classification loss using the final output of the fully connected layer. Details of the loss function are provided in Supplementary E4.
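
The input preparation described here (three consecutive slices around the largest tumor cross-section, intensities scaled to 0–1) can be sketched as follows. Centring the window on the single largest slice and using min-max scaling per crop are assumptions; the text states only the outcome, and the function names are hypothetical.

```python
def three_slice_window(mask_areas):
    """Indices of three consecutive slices centred on the largest tumor cross-section.

    mask_areas: tumor area (in pixels) on each slice of the volume.
    """
    if len(mask_areas) < 3:
        raise ValueError("need at least three slices")
    centre = max(range(len(mask_areas)), key=lambda i: mask_areas[i])
    # Shift the centre inward if the largest slice is the first or last one.
    centre = min(max(centre, 1), len(mask_areas) - 2)
    return [centre - 1, centre, centre + 1]

def min_max_normalize(pixels):
    """Linearly rescale intensities to the range 0-1 (all zeros for a constant image)."""
    lo, hi = min(pixels), max(pixels)
    if hi == lo:
        return [0.0 for _ in pixels]
    return [(p - lo) / (hi - lo) for p in pixels]

areas = [0, 12, 40, 95, 60, 8]                    # toy per-slice tumor areas
window = three_slice_window(areas)                # slices around index 3
normalized = min_max_normalize([100, 300, 500])   # toy intensities
```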

The integrative image-based DL model was further developed by adding kinetic parameters (image-kinetic DL model), molecular information (image-molecular DL model), or both (image-kinetic-molecular DL model) to the CNN of the image-only model. The kinetic and molecular information was incorporated in the first fully connected layer of the DL models. The framework for building DL predictive models is shown in Figure 3. All the DL programs were implemented in PyTorch (https://pytorch.org) on an Intel Core i7-7700K processor (Intel, Santa Clara, CA, USA) and an Nvidia RTX 2080 Ti GPU with 11 GB of memory (Nvidia, Santa Clara, CA, USA).
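
One plausible reading of incorporating non-image data in the first fully connected layer is simple concatenation, sketched below in plain Python rather than PyTorch so that it runs without dependencies. Concatenation itself, the hidden width, the random weights, and all input values are assumptions for illustration; the dimensions follow the article (1,000 CNN features, three selected kinetic parameters, four molecular markers).

```python
import random

IMAGE_DIM, KINETIC_DIM, MOLECULAR_DIM = 1000, 3, 4
HIDDEN_DIM = 8  # hypothetical width of the first fully connected layer

def fuse(image_features, kinetic, molecular):
    """Concatenate the three inputs into one vector for the first FC layer."""
    return list(image_features) + list(kinetic) + list(molecular)

def dense_relu(inputs, weights, biases):
    """One fully connected layer followed by a ReLU activation."""
    return [max(0.0, sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]

rng = random.Random(0)
fused_dim = IMAGE_DIM + KINETIC_DIM + MOLECULAR_DIM
weights = [[rng.uniform(-0.01, 0.01) for _ in range(fused_dim)] for _ in range(HIDDEN_DIM)]
biases = [0.0] * HIDDEN_DIM

image_features = [rng.random() for _ in range(IMAGE_DIM)]  # stand-in for the CNN output
kinetic = [0.25, 0.60, 0.10]       # placeholder values for K^trans, K_ep, MaxSlope
molecular = [0.0, 0.0, 1.0, 1.0]   # placeholder ER, PR, HER2, Ki67 status flags
fused = fuse(image_features, kinetic, molecular)
hidden = dense_relu(fused, weights, biases)
```

In a real network the fused layer would be trained jointly with the CNN; with only seven non-image inputs against 1,000 image features, scaling of the tabular inputs is likely to matter.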

Figure 3 The framework for building deep learning-based predictive models.

Statistical Analysis

Data were expressed as mean ± standard deviation for continuous variables, and categorical variables were summarized as frequencies and percentages. The differences in age, molecular information, histopathologic types, tumor number type, clinical T stage, clinical N stage, clinical TNM stage, and treatments between pCR and non-pCR groups were compared by χ² or Wilcoxon rank-sum tests as appropriate. The inter-rater agreement of kinetic parameter evaluation was assessed by using the ICC. An ICC value >0.75 indicates good to excellent agreement. The predictive performance of the models was assessed by ROC curve analysis. The sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy of the models were calculated based on a cutoff value determined by the maximum Youden index, and their confidence intervals were calculated by bootstrap analysis with 10,000 resamples. The DeLong method was used to compare AUROCs between models. A two-sided P value <0.05 indicated statistical significance. All statistical analyses were performed by using SPSS software (version 21; SPSS, Chicago, IL, USA) and MedCalc software (version 18.9.1; MedCalc, Ostend, Belgium).
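
The ROC-based metrics can be illustrated with a short sketch. The study used SPSS and MedCalc, so this is not the authors' procedure: the function names and toy data are hypothetical, AUROC is computed with the rank-based (Mann-Whitney) formulation, and the bootstrap confidence intervals and DeLong test are omitted.

```python
def auroc(labels, scores):
    """AUROC as the probability that a positive case outscores a negative one (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def confusion(labels, scores, cutoff):
    """Counts (tp, fn, fp, tn) when 'score >= cutoff' is called positive."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= cutoff)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < cutoff)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= cutoff)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < cutoff)
    return tp, fn, fp, tn

def youden_cutoff(labels, scores):
    """Cutoff maximizing J = sensitivity + specificity - 1 (lowest cutoff wins ties)."""
    best_cutoff, best_j = None, -1.0
    for cutoff in sorted(set(scores)):
        tp, fn, fp, tn = confusion(labels, scores, cutoff)
        j = tp / (tp + fn) + tn / (tn + fp) - 1.0
        if j > best_j:
            best_cutoff, best_j = cutoff, j
    return best_cutoff

def metrics_at(labels, scores, cutoff):
    """Sensitivity, specificity, PPV, NPV, and accuracy at a given cutoff."""
    tp, fn, fp, tn = confusion(labels, scores, cutoff)
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp) if tp + fp else float("nan"),
        "npv": tn / (tn + fn) if tn + fn else float("nan"),
        "accuracy": (tp + tn) / len(labels),
    }

# Toy data: 1 = pCR, 0 = non-pCR, with predicted pCR probabilities.
labels = [0, 0, 0, 0, 1, 0, 1, 1]
scores = [0.10, 0.20, 0.30, 0.35, 0.40, 0.60, 0.70, 0.90]
area = auroc(labels, scores)
cutoff = youden_cutoff(labels, scores)
summary = metrics_at(labels, scores, cutoff)
```

Because the cutoff is chosen on the same data used to report the metrics, the resulting sensitivity and specificity are optimistic, which is one reason the bootstrap intervals matter.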

Results

Clinicopathologic Characteristics

A total of 356 female patients (mean age, 46.9 ± 9.4 years) were included in this study. The clinicopathologic characteristics are shown in Table 1. In total, 83 patients (23.3%) achieved pCR (pCR group), while the remaining 273 patients (76.7%) did not (non-pCR group). The pCR group had a higher prevalence of ER-negative, PR-negative, and HER2-positive tumors than the non-pCR group (all P < 0.001). There was no significant difference in age, Ki67, histological type, tumor number type, clinical T stage, clinical N stage, clinical TNM stage, breast surgery, or axillary surgery between the two groups (all P > 0.05).

Table 1 Clinicopathologic characteristics of patients in the non-pCR and pCR groups.

Image-, Kinetic-, and Molecular-Only Prediction Models

LDA was the most robust of the classifiers evaluated (Supplementary Table S7). The image-only LDA model had 12 image features selected by LASSO regression (Supplementary Table S3). The image-only CNN model had 1,000 image features extracted by ResNeXt50. K^trans, K_ep, and MaxSlope were independent predictors and were included in the kinetic-only LDA and MLP models. The AUROC, sensitivity, specificity, PPV, NPV, accuracy, and corresponding 95% CIs of the image-, kinetic-, and molecular-only models are shown in Table 2 and Figures 4A, B. The AUROC of the molecular-only LDA model was 0.744, which was higher than that of the kinetic-only LDA model (0.682, P = 0.012) and image-only LDA model (0.55, P < 0.001). The AUROC of the molecular-only MLP model was 0.752, which was higher than that of the kinetic-only MLP model (0.652, P = 0.007) and image-only CNN model (0.554, P < 0.001). The AUROC of the kinetic-only LDA model (0.682) was higher than that of the kinetic-only MLP model (0.652, P = 0.008). There was no significant difference between the image-only LDA and image-only CNN models (AUROC = 0.55 and 0.554, P = 0.208) or between the molecular-only LDA and molecular-only MLP models (AUROC = 0.744 and 0.752, P = 0.33).

Table 2 Performances of the image-, kinetic-, and molecular-only LDA and DL prediction models.

Figure 4 Receiver operating characteristic (ROC) curves of the image-, kinetic-, and molecular-only linear discriminant analysis (LDA) (A) and deep learning (DL) (B) models.

Integrative Image-Based Radiomics Analysis and Deep Learning Models

The AUROC, sensitivity, specificity, PPV, NPV, accuracy, and corresponding 95% CIs of the integrative image-based RA and DL models are shown in Table 3 and Figures 5A, B. The AUROC of the image-kinetic-molecular RA model was 0.781, which was higher than that of the image-kinetic RA model (0.629, P < 0.001) but did not differ significantly from that of the image-molecular RA model (0.755, P = 0.118). The AUROC of the image-kinetic-molecular DL model was 0.832, which was higher than those of the image-kinetic and image-molecular DL models (0.707 and 0.79, respectively; both P < 0.001). The heatmaps (Figure 6) generated from ResNeXt50 with the Grad-CAM algorithm (26) highlighted the image locations that were crucial in generating the output.

Table 3 Performances of the integrative image-based RA and DL models.

Figure 5 Receiver operating characteristic (ROC) curves of the integrative image-based radiomics analysis (RA) (A) and deep learning (DL) (B) models.

Figure 6 Dynamic contrast-enhanced magnetic resonance (DCE-MR) images and feature heatmaps generated from ResNeXt50 in pathologic complete response (pCR) and non-pCR patients. The scaled weights of deep learning features are represented by the color bar. Colors closer to red indicate a greater weight and more attention from the model. (A, D) A 41-year-old woman with a hormone receptor (HR)-positive/human epidermal growth factor receptor 2 (HER2)-negative invasive lobular carcinoma in the right breast who did not achieve pCR following 6 cycles of neoadjuvant chemotherapy (NAC). (B, E) A 53-year-old woman with a triple-negative breast cancer (TNBC), invasive ductal carcinoma in the right breast, who achieved pCR following 8 cycles of NAC. (C, F) A 59-year-old woman with a HER2-positive invasive ductal carcinoma in the right breast who achieved pCR following 8 cycles of NAC.

Comparison Between Integrative Image-Based Radiomics Analysis and Deep Learning Models

The AUROCs of the image-kinetic, image-molecular, and image-kinetic-molecular DL models (0.707, 0.79, and 0.83, respectively) were significantly higher than those of the corresponding image-kinetic, image-molecular, and image-kinetic-molecular RA models (0.629, 0.755, and 0.781, respectively; all P < 0.001). The image-kinetic-molecular DL model had a significantly higher AUROC than the other integrative models (Table 3).

Discussion

Our results showed that both the molecular-only LDA and MLP models had better prediction performance than the kinetic-only LDA and MLP models and the image-only LDA and CNN models. The integrative image-kinetic-molecular RA and DL models significantly improved predictive performance. Moreover, the image-kinetic-molecular DL model had the best performance (AUROC, 0.83) in predicting pCR before NAC in breast cancer patients.

Conventionally, tumor size is used to assess the effect of NAC; however, baseline tumor size cannot predict pCR (7, 10). It has been shown that molecular biomarkers are correlated with NAC sensitivity in breast cancer (27). For example, HR negativity and HER2 positivity were associated with higher pCR rates [odds ratio (OR) = 0.497 and 1.833, respectively] (28). The IHC4 score, which combines ER, PR, HER2, and Ki67 expression levels, was associated with the pCR rate; the lower the IHC4 score, the higher the pCR rate in ER-positive breast cancer patients (AUROC = 0.613) (29). Our results showed that the molecular-only LDA and MLP models achieved AUROCs of 0.744 and 0.752, respectively, higher than those of the kinetic-only and image-only predictive models. However, molecular information is acquired via invasive needle biopsy and cannot reflect certain pathophysiological characteristics of tumors, such as microvascular density and permeability and tumor heterogeneity, which are known to be relevant to the sensitivity to NAC in breast cancer (15, 30).

The kinetic parameters can reflect the pathophysiological microvascular characteristics of tumors (6, 31). Previous studies (7–10) with small sample sizes showed that pretreatment K^trans, K_ep, or V_e, or their change after two cycles of NAC, could predict pCR, but with a varying AUROC (0.658–0.93). More importantly, a metric capable of predicting pCR before NAC is more desirable in clinical settings. Identifying breast cancer patients who can truly benefit from NAC is crucial for sparing patients unnecessary toxicity and for optimally selecting patients for endocrine or targeted therapy vs. chemotherapy. Whether the pretreatment value of K^trans, K_ep, or V_e could predict pCR remains to be determined. Our study showed that the kinetic-only LDA and MLP models built on pretreatment DCE-MRI achieved AUROCs of 0.682 and 0.652, respectively, comparable to those reported for the change in K^trans, K_ep, or V_e after two cycles of NAC (9, 10).

Breast cancer is a highly heterogeneous disease. The prediction performance of the molecular-only and kinetic-only models was suboptimal, and the highest AUROC, that of the molecular-only MLP model, was only 0.752 in our study. The image features extracted from DCE-MRI could reflect spatial heterogeneity, including the volumetric distribution of microvascular density and the extracellular compartment (32, 33). The image-only LDA and CNN models based on image features derived from pretreatment DCE-MRI were inadequate for predicting pCR (AUROC, 0.55 and 0.554). In theory, adding kinetic parameters or molecular information to the image-only model may improve the prediction of pCR to NAC. However, the performance of the image-kinetic and image-molecular RA models (AUROC, 0.629 and 0.755) and DL models (AUROC, 0.707 and 0.79) remained unsatisfactory. The integrative RA and DL models that included image features, kinetic parameters, and molecular information outperformed these counterparts in predicting pCR to NAC, with AUROCs of 0.781 and 0.83, possibly because they capture tumor heterogeneity more comprehensively. Previous studies (12, 14) have also shown that the prediction performance of the RA or DL model based on pretreatment MRI in predicting pCR in breast cancer patients could be improved by combining it with molecular information.

Notably, our results showed that the prediction performance of the integrative DL models, including the image-kinetic, image-molecular, and image-kinetic-molecular DL models, was higher than that of the corresponding RA models. The image-kinetic-molecular DL model achieved the best performance (AUROC, 0.83) in predicting pCR before NAC. The most crucial aspect of DL, which departs significantly from radiomics classifiers, is that multiple deep layers of perceptrons capture low- to high-level image features that are not designed by human engineers but are learned through representation learning (11). Previous studies have also reported that the performance of DL is better than that of RA in breast lesion discrimination (17), axillary lymph node metastasis prediction (34), and esophageal cancer treatment prediction (35). In addition, unlike the radiomics feature extraction procedure, DL feature extraction only requires a bounding box of fixed size to be placed on the tumor region, which improves efficiency and offers greater reliability and reproducibility. For RA, manual image segmentation is time-consuming and labor-intensive, and automatic and semiautomatic segmentation is less accurate for lesions with low enhancement or indistinct borders (e.g., diffuse non-mass enhancement) and for lesions within moderate to marked background parenchymal enhancement (BPE) (36, 37). Taken together, the pretreatment DCE-MRI-based DL model in our study is clinically more favorable than the RA model for pretreatment prediction of pCR in breast cancer patients.

Our study has several limitations. First, RA or DL approaches based on T2WI or DWI were not used to develop a prediction model. T2WI cannot always clearly delineate the exact border of breast cancer, especially in patients with dense breasts (38). In addition, DWI is easily affected by fat suppression and motion artifacts, which likely cause low reproducibility of ADC maps and ADC values (39). Previous studies have shown that RA or DL models based on T2WI, DWI, or ADC alone have relatively poor predictive ability (12, 16). Second, this was a retrospective, single-center study, which may have caused selection bias. Third, the heterogeneous nature of molecular subtypes in breast cancer led to different NAC regimens and pCR probabilities, although this reflects the reality of clinical practice. Further investigation with larger, multicenter datasets is warranted to determine the generalizability of our pretreatment DCE-MRI-based DL prediction model.

In conclusion, our study showed that the integrative image-based DL models are superior to the image-based RA models. The image-kinetic-molecular DL model achieved the best performance in predicting pCR to NAC in breast cancer patients.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics Statement

The studies involving human participants were reviewed and approved by Sun Yat-sen Memorial Hospital (Sun Yat-sen University, Guangzhou, China). The ethics committee waived the requirement of written informed consent for participation.

Author Contributions

ZC and YP: guarantor of integrity of the entire study, study concepts, and design. ZC, YP, CG, CZ, XZ, and ZW: clinical studies and literature research. ZC, YP, and YY: statistical analysis. JS, JZ, and XY: article editing. All authors contributed to the article and approved the submitted version.

Funding

This study was funded by the Key Areas Research and Development Program of Guangdong (Grant No. 2019B020235001) for JS, National Natural Science Foundation of China (Grant No. U1801681) for JS, Guangdong Province Universities and Colleges Pearl River Scholar Funded Scheme (2017) for JS, and Suzhou Science and Technology Bureau under Grant (SJC2021023) for JZ.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.846775/full#supplementary-material

References

1. Torre LA, Islami F, Siegel RL, Ward EM, Jemal A. Global Cancer in Women: Burden and Trends. Cancer Epidemiol Biomarkers Prev (2017) 26:444–57. doi: 10.1158/1055-9965.EPI-16-0858

2. Asselain B, Barlow W, Bartlett J, Bergh J, Bergsten-Nordström E, Bliss J, et al. Long-Term Outcomes for Neoadjuvant Versus Adjuvant Chemotherapy in Early Breast Cancer: Meta-Analysis of Individual Patient Data From Ten Randomised Trials. Lancet Oncol (2018) 19:27–39. doi: 10.1016/S1470-2045(17)30777-5

3. Kaufmann M, von Minckwitz G, Bear HD, Buzdar A, McGale P, Bonnefoi H, et al. Recommendations From an International Expert Panel on the Use of Neoadjuvant (Primary) Systemic Treatment of Operable Breast Cancer: New Perspectives 2006. Ann Oncol (2007) 18:1927–34. doi: 10.1093/annonc/mdm201

4. Zardavas D, Irrthum A, Swanton C, Piccart M. Clinical Management of Breast Cancer Heterogeneity. Nat Rev Clin Oncol (2015) 12:381–94. doi: 10.1038/nrclinonc.2015.73

5. Le-Petross HT, Lim B. Role of MR Imaging in Neoadjuvant Therapy Monitoring. Magn Reson Imaging Clin N Am (2018) 26:207–20. doi: 10.1016/j.mric.2017.12.011

6. Yi B, Kang DK, Yoon D, Jung YS, Kim KS, Yim H, et al. Is There Any Correlation Between Model-Based Perfusion Parameters and Model-Free Parameters of Time-Signal Intensity Curve on Dynamic Contrast Enhanced MRI in Breast Cancer Patients? Eur Radiol (2014) 24:1089–96. doi: 10.1007/s00330-014-3100-6

7. Yu Y, Jiang Q, Miao Y, Li J, Bao S, Wang H, et al. Quantitative Analysis of Clinical Dynamic Contrast-Enhanced MR Imaging for Evaluating Treatment Response in Human Breast Cancer. Radiology (2010) 257:47–55. doi: 10.1148/radiol.10092169

8. Ah-See ML, Makris A, Taylor NJ, Harrison M, Richman PI, Burcombe RJ, et al. Early Changes in Functional Dynamic Magnetic Resonance Imaging Predict for Pathologic Response to Neoadjuvant Chemotherapy in Primary Breast Cancer. Clin Cancer Res (2008) 14:6580–9. doi: 10.1158/1078-0432.CCR-07-4310

9. Pickles MD, Lowry M, Manton DJ, Gibbs P, Turnbull LW. Role of Dynamic Contrast Enhanced MRI in Monitoring Early Response of Locally Advanced Breast Cancer to Neoadjuvant Chemotherapy. Breast Cancer Res Treat (2005) 91:1–10. doi: 10.1007/s10549-004-5819-2

10. Drisis S, Metens T, Ignatiadis M, Stathopoulos K, Chao SL, Lemort M. Quantitative DCE-MRI for Prediction of Pathological Complete Response Following Neoadjuvant Treatment for Locally Advanced Breast Cancer: The Impact of Breast Cancer Subtypes on the Diagnostic Accuracy. Eur Radiol (2016) 26:1474–84. doi: 10.1007/s00330-015-3948-0

11. Sheth D, Giger ML. Artificial Intelligence in the Interpretation of Breast Cancer on MRI. J Magn Reson Imaging (2020) 51:1310–24. doi: 10.1002/jmri.26878

12. Liu Z, Li Z, Qu J, Zhang R, Zhou X, Li L, et al. Radiomics of Multiparametric MRI for Pretreatment Prediction of Pathologic Complete Response to Neoadjuvant Chemotherapy in Breast Cancer: A Multicenter Study. Clin Cancer Res (2019) 25:3538–47. doi: 10.1158/1078-0432.CCR-18-3190

13. El Adoui M, Drisis S, Benjelloun M. Multi-Input Deep Learning Architecture for Predicting Breast Tumor Response to Chemotherapy Using Quantitative MR Images. Int J Comput Assist Radiol Surg (2020) 15:1491–500. doi: 10.1007/s11548-020-02209-9

14. Qu YH, Zhu HT, Cao K, Li XT, Ye M, Sun YS. Prediction of Pathological Complete Response to Neoadjuvant Chemotherapy in Breast Cancer Using a Deep Learning (DL) Method. Thorac Cancer (2020) 11:651–8. doi: 10.1111/1759-7714.13309

15. Fan M, Chen H, You C, Liu L, Gu Y, Peng W, et al. Radiomics of Tumor Heterogeneity in Longitudinal Dynamic Contrast-Enhanced Magnetic Resonance Imaging for Predicting Response to Neoadjuvant Chemotherapy in Breast Cancer. Front Mol Biosci (2021) 8:622219. doi: 10.3389/fmolb.2021.622219

16. Eun NL, Kang D, Son EJ, Park JS, Youk JH, Kim JA, et al. Texture Analysis With 3.0-T MRI for Association of Response to Neoadjuvant Chemotherapy in Breast Cancer. Radiology (2020) 294:31–41. doi: 10.1148/radiol.2019182718

17. Truhn D, Schrading S, Haarburger C, Schneider H, Merhof D, Kuhl C. Radiomic Versus Convolutional Neural Networks Analysis for Classification of Contrast-Enhancing Lesions at Multiparametric Breast MRI. Radiology (2019) 290:290–7. doi: 10.1148/radiol.2018181352

18. Racz A, Bajusz D, Heberger K. Modelling Methods and Cross-Validation Variants in QSAR: A Multi-Level Analysis. SAR QSAR Environ Res (2018) 29:661–74. doi: 10.1080/1062936X.2018.1505778

19. National Comprehensive Cancer Network. Invasive Breast Cancer (2020). Available at: https://www.nccn.org/patients/guidelines/content/PDF/breast-invasive-patient.pdf (Accessed August 18, 2021).

20. US Department of Health and Human Services Food and Drug Administration. Pathological Complete Response in Neoadjuvant Treatment of High-Risk Early-Stage Breast Cancer: Use as an Endpoint to Support Accelerated Approval Guidance for Industry (2020). Available at: https://www.fda.gov/regulatory-information/search-fda-guidance-documents/pathological-complete-response-neoadjuvant-treatment-high-risk-early-stage-breast-cancer-use (Accessed August 18, 2021).

21. Rani A, Kumar S, Micheloni C, Foresti GL. Incorporating Linear Discriminant Analysis in Neural Tree for Multidimensional Splitting. Appl Soft Comput (2013) 13:4219–28. doi: 10.1016/j.asoc.2013.06.007

22. Kalafi EY, Nor NAM, Taib NA, Ganggayah MD, Town C, Dhillon SK. Machine Learning and Deep Learning Approaches in Breast Cancer Survival Prediction Using Clinical Data. Folia Biologica (2019) 65:212–20.

23. Cancer Genome Atlas Network. Comprehensive Molecular Portraits of Human Breast Tumours. Nature (2012) 490:61–70. doi: 10.1038/nature11412

24. Mann RM, Cho N, Moy L. Breast MRI: State of the Art. Radiology (2019) 292:520–36. doi: 10.1148/radiol.2019182947

25. Xie S, Girshick R, Dollar P, Tu Z, He K. Aggregated Residual Transformations for Deep Neural Networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, Hawaii, United States. (2017). pp. 1492–500. doi: 10.1109/CVPR.2017.634

26. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization. Int J Comput Vis (2019) 128:336–59. doi: 10.1007/s11263-019-01228-7

27. Rouzier R, Perou CM, Symmans WF, Ibrahim N, Cristofanilli M, Anderson K, et al. Breast Cancer Molecular Subtypes Respond Differently to Preoperative Chemotherapy. Clin Cancer Res (2005) 11:5678–85. doi: 10.1158/1078-0432.CCR-04-2421

28. Pu S, Wang K, Liu Y, Liao X, Chen H, He J, et al. Nomogram-Derived Prediction of Pathologic Complete Response (pCR) in Breast Cancer Patients Treated With Neoadjuvant Chemotherapy (NCT). BMC Cancer (2020) 20:1120. doi: 10.1186/s12885-020-07621-7

29. Tan W, Luo W, Jia W, Liang G, Xie X, Zheng W, et al. A Combination of Nottingham Prognostic Index and IHC4 Score Predicts Pathological Complete Response of Neoadjuvant Chemotherapy in Estrogen Receptor Positive Breast Cancer. Oncotarget (2016) 7:87312–22. doi: 10.18632/oncotarget.13549

30. Harris L, Fritsche H, Mennel R, Norton L, Ravdin P, Taube S, et al. American Society of Clinical Oncology 2007 Update of Recommendations for the Use of Tumor Markers in Breast Cancer. J Clin Oncol (2007) 25:5287–312. doi: 10.1200/JCO.2007.14.2364

31. Tofts PS, Brix G, Buckley DL, Evelhoch JL, Henderson E, Knopp MV, et al. Estimating Kinetic Parameters From Dynamic Contrast-Enhanced T1-Weighted MRI of a Diffusable Tracer: Standardized Quantities and Symbols. J Magn Reson Imaging (1999) 10:223–32. doi: 10.1002/(SICI)1522-2586(199909)10:3<223::AID-JMRI2>3.0.CO;2-S

32. Ravichandran K, Braman N, Janowczyk A, Madabhushi A. A Deep Learning Classifier for Prediction of Pathological Complete Response to Neoadjuvant Chemotherapy From Baseline Breast DCE-MRI. In: SPIE Medical Imaging, (2018). Houston, Texas, United States. doi: 10.1117/12.2294056

33. Reig B. Radiomics and Deep Learning Methods in Expanding the Use of Screening Breast MRI. Eur Radiol (2021) 31:5863–5. doi: 10.1007/s00330-021-08056-9

34. Sun Q, Lin X, Zhao Y, Li L, Yan K, Liang D, et al. Deep Learning vs. Radiomics for Predicting Axillary Lymph Node Metastasis of Breast Cancer Using Ultrasound Images: Don't Forget the Peritumoral Region. Front Oncol (2020) 10:53. doi: 10.3389/fonc.2020.00053

35. Hu Y, Xie C, Yang H, Ho JWK, Wen J, Han L, et al. Computed Tomography-Based Deep-Learning Prediction of Neoadjuvant Chemoradiotherapy Treatment Response in Esophageal Squamous Cell Carcinoma. Radiother Oncol (2021) 154:6–13. doi: 10.1016/j.radonc.2020.09.014

36. Chen W, Giger ML, Bick U. A Fuzzy C-Means (FCM)-Based Approach for Computerized Segmentation of Breast Lesions in Dynamic Contrast-Enhanced MR Images. Acad Radiol (2006) 13:63–72. doi: 10.1016/j.acra.2005.08.035

37. Ye DM, Wang HT, Yu T. The Application of Radiomics in Breast MRI: A Review. Technol Cancer Res Treat (2020) 19:1533033820916191. doi: 10.1177/1533033820916191

38. Romeo V, Picariello V, Pignata A, Mancusi V, Stanzione A, Cuocolo R, et al. Influence of Different Post-Contrast Time Points on Dynamic Contrast-Enhanced (DCE) MRI T Staging in Breast Cancer. Eur J Radiol (2020) 124:108819. doi: 10.1016/j.ejrad.2020.108819

39. Braithwaite AC, Dale BM, Boll DT, Merkle EM. Short- and Midterm Reproducibility of Apparent Diffusion Coefficient Measurements at 3.0-T Diffusion-Weighted Imaging of the Abdomen. Radiology (2009) 250:459–65. doi: 10.1148/radiol.2502080849

Keywords: breast cancer, neoadjuvant chemotherapy, dynamic contrast-enhanced magnetic resonance imaging, radiomics, deep learning

Citation: Peng Y, Cheng Z, Gong C, Zheng C, Zhang X, Wu Z, Yang Y, Yang X, Zheng J and Shen J (2022) Pretreatment DCE-MRI-Based Deep Learning Outperforms Radiomics Analysis in Predicting Pathologic Complete Response to Neoadjuvant Chemotherapy in Breast Cancer. Front. Oncol. 12:846775. doi: 10.3389/fonc.2022.846775

Received: 31 December 2021; Accepted: 26 January 2022;
Published: 10 March 2022.

Edited by:

Zhongxiang Ding, Zhejiang University, China

Reviewed by:

Ting Song, Third Affiliated Hospital of Guangzhou Medical University, China
Quan Zhou, Third Affiliated Hospital of Southern Medical University, China

Copyright © 2022 Peng, Cheng, Gong, Zheng, Zhang, Wu, Yang, Yang, Zheng and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jian Zheng, zhengj@sibet.ac.cn; Jun Shen, shenjun@mail.sysu.edu.cn

†These authors have contributed equally to this work and share first authorship
