AUTHOR=Sheng Weiyong , Xia Shouli , Wang Yaru , Yan Lizhao , Ke Songqing , Mellisa Evelyn , Gong Fen , Zheng Yun , Tang Tiansheng TITLE=Invasive ductal breast cancer molecular subtype prediction by MRI radiomic and clinical features based on machine learning JOURNAL=Frontiers in Oncology VOLUME=12 YEAR=2022 URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2022.964605 DOI=10.3389/fonc.2022.964605 ISSN=2234-943X ABSTRACT=Background

Most studies of molecular subtype prediction in breast cancer were mainly based on two-dimensional MRI images, the predictive value of three-dimensional volumetric features from dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) for predicting breast cancer molecular subtypes has not been thoroughly investigated. This study aimed to look into the role of features derived from DCE-MRI and how they could be combined with clinical data to predict invasive ductal breast cancer molecular subtypes.

Methods

From January 2019 to December 2021, 190 Chinese women with invasive ductal breast cancer were studied (32 triple-negative, 59 HER2-enriched, and 99 luminal lesions) in this institutional review board-approved retrospective cohort study. The image processing software extracted 1130 quantitative radiomic features from the segmented lesion area, including shape-based, first-order statistical, texture, and wavelet features. Three binary classifications of the subtypes were performed: triple-negative vs. non-triple-negative, HER2-overexpressed vs. non-HER2-overexpressed, and luminal (A + B) vs. non-luminal. For the classification, five machine learning methods (random forest, logistic regression, support vector machine, naïve Bayes, and eXtreme Gradient Boosting) were employed. The classifiers were chosen using the least absolute shrinkage and selection operator method. The area evaluated classification performance under the receiver operating characteristic curve, sensitivity, specificity, accuracy, F1-Score, false positive rate, precision, and geometric mean.

Results

EXtreme Gradient Boosting model showed the best performance in luminal and non-luminal groups, with AUC, sensitivity, specificity, accuracy, F1-Score, false positive rate, precision, and geometric mean of 0.8282, 0.7524, 0.6542, 0.6964, 0.6086, 0.3458, 0.8524 and 0.7016, respectively. Meanwhile, the random forest model showed the best performance in HER2-overexpressed and non-HER2-overexpressed groups, with AUC, sensitivity, specificity, accuracy, F1-Score, false positive rate, precision, and geometric mean of 0.8054, 0.2941, 0.9744, 0.7679, 0.4348, 0.0256, 0.8333 and 0.5353, respectively. Furthermore, eXtreme Gradient Boosting model showed the best performance in the triple-negative and non-triple-negative groups, with AUC, sensitivity, specificity, accuracy, F1-Score, false positive rate, precision, and geometric mean of 0.9031, 0.9362, 0.4444, 0.8571, 0.9167, 0.5556, 0.8980 and 0.6450.

Conclusion

Clinical data and three-dimension imaging features from DCE-MRI were identified as potential biomarkers for distinguishing between three molecular subtypes of invasive ductal carcinomas breast cancer. In the future, more extensive studies will be required to evaluate the findings.