Preoperative prediction of nodal status using clinical data and artificial intelligence derived mammogram features enabling abstention of sentinel lymph node biopsy in breast cancer

Rejmer, Cornelia; Dihge, Looket; Bendahl, Pär-Ola; Förnvik, Daniel; Dustler, Magnus; Rydén, Lisa

doi:10.3389/fonc.2024.1394448

ORIGINAL RESEARCH article

Front. Oncol., 10 July 2024

Sec. Surgical Oncology

Volume 14 - 2024 | https://doi.org/10.3389/fonc.2024.1394448

This article is part of the Research TopicAdvancing Patient-Centric Oncology: Non-Operative Management and Surgical De-Escalation in Cancer CareView all 10 articles

Preoperative prediction of nodal status using clinical data and artificial intelligence derived mammogram features enabling abstention of sentinel lymph node biopsy in breast cancer

Cornelia Rejmer^1*

Looket Dihge^1,2

Pär-Ola Bendahl³

Daniel Förnvik^4,5

Magnus Dustler^4,6

Lisa Rydén^1,7,8

¹Department of Clinical Sciences, Division of Surgery, Lund University, Lund, Sweden
²Department of Plastic and Reconstructive Surgery, Skåne University Hospital, Malmö, Sweden
³Department of Clinical Sciences, Division of Oncology, Lund University, Lund, Sweden
⁴Medical Radiation Physics, Department of Translational Medicine, Lund University, Malmö, Sweden
⁵Department of Hematology, Oncology and Radiations Physics, Skåne University Hospital, Lund, Sweden
⁶Diagnostic Radiology, Department of Translational Medicine, Lund University, Malmö, Sweden
⁷Department of Surgery, Skåne University Hospital, Malmö, Sweden
⁸Department of Clinical Medicine, Aarhus University, Aarhus, Denmark

Introduction: Patients with clinically node-negative breast cancer have a negative sentinel lymph node status (pN0) in approximately 75% of cases and the necessity of routine surgical nodal staging by sentinel lymph node biopsy (SLNB) has been questioned. Previous prediction models for pN0 have included postoperative variables, thus defeating their purpose to spare patients non-beneficial axillary surgery. We aimed to develop a preoperative prediction model for pN0 and to evaluate the contribution of mammographic breast density and mammogram features derived by artificial intelligence for de-escalation of SLNB.

Materials and methods: This retrospective cohort study included 755 women with primary breast cancer. Mammograms were analyzed by commercially available artificial intelligence and automated systems. The additional predictive value of features was evaluated using logistic regression models including preoperative clinical variables and radiological tumor size. The final model was internally validated using bootstrap and externally validated in a separate cohort. A nomogram for prediction of pN0 was developed. The correlation between pathological tumor size and the preoperative radiological tumor size was calculated.

Results: Radiological tumor size was the strongest predictor of pN0 and included in a preoperative prediction model displaying an area under the curve of 0.68 (95% confidence interval: 0.63–0.72) in internal validation and 0.64 (95% confidence interval: 0.59–0.69) in external validation. Although the addition of mammographic features did not improve discrimination, the prediction model provided a 21% SLNB reduction rate when a false negative rate of 10% was accepted, reflecting the accepted false negative rate of SLNB.

Conclusion: This study shows that the preoperatively available radiological tumor size might replace pathological tumor size as a key predictor in a preoperative prediction model for pN0. While the overall performance was not improved by mammographic features, one in five patients could be omitted from axillary surgery by applying the preoperative prediction model for nodal status. The nomogram visualizing the model could support preoperative patient-centered decision-making on the management of the axilla.

1 Introduction

Sentinel lymph node biopsy (SLNB) is the recommended surgical axillary staging method in patients with clinically node-negative breast cancer, although approximately 75–80% have non-malignant axillary lymph nodes in the definitive pathology report (1–4). Consequently, patients with negative sentinel lymph node status (pN0) do not benefit from SLNB. The American College of Surgeons Oncology Group Z0011(ACOSOG Z0011) study questioned the necessity of axillary lymph node dissection and showed that abstaining from ALND in patients with T1-T2 clinically node negative primary breast cancer with 1-2 sentinel lymph nodes containing metastases was non-inferior to ALND regarding overall survival. This raised the question of the necessity of SLNB. The randomized Sentinel Node vs Observation After Axillary Ultra-Sound (SOUND) trial recently showed that abstaining SLNB in patients with T1 tumors having breast-conserving surgery was non-inferior to SLNB regarding distance-free survival at five years (5). However, implementation of the findings from the SOUND trial is not applicable to all breast cancer patients. The ASCO guidelines already recommended abstaining from SLNB in 2021 for patients ≥ 70 years with a luminal subtype undergoing breast-conserving surgery and the recommended adjuvant endocrine therapy (2). There is an increasing awareness of the importance of the long-term effects of surgery on patient’s function and well-being. The Intergroup Sentinel Mamma study (INSEMA) evaluating invasive disease-free survival and morbidity after breast-conserving surgery with or without SLNB reported that morbidity was lower in the group without SLNB than in the group with SLNB at one, three, and 18 months postoperative (3). This warrants strategies for implementation of de-escalation of axillary surgery and methods for preoperative identification of patients for whom SLNB can be safely omitted.

Several clinicopathological models for prediction of axillary lymph node (ALN) and sentinel lymph node (SLN) status have been developed during the past decade (6–9). In 2019, Dihge et al. (10) developed an artificial neural network (ANN) model to predict pN0. The selected variables in the model are well known predictors and most were previously included in Bevilacqua et al.’s prediction model (7, 10). However, previous prediction models were developed using postoperatively available variables, defeating the purpose of a patient-centric preoperative decision tool for safe omission of SLNB. A commonly used key predictor for pN0, pathological tumor size, is a postoperative measure that could be replaced by radiological tumor size. Studies have indicated that primarily mammographic tumor size is similar to the postoperatively available pathologic tumor size, while other imaging modalities often over- or underestimate the tumor size (6, 7, 11). To our knowledge, a comparison of pathological tumor size and radiological tumor size has not previously been described in the setting of ALN status prediction.

Prediction models for ALN status using mammograms have been presented using presence of microcalcifications, breast density, and radiomic signatures, exclusively or in combination with clinicopathological variables, most of which were postoperatively obtained (12–15). In addition, several studies have investigated using other breast imaging modalities for nodal prediction, including ultrasound and contrast-enhanced mammography (16–21). To our knowledge, no prediction model for pN0 currently incorporates commercially available AI cancer detection features from mammograms and exclusively preoperatively available clinicopathological variables.

Thus, we aimed to evaluate non-operative nodal staging and the possibility to omit axillary surgery by developing a prediction model for pN0 using only preoperatively available data. The additional predictive value of mammographic variables extracted by a commercially available AI cancer detection system and an automated breast density assessment system in patients with clinically node-negative primary breast cancer is explored. A nomogram is developed as a preoperative decision-tool to enable a patient-centered approach to SLNB. Additionally, we aimed to evaluate radiological tumor size as a preoperative alternative to the postoperative pathological tumor size as a predictor in a preoperative prediction model for pN0.

2 Materials and methods

2.1 Study population

A total of 770 women diagnosed with primary breast cancer between January 2009 and December 2012 were prospectively included in a registry at the Department of Pathology at the Skåne University Hospital (Lund, Sweden). Patients with clinically node negative primary breast cancer undergoing primary breast surgery and SLNB were included as previously described by Dihge et al. (6, 10). Clinically node negative was defined as no palpable mass in the physical examination. All patients underwent SLNB as axillary staging, and if needed, axillary lymph node dissection. Another cohort including 586 patients from Skåne University Hospital (Malmö, Sweden in 2020) and Helsingborg Regional Hospital (Helsingborg, Sweden in 2019–2020) was used for external validation (22).

2.2 Clinicopathological data

Patient and tumor information was collected from patients’ electronic files and a pathology database, as described by Dihge et al. (6, 10) and Skarping et al. (22). The histological type was divided into two groups after variable selection: the first group included no special type and lobular, and the second group included other and mixed types (7). Estrogen receptor (ER) status, progesterone receptor (PR) status, HER2 status, and Ki67 percentage were analyzed and categorized according to guidelines (23, 24). Mode of tumor detection was divided into symptomatic presentation and by the national mammography screening program. Tumor localization was defined by location in the four quadrants and central, and after statistical variable selection, categorized as upper inner quadrant vs. other locations (7).

SLN status was categorized as negative or positive (pN0 or pN+). pN0 was defined as breast cancer without lymph node metastasis or with isolated tumor cells. pN+ was defined as ≥1 SLN with micrometastasis or macrometastasis, defined as >200 cells and/or cluster size of 0.2 – 2 mm, and cluster size >2 mm, respectively (25).

2.3 Mammographic images and image analysis systems

All available mammographic images from screening and diagnostic imaging were included in the analyses. A modified Breast Imaging Reporting and Data System malignancy score (1–5) was used for mammographic and ultrasound images, as they are part of the clinical routine work-up. For this study mammographic malignancy score (1–5), ultrasound malignancy score (1–5), the largest specified radiologic tumor size in mm, and laterality were collected from the Picture Archiving and Communication System (PACS). In cases of missing mammographic tumor size, size from ultrasound was used since both modalities are preoperative and included in the initial clinical work-up. Mammography has been shown to have a high accuracy when compared to the surgical specimen, while ultrasound tends to underestimate the tumor size (26).

Transpara (version 1.7.0, Screenpoint Medical, Nijmegen, the Netherlands), a breast cancer detection tool uses deep learning algorithms to detect suspicious soft tissue lesions and microcalcifications (calc) that may indicate breast cancer. Each suspicious region is assigned a score between 1 and 100. When used for screening mammography, Transpara sorts cases into ten different risk categories (1-10) based on the regional suspicion scores. It is calibrated to sort roughly equal numbers of cancers into each category, with a goal of 90%+ of cancers in category 10 (27). Several retrospective studies (28–31) have shown Transpara to be effective in increasing cancer detection and reducing workload in screening, and the prospective Mammography Screening with Artificial Intelligence (MASAI) trial has demonstrated it to be effective in a clinical screening setting (32). Additionally, Transpara has been reported to predict stage II breast cancer years before diagnosis, indicating detection of properties beyond the cancer diagnosis (33). For this study, the highest calc cluster score and soft tissue lesion score were extracted from Transpara. The scores were included in the set of candidate predictors as continuous variables (0–100) and dichotomized as presence of finding (0 vs 1–100). All available mammograms were included and were manually cross-checked for laterality and correct tumor localization in Transpara before data extraction.

LIBRA (version 1.0.5) is an automated breast density estimation algorithm, which analyzes images based on gray-level values and segments them into dense and non-dense areas, developed at the University of Pennsylvania (17). Gastounioti et al. (34) and Keller et al. (35), among others, have validated LIBRA as a breast density measurement system. In this study, dense area [cm2] and density [%] were extracted from LIBRA. Craniocaudal and mediolateral oblique projections were available for all patients and therefore included in the analyses. The mean values from LIBRA of the projections on the ipsilateral side were used as the contralateral side was not available for all patients. Moreover, several studies revealed an association between breast density and breast cancer as well as with ALN status (33–38).

2.4 Statistical analysis

Descriptive analyses were performed to explore the associations between clinical, pathological, and radiological variables, and SLN status using the Mann–Whitney U test and Chi-square test for continuous and categorical variables, respectively. Pathological and mammographic tumor size were compared using Pearson correlation and Bland-Altman analysis. Univariable logistic regression was used for radiological variables to predict pN0. The top ten variables included in the model published by Dihge et al. (10) were used as a framework and included in a multivariable logistic regression (MLR) analysis. In the published article, an ANN model was developed with cross-validation and compared with an MLR model. The performance of the simpler MLR model was found to be marginally inferior. Therefore, we proceeded with the MLR model (area under the receiver operating characteristics curve (AUC) 0.73) in the present study, hence referred to as the postoperative framework model (10). In this study, vascular invasion and multifocality were excluded due to clinical unavailability or poor preoperative quality. The postoperative variables, Ki67 and pathological tumor size, were exchanged for the preoperatively available variables, histological grade and radiological tumor size. Stepwise backward elimination MLR with a p-value threshold for removal of 0.157 was performed to obtain a clinical preoperative model. Radiological variables were evaluated as additional candidates to improve the clinical preoperative model, using stepwise backward elimination. The model stability was assessed by performing the model selection procedure in 1000 bootstrap samples as well as by five-fold cross-validation repeated ten times. Prediction models were compared using AUC and the Akaike information criterion (AIC). The SLNB reduction rate was calculated with a cut-off based on a 10% false negative rate (FNR), reflecting the clinically accepted FNR of SLNB (39, 40). In addition, SLNB reduction rate was calculated with a 20% FNR for comparison. Point estimates for the clinical preoperative prediction model were illustrated by a nomogram. The proposed prediction model was externally validated, temporally and geographically, in a separate cohort. The calibration in the validation cohort was evaluated using the Hosmer-Lemeshow approach. Briefly, the predicted probabilities of pN0 were plotted against the mean predicted probability of pN0 according to the model. Perfect calibration will hence respond to 10 plot symbols on a line with a 45-degree slope.

P-values were not corrected for multiple comparisons due to the explorative nature of the study. All p-values are two-sided and interpreted as level of evidence against the null hypothesis without reference to a cut-off for significance. Stata (StataCorp. 2021. Stata Statistical Software: Release 17. College Station, TX; StataCorp LLC) was used for all statistical analyses.

3 Results

Mammograms were identified in 755 of the 770 patients included in the study. All patients and images were included in the LIBRA subgroup. Transpara failed to analyze mammograms from three patients and 30 patients were excluded due to technical issues in PACS. Inconclusive cases, according to radiologists, and cases without a clear indication of tumor location in PACS were excluded due to the inability to cross-check the AI findings. The inclusion and exclusion of patients with annotated mammograms are presented in Figure 1. The AI assessment of tumor localization on mammograms was correct in 96.1% of the cases.

Figure 1

Figure 1 Flow chart of patient inclusion. At inclusion there were missing images (n=15) in PACS, additional images (n=30) were excluded due to technical issues. Abbreviations: Laboratory for Individualized Breast Radiodensity Assessment (LIBRA); Picture Archiving and Communication System (PACS).

The patient, tumor, and radiological characteristics of the primary cohort are presented in Table 1 and the external validation cohort in Supplementary Table 1. Patient and tumor characteristics were similar in the two cohorts, apart from the prevalence of pN0. In the primary cohort, 35% were pN+, while only 26% were pN+ in the external cohort. The patient and tumor characteristics that showed the strongest evidence for association with pN0 were pathological tumor size (p <0.001), mode of tumor detection (p <0.001), multifocality (p <0.001), vascular invasion (p <0.001), Ki67 (p <0.001), histological grade (p=0.007), age (p=0.027), and histological type (p=0.046). The radiological variables strongest associated with pN0 were radiological tumor size (p <0.001), and the highest soft tissue lesion score (p <0.001).

Table 1

Table 1 Patient, tumor, and radiological variables stratified by sentinel lymph node status.

A comparison of tumor size by pathological and mammographic assessment is presented in Supplementary Table 2. The agreement between tumor size variables was also evaluated in a Bland-Altman plot of differences vs. average (Figure 2). The mean pathological and radiological tumor sizes were 16.7 and 17.1 mm, respectively, and the Pearson correlation coefficient was 0.62.

Figure 2

Figure 2 A Bland-Altman plot illustrating the difference in mean values between the preoperative radiological tumor size and the pathological tumor size from the surgical specimen.

The univariable logistic regression analyses of pN0 are presented in Supplementary Table 3 and AUCs for radiological variables in Supplementary Table 4. Radiological tumor size (odds ratio (OR) 0.97 per mm, 95% confidence interval (CI) 0.95–0.98, p <0.001) had the strongest evidence of association with pN0 in univariable analyses.

The MLR resulted in a clinical preoperative model including radiological tumor size, ER status, age, mode of detection, histological type, and tumor localization (upper inner quadrant vs. other), with an AUC of 0.68 (95% CI: 0.63–0.72) (Table 2). A nomogram visualizing the point estimates for the clinical preoperative model was developed (Figure 3). The remaining radiological variables added to this model using the same method, resulted in a combined preoperative model including radiological tumor size, ER status, age, mode of detection, histological type, tumor localization, highest soft tissue lesion score (continuous), and soft tissue lesion (binary) with an AUC of 0.68 (95% CI: 0.63–0.72) (Table 2). The corresponding AUC for the postoperative framework model was 0.76 (0.71–0.80). Each model’s AIC is presented in Supplementary Table 5. The candidate variable selection procedure was evaluated in 1000 bootstrap samples as well as by cross-validation. Radiological tumor size was selected in 96.5% of the bootstrap analyses (Supplementary Table 6) and in 100% of the cross-validation analyses. The clinical preoperative prediction model was externally validated with an AUC of 0.64 (95% CI: 0.59–0.69). A Hosmer-Lemeshow calibration plot is presented in Supplementary Figure 1. Applying the clinical preoperative prediction model to assign pN0 resulted in a possible SLNB reduction rate of 21% or 34% (Table 3), corresponding to a cut-off that accepts a 10% or 20% FNR, respectively.

Table 2

Table 2 The clinical and combined preoperative prediction models for pN0. Backward variable selection with threshold p≥0.157 for removal.

Figure 3

Figure 3 A nomogram illustrating the point estimates of the included variables’ coefficients for the clinical preoperative prediction model for prediction of negative sentinel lymph node status.

Table 3

Table 3 SLNB reduction rate using the clinical preoperative prediction model to assign sentinel lymph node status.

4 Discussion

In this study, a truly preoperative prediction model for pN0 in primary breast cancer was developed combining radiological and preoperatively available routine clinicopathological variables. The inclusion criteria were not restricted by age, tumor size, or type of surgery. This supports that the model can be used on a case-by-case basis evaluation of pN0 outside the ASCO guidelines on abstaining SLNB in older patients with ER+/HER2- tumors (2). Radiological tumor size was the strongest preoperative predictor of pN0 reflected by its low p-value and high selection rate (≈100%) in bootstrap analyses and cross-validation. This indicates that mammographic tumor size could replace pathological tumor size in preoperative models. Moreover, it was strongly associated with pathological tumor size. The soft tissue lesion score was associated with pN0 in univariable analyses, supporting the hypothesis that mammographic features could aid in preoperatively identifying these patients. However, although associated with the outcome, addition of radiological variables to the clinical preoperative model did not improve discrimination. The clinical and combined preoperative model had AUCs of 0.68 (95% CI: 0.63–0.72), indicating that the addition of radiological variables did not improve the overall performance of the model. External validation of the clinical preoperative prediction model resulted in an AUC of 0.64 (95% CI: 0.59–0.69). The Hosmer-Lemeshow calibration plot demonstrated that the prediction model underestimates the probability of node negativity, although the estimates follow the 45-degree line. A likely explanation is the difference in pN+ prevalence between the cohorts. Nevertheless, the clinical preoperative prediction model could putatively support the omission of SLNB in 21% of patients, if a 10% FNR is accepted, reflecting the accepted FNR of the SLNB procedure and support the implementation of the ASCO guidelines and the results of the SOUND trial (2, 5). The reduction rate is directly dependent on the accepted FNR. The FNR reflecting SLNB could be considered conservative, and accepting a higher FNR might be acceptable in clinical practice. Applying a 20% FNR resulted in a 34% SLNB reduction rate. An alternative to a fixed cut-point, enabling a more patient-centered care, would be to allow different cut-points to be discussed and decided with the patient, on a case-by-case basis.

Pathological tumor size is a strong predictor of SLN status and often included in published prediction models, although it is assessed on the postoperative surgical specimen (10, 12, 13). In accordance with previous research, pathologic and radiologic measurements of tumor size were strongly correlated. The correlation coefficient was 0.62, in the present study, which can be compared to the correlation between pathologic tumor size and radiologic tumor size measured by mammography or ultrasound, depending on histological subtypes, in a study by Gruber et al. (11). This indicates that radiologic tumor size could be used as an alternative measure for pathological tumor size. Mammography has also been shown to estimate the tumor size more accurately than ultrasound, which underestimates the size with a varying degree depending on the histological tumor type (11). In this study radiologic tumor size was strongly associated with pN0 indicating that mammographic tumor size could replace pathological tumor size as a predictor of pN0. However, there may be subgroups, such as patients with dense breasts, in which radiological tumor size needs further evaluation.

The clinical preoperative and combined preoperative models (AUC 0.68) had lower AUCs than the postoperative framework model (AUC 0.76). This was expected as strong predictors, determined on the surgical specimen, were excluded from the model to make it clinically useful in a preoperative setting. The AIC of the postoperative framework prediction model was lower than those of the clinical preoperative and combined preoperative models, which was expected considering the superior discriminative capacity. Additionally, the AUC of the clinical preoperative prediction model was slightly lower in the external validation cohort, which was expected. The difference may be due to differences in prevalence of pN0 in the cohorts, where the external validation cohort had a higher prevalence. Additional analyses adjusting for the prevalence (data not shown) showed an AUC similar to the AUC of the internal validation. A prediction model for heavy nodal burden by Meteroja et al. (41) included prevalence of the outcome as a variable in the model to adjust for differences between populations. This is, however, not applicable in the present study due to the single center approach.

When elaborating on the radiological variables used in this study, it is important to note that Transpara was not intended to be used as a tool for SLN status prediction, although this study indicates its potential predictive value. Additional development of Transpara features in this direction may improve its predictive ability for pN0. However, the potential clinical use and definitions of medicolegal regulations regarding this type of diagnostic tools are debated and yet to be determined before clinical implementation (27, 42). Other forms of AI such as feature extraction from other imaging modalities, such as ultrasound, magnetic resonance imaging, and computed tomography, as well as machine learning methods have been evaluated for prediction of ALN status in several studies receiving high AUCs (16, 18–20). However, to our knowledge, none are yet available for implementation in clinical practice. Implementation of image analysis software in clinical practice could have other applications apart from screening and should therefore be evaluated for other possibilities (43). The implementation of a prediction model in clinical practice would entail additional costs, whereas omission of SLNB would likely reduce costs associated with surgery as previously described for the ANN model proposed by Dihge et al. (10, 44). Striving for de-escalation in cancer care, omission of SLNB could also improve the quality of life and reduce postoperative morbidity, as reported in the INSEMA trial (3). The ASCO guidelines on management of the axilla in early-stage breast cancer stated that patients should be evaluated on a case-by-case basis to ensure oncological safety (2). Therefore, a truly preoperative prediction model for pN0 based on routine information on the individual patients would be of high clinical relevance and act as a foundation for patient-centered decision making. The presented nomogram could be an easy-to-use decision tool to support the preoperative multidisciplinary decision-making to omit SLNB for one in five patients with predicted pN0 status, consequently reducing the succeeding complications. Considering the preoperative nature of the proposed model, it could be considered an improvement compared to the previously published prediction model, regardless of the inferior discriminative capacity. However, the proposed model was developed and validated in retrospective cohorts, thus requiring additional research to ultimately benefit patients. In order to enable implementation of the clinical preoperative prediction model, the model should be validated in prospective studies.

A limitation of previous clinical prediction models is that key variables can only be obtained postoperatively (6, 7, 10). In other studies, this problem was circumvented by including radiological variables from different imaging modalities exclusively or in addition to clinicopathological variables. Liu et al. proposed an exclusively radiological ANN model using contrast-enhanced computed tomography (16). However, this approach is only feasible for clinical implementation for patients with breast cancer who undergo contrast-enhanced computed tomography during the initial routine work-up, an argument which also applies to models that include magnetic resonance imaging (18, 19). Given the wide implementation of mammography as a cornerstone in the clinical work-up for suspicious breast cancer including screening programs, mammographic images are available for all patients and can be used for preoperative diagnostics. Cen et al. proposed a model that included postoperative clinicopathological variables and microcalcification density on mammographic images, resulting in a model with an AUC of 0.70 (12). In the study, microcalcification density >20 cm² was associated with a positive ALN status. This was not observed in the present study, which might be a result of differing measuring techniques. Yang et al. (14) created a prediction model for ALN status (n=147) using a radiomic signature on mammography with an AUC of 0.88 in the validation cohort, but no independent validation has hitherto been performed. Studies have shown a positive association between breast density and malignant axillary lymph nodes when measured by radiologists and automated methods (36, 38), but when evaluated in a prediction model for pN0 including multifocality, pathological tumor size, histological type, Ki67 and histological grade, Hack et al. (13) found no additional predictive value, which is in line with the present study. In this study, images from the ipsilateral side were included in the LIBRA analysis. This was decided as mammographic images on the contralateral side were not available for all patients. The variation in tumor size (0.5 – 90 mm) can be assumed to have affected the results of the breast density variable in descriptive and univariable logistic regression as well as the performance of the variable in the MLR to some extent. However, the results are in accordance with previous results (13). LIBRA, which is a fully automated assessment tool that analyzes processed images, has been validated for breast density measurements on mammographic images by Gastounioti et al. (34) and Keller et al. (17), among others. The discrepancy between previous reports on the association between breast density and ALN status (13, 36, 38) could be due to differences between methods as revealed by Keller et al. (35).

A strength of this study is the relatively large cohort of 770 patients and that the inclusion criteria were not restricted by age, tumor size or type of surgery. All eligible cases during a four-year period were consecutively included, and the cohort should therefore be representative of the breast cancer population at Skåne University Hospital in Lund during the time period. Another strength is the assessment of Transpara’s accuracy through cross-checking for the correct tumor location. Regardless of the cross-checking, all cases were included to resemble a clinical setting. The pN0 nomogram presents a graphical easy-to-interpret visualization of the included predictors and the relative importance of each independent variable is visible at a glance. There are several limitations to this study, such as the low prevalence of pN0 compared to recent cohorts. This could be due to the fact that breast cancers are discovered at an earlier stage now than in past decades. Another limitation is the exclusion of 30 cases due to technical issues in the Transpara sub-cohort, a cause of which could not be identified despite repeated contact with the technical support at the hospital and PACS provider. The authors believe that the reason may be a technical issue with the PACS provider during the archiving process. However, the missing images represent less than 4% of the cohort and the authors have found no reason to believe that the missingness is systematic. The impact of the technical issues on the model development should therefore be minimal. Additionally, Transpara failed to analyze the images of three patients (<0.4%), with unspecified errors. Furthermore, radiological tumor size was missing in 23% of cases, likely due to the fact that the radiological tumor size measurement was not always provided at the time of inclusion of patients in the present study. Thus, the performance of the radiological variables could be biased owing to the missing data. The inclusion of sonographic tumor size in cases where mammographic size was not available has likely decreased the correlation between pathological and radiological tumor size as ultrasound underestimates the size. The correlation can be expected to be higher in a cohort using only mammographic tumor size, increasing the performance of the model. Considering the high correlation presented in this study and the performance of the radiological tumor size in all analyses, the inclusion of sonographic data should not have affected the results. Another limitation is the inclusion of only ipsilateral images in the LIRBA analysis, however, the effect on the overall conclusions can be presumed to be limited. Additionally, the prediction model on which this study is based is an ANN model that has the potential to capture non-linear associations and interactions, whereas the MLR model used in this study captures only linear effects on the log odds scale. This is a limitation and a strength, as the risk of overfitting is lower with less complex models such as MLR than with complex models. Moreover, to minimize the number of variables compared with the number of pN0 patients in the cohort, no interaction variables were included.

4.1 Conclusion

Radiological tumor size was strongly predictive of SLN status, thus supporting the hypothesis that radiological tumor size could replace pathological tumor size as a predictor of pN0. Additionally, although they did not improve the clinical preoperative prediction model, mammographic features might have nodal predictive capabilities. The presented clinical preoperative prediction model is visualized by a nomogram that could support the preoperative multidisciplinary decision-making to omit SLNB on a case-by-case basis for one in five patients with clinically node negative primary breast cancer with predicted pN0 status.

Data availability statement

The datasets presented in this article are not readily available because of sensitive information according to current data legislation but are available from the corresponding author upon reasonable request. Requests to access the datasets should be directed to Y29ybmVsaWEucmVqbWVyQG1lZC5sdS5zZQ==.

Ethics statement

The study involving humans were approved by The Regional Ethical Review Board of Lund, Sweden, and KVB Samråd, Region Skåne. The study were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

CR: Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing, Validation. LD: Data curation, Writing – review & editing, Investigation. P-OB: Writing – review & editing, Conceptualization, Formal analysis, Methodology, Validation. DF: Writing – review & editing, Investigation, Resources. MD: Conceptualization, Resources, Writing – review & editing. LR: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was founded by the Erling Persson Family Foundation (2022/0410); the Governmental Funding of Clinical Research within the National Health Service (ALF) (2022/40304); the Governmental Funding of Clinical Research within the National Health Service (ALF), Young researcher LD; Per-Eric and Ulla Schyberg’s Foundation; the Swedish Cancer Society (2022/0388); the Swedish Research Council (2020/01491); the Swedish Cancer- and Allergy Fund. No funding body took part in the design of the study, collection of data, analysis, interpretation of data or in writing the manuscript.

Acknowledgments

We would like to thank Editage [http://www.editage.com] for editing and reviewing this manuscript for English language. We would also like to thank the funding agencies mentioned in the funding section. Preliminary findings were presented in part at the San Antonio Breast Cancer Symposium 2021. A preprint was published in 2023. The results of the current study and the manuscript have not been published and are not being considered for publication in print or electronics elsewhere (45, 46).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1394448/full#supplementary-material

Abbreviations

AI, Artificial intelligence; AIC, Akaike information criterion; ALN, Axillary lymph node; ANN, Artificial neural network; AUC, Area under the receiver operating characteristic curve; CI, Confidence interval; ER, Estrogen receptor; FNR, false negative rate; INSEMA, Intergroup Sentinel Mamma study; LIBRA, Laboratory for Individualized Breast Radiodensity Assessment; MLR, Multivariable logistic regression; pN0, Negative sentinel lymph node status; pN+, Positive sentinel lymph node status; PACS, Picture Archiving and Communication System; PR, Progesterone receptor; ROC, Receiver operating characteristic; SLN, Sentinel lymph node; SLNB, Sentinel lymph node biopsy; SOUND, Sentinel Node vs Observation After Axillary Ultra-Sound.

References

1. Cardoso F, Kyriakides S, Ohno S, Penault-Llorca F, Poortmans P, Rubio IT, et al. Early breast cancer: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann Oncol. (2019) 30:1194–220. doi: 10.1093/annonc/mdz173

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Brackstone M, Baldassarre FG, Perera FE, Cil T, Chavez Mac Gregor M, Dayes IS, et al. Management of the axilla in early-stage breast cancer: ontario health (Cancer care ontario) and ASCO guideline. J Clin Oncol. (2021) 39:3056–82. doi: 10.1200/JCO.21.00934

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Reimer T, Stachs A, Veselinovic K, Polata S, Müller T, Kühn T, et al. Patient-reported outcomes for the Intergroup Sentinel Mamma study (INSEMA): A randomised trial with persistent impact of axillary surgery on arm and breast symptoms in patients with early breast cancer. eClinicalMedicine. (2023) 55:101756. doi: 10.1016/j.eclinm.2022.101756

PubMed Abstract | CrossRef Full Text | Google Scholar

4. McCartan D, Stempel M, Eaton A, Morrow M, Pilewskie M. Impact of body mass index on clinical axillary nodal assessment in breast cancer patients. Ann Surg Oncol. (2016) 23:3324–9. doi: 10.1245/s10434-016-5330-0

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Gentilini OD, Botteri E, Sangalli C, Galimberti V, Porpiglia M, Agresti R, et al. Sentinel lymph node biopsy vs no axillary surgery in patients with small breast cancer and negative results on ultrasonography of axillary lymph nodes: the SOUND randomized clinical trial. JAMA Oncol. (2023). doi: 10.1093/bjs/znad391

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Dihge L, Bendahl P-O, Rydén L. Nomograms for preoperative prediction of axillary nodal status in breast cancer. Br J Surg. (2017) 104:1494–505. doi: 10.1002/bjs.10583

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Bevilacqua JLB, Kattan MW, Fey JV, Cody HS III, Borgen PI, Van Zee KJ. Doctor, what are my chances of having a positive sentinel node? A validated nomogram for risk estimation. J Clin Oncol. (2007) 25:3670–9. doi: 10.1200/JCO.2006.08.8013

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Meretoja TJ, Heikkilä PS, Mansfield AS, Cserni G, Ambrozay E, Boross G, et al. A predictive tool to estimate the risk of axillary metastases in breast cancer patients with negative axillary ultrasound. Ann Surg Oncol. (2014) 21:2229–36. doi: 10.1245/s10434-014-3617-6

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Reyal F, Rouzier R, Depont-Hazelzet B, Bollet MA, Pierga J-Y, Alran S, et al. The molecular subtype classification is a determinant of sentinel node positivity in early breast carcinoma. PloS One. (2011) 6:e20297. doi: 10.1371/journal.pone.0020297

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Dihge L, Ohlsson M, Edén P, Bendahl PO, Rydén L. Artificial neural network models to predict nodal status in clinically node-negative breast cancer. BMC Cancer. (2019) 19:610. doi: 10.1186/s12885-019-5827-6

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Gruber IV, Rueckert M, Kagan KO, Staebler A, Siegmann KC, Hartkopf A, et al. Measurement of tumour size with mammography, sonography and magnetic resonance imaging as compared to histological tumour size in primary breast cancer. BMC Cancer. (2013) 13:328. doi: 10.1186/1471-2407-13-328

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Cen D, Xu L, Zhang S, Zhou S, Huang Y, Chen Z, et al. BI-RADS 3-5 microcalcifications: prediction of lymph node metastasis of breast cancer. Oncotarget. (2017) 8:30190–8. doi: 10.18632/oncotarget.v8i18

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Hack C, Häberle L, Geisler K, Schulz-Wendtland R, Hartmann A, Fasching P, et al. Mammographic density and prediction of nodal status in breast cancer patients. Geburtshilfe und Frauenheilkunde. (2013) 73:136–41. doi: 10.1055/s-00000020

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Yang J, Wang T, Yang L, Wang Y, Li H, Zhou X, et al. Preoperative prediction of axillary lymph node metastasis in breast cancer using mammography-based radiomics method. Sci Rep. (2019) 9:4429. doi: 10.1038/s41598-019-40831-z

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Tan H, Wu Y, Bao F, Zhou J, Wan J, Tian J, et al. Mammography-based radiomics nomogram: a potential biomarker to predict axillary lymph node metastasis in breast cancer. Br J Radiol. (2020) 93:20191019. doi: 10.1259/bjr.20191019

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Liu Z, Ni S, Yang C, Sun W, Huang D, Su H, et al. Axillary lymph node metastasis prediction by contrast-enhanced computed tomography images for breast cancer patients based on deep learning. Comput Biol Med. (2021) 136:104715. doi: 10.1016/j.compbiomed.2021.104715

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Keller BM, Nathan DL, Wang Y, Zheng Y, Gee JC, Conant EF, et al. Estimation of breast percent density in raw and processed full field digital mammography images via adaptive fuzzy c-means clustering and support vector machine segmentation. Med Phys. (2012) 39:4903–17. doi: 10.1118/1.4736530

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Cattell R, Ying J, Lei L, Ding J, Chen S, Serrano Sosa M, et al. Preoperative prediction of lymph node metastasis using deep learning-based features. Visual Computing Industry Biomedicine Art. (2022) 5. doi: 10.1186/s42492-022-00104-5

CrossRef Full Text | Google Scholar

19. Wang C, Chen X, Luo H, Liu Y, Meng R, Wang M, et al. Development and internal validation of a preoperative prediction model for sentinel lymph node status in breast cancer: combining radiomics signature and clinical factors. Front Oncol. (2021) 11:754843. doi: 10.3389/fonc.2021.754843

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Zhou LQ, Wu XL, Huang SY, Wu GG, Ye HR, Wei Q, et al. Lymph node metastasis prediction from primary breast cancer US images using deep learning. Radiology. (2020) 294:19–28. doi: 10.1148/radiol.2019190372

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Qiu X, Jiang Y, Zhao Q, Yan C, Huang M, Jiang T. Could ultrasound-based radiomics noninvasively predict axillary lymph node metastasis in breast cancer? J Ultrasound Med. (2020) 39:1897–905. doi: 10.1002/jum.15294

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Skarping I, Ellbrant J, Dihge L, Ohlsson M, Huss L, Bendahl P-O, et al. Retrospective validation study of an artificial neural network-based preoperative decision-support tool for noninvasive lymph node staging (NILS) in women with primary breast cancer (ISRCTN14341750). BMC Cancer. (2024) 24:86. doi: 10.1186/s12885-024-11854-1

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Goldhirsch A, Winer EP, Coates AS, Gelber RD, Piccart-Gebhart M, Thürlimann B, et al. Personalizing the treatment of women with early breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2013. Ann Oncol. (2013) 24:2206–23. doi: 10.1093/annonc/mdt303

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Wolff AC, Hammond MEH, Allison KH, Harvey BE, McShane LM, Dowsett M. HER2 testing in breast cancer: american society of clinical oncology/college of american pathologists clinical practice guideline focused update summary. J Oncol Pract. (2018) 14:437–41. doi: 10.1200/JOP.18.00206

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Regionala cancercentrum i samverkan. Nationellt vårdprogram bröstcancer 2024 (2024). Available online at: https://kunskapsbanken.cancercentrum.se/diagnoser/brostcancer/vardprogram/.

Google Scholar

26. Steinhof-Radwańska K, Lorek A, Holecki M, Barczyk-Gutkowska A, Grażyńska A, Szczudło-Chraścina J, et al. Multifocality and multicentrality in breast cancer: comparison of the efficiency of mammography, contrast-enhanced spectral mammography, and magnetic resonance imaging in a group of patients with primarily operable breast cancer. Curr Oncol. (2021) 28:4016–30. doi: 10.3390/curroncol28050341

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Rodriguez-Ruiz A, Lång K, Gubern-Merida A, Broeders M, Gennaro G, Clauser P, et al. Stand-alone artificial intelligence for breast cancer detection in mammography: comparison with 101 radiologists. JNCI: J Natl Cancer Institute. (2019) 111:916–22. doi: 10.1093/jnci/djy222

CrossRef Full Text | Google Scholar

28. Larsen M, Aglen CF, Lee CI, Hoff SR, Lund-Hanssen H, Lång K, et al. Artificial intelligence evaluation of 122 969 mammography examinations from a population-based screening program. Radiology. (2022) 303:502–11. doi: 10.1148/radiol.212381

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Lauritzen AD, Rodríguez-Ruiz A, von Euler-Chelpin MC, Lynge E, Vejborg I, Nielsen M, et al. An artificial intelligence–based mammography screening protocol for breast cancer: outcome and radiologist workload. Radiology. (2022) 304:41–9. doi: 10.1148/radiol.210948

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Romero-Martín S, Elías-Cabot E, Raya-Povedano JL, Gubern-Mérida A, Rodríguez-Ruiz A, Álvarez-Benito M. Stand-alone use of artificial intelligence for digital mammography and digital breast tomosynthesis screening: A retrospective evaluation. Radiology. (2022) 302:535–42. doi: 10.1148/radiol.211590

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Lång K, Dustler M, Dahlblom V, Åkesson A, Andersson I, Zackrisson S. Identifying normal mammograms in a large screening population using artificial intelligence. Eur Radiol. (2021) 31:1687–92. doi: 10.1007/s00330-020-07165-1

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Lång K, Josefsson V, Larsson A-M, Larsson S, Högberg C, Sartor H, et al. Artificial intelligence-supported screen reading versus standard double reading in the Mammography Screening with Artificial Intelligence trial (MASAI): a clinical safety analysis of a randomised, controlled, non-inferiority, single-blinded, screening accuracy study. Lancet Oncol. (2023) 24:936–44. doi: 10.1016/S1470-2045(23)00298-X

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Vachon CM, Scott CG, Norman AD, Khanani SA, Jensen MR, Hruska CB, et al. Impact of artificial intelligence system and volumetric density on risk prediction of interval, screen-detected, and advanced breast cancer. J Clin Oncol. (2023). doi: 10.1200/JCO.22.01153

CrossRef Full Text | Google Scholar

34. Gastounioti A, Kasi CD, Scott CG, Brandt KR, Jensen MR, Hruska CB, et al. Evaluation of LIBRA software for fully automated mammographic density assessment in breast cancer risk prediction. Radiology. (2020) 296:24–31. doi: 10.1148/radiol.2020192509

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Keller BM, Chen J, Daye D, Conant EF, Kontos D. Preliminary evaluation of the publicly available Laboratory for Breast Radiodensity Assessment (LIBRA) software tool: comparison of fully automated area and volumetric density measures in a case–control study with digital mammography. Breast Cancer Res. (2015) 17:117. doi: 10.1186/s13058-015-0626-8

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Krishnan K, Baglietto L, Stone J, McLean C, Southey MC, English DR, et al. Mammographic density and risk of breast cancer by tumor characteristics: A case-control study. BMC Cancer. (2017) 17. doi: 10.1186/s12885-017-3871-7

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Bertrand KA, Tamimi RM, Scott CG, Jensen MR, Pankratz VS, Visscher D, et al. Mammographic density and risk of breast cancer by age and tumor characteristics. Breast Cancer Res. (2013) 15:R104. doi: 10.1186/bcr3570

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Sartor H, Borgquist S, Hartman L, Zackrisson S. Do pathological parameters differ with regard to breast density and mode of detection in breast cancer? The Malmö Diet and Cancer Study. Breast. (2015) 24:12–7. doi: 10.1016/j.breast.2014.10.006

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Pesek S, Ashikaga T, Krag LE, Krag D. The false-negative rate of sentinel node biopsy in patients with breast cancer: A meta-analysis. World J Surg. (2012) 36:2239–51. doi: 10.1007/s00268-012-1623-z

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Krag DN, Anderson SJ, Julian TB, Brown AM, Harlow SP, Ashikaga T, et al. Technical outcomes of sentinel-lymph-node resection and conventional axillary-lymph-node dissection in patients with clinically node-negative breast cancer: results from the NSABP B-32 randomised phase III trial. Lancet Oncol. (2007) 8:881–8. doi: 10.1016/S1470-2045(07)70278-4

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Meretoja TJ, Audisio RA, Heikkilä PS, Bori R, Sejben I, Regitnig P, et al. International multicenter tool to predict the risk of four or more tumor-positive axillary lymph nodes in breast cancer patients with sentinel node macrometastases. Breast Cancer Res Treat. (2013) 138:817–27. doi: 10.1007/s10549-013-2468-3

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Rodríguez-Ruiz A, Krupinski E, Mordang J-J, Schilling K, Heywang-Köbrunner SH, Sechopoulos I, et al. Detection of breast cancer with mammography: effect of an artificial intelligence support system. Radiology. (2019) 290:305–14. doi: 10.1148/radiol.2018181371

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Rodriguez-Ruiz A, Lång K, Gubern-Merida A, Teuwen J, Broeders M, Gennaro G, et al. Can we reduce the workload of mammographic screening by automatic identification of normal exams with artificial intelligence? A feasibility study. Eur Radiol. (2019) 29:4825–32. doi: 10.1007/s00330-019-06186-9

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Skarping I, Nilsson K, Dihge L, Fridhammar A, Ohlsson M, Huss L, et al. The implementation of a noninvasive lymph node staging (NILS) preoperative prediction model is cost effective in primary breast cancer. Breast Cancer Res Treat. (2022) 194:577–86. doi: 10.1007/s10549-022-06636-x

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Rejmer C, Dihge L, Bendahl P-O, Förnvik D, Dustler M, Rydén L. Abstract P1-01-09: Prediction of node negative breast cancer and high disease burden through image analysis software on mammographic images and clinicopathological data. Cancer Res. (2022) 82:P1–01–9–P1–9. doi: 10.1158/1538-7445.SABCS21-P1-01-09

CrossRef Full Text | Google Scholar

46. Rejmer C, Dihge L, Bendahl P-O, Förnvik D, Dustler M, Rydén L. A preoperative prediction model for sentinel lymph node status using artificial intelligence on mammographic images and clinicopathological variables in patients with clinically node-negative breast cancer. [Preprint] (Version 1) available at Research Square (2023). doi: 10.21203/rs.3.rs-2590918/v1.

CrossRef Full Text | Google Scholar

Keywords: breast cancer, de-escalation, sentinel lymph node biopsy, artificial intelligence, mammography, prediction model, personalized medicine

Citation: Rejmer C, Dihge L, Bendahl P-O, Förnvik D, Dustler M and Rydén L (2024) Preoperative prediction of nodal status using clinical data and artificial intelligence derived mammogram features enabling abstention of sentinel lymph node biopsy in breast cancer. Front. Oncol. 14:1394448. doi: 10.3389/fonc.2024.1394448

Received: 01 March 2024; Accepted: 26 June 2024;
Published: 10 July 2024.

Edited by:

Marco Materazzo, Policlinico Tor Vergata, Italy

Reviewed by:

Gang Sun, Xinjiang Cancer Center/Key Laboratory of Oncology of Xinjiang Uyghur Autonomous Region, China
Omar Hamdy, Mansoura University, Egypt

Copyright © 2024 Rejmer, Dihge, Bendahl, Förnvik, Dustler and Rydén. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Cornelia Rejmer, Y29ybmVsaWEucmVqbWVyQG1lZC5sdS5zZQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.