Deep learning models for preoperative T-stage assessment in rectal cancer using MRI: exploring the impact of rectal filling

Tian, Chang; Ma, Xiaolu; Lu, Haidi; Wang, Qian; Shao, Chengwei; Yuan, Yuan; Shen, Fu

doi:10.3389/fmed.2023.1326324

ORIGINAL RESEARCH article

Front. Med. , 29 November 2023

Sec. Nuclear Medicine

Volume 10 - 2023 | https://doi.org/10.3389/fmed.2023.1326324

This article is part of the Research Topic Workflow Optimisation for Radiological Imaging View all 17 articles

Deep learning models for preoperative T-stage assessment in rectal cancer using MRI: exploring the impact of rectal filling

Chang Tian¹

Xiaolu Ma²

Haidi Lu²

Qian Wang³

Chengwei Shao²

Yuan Yuan²^*^†

Fu Shen²^*^†

¹School of Information Science and Technology and School of Biomedical Engineering, ShanghaiTech University, Shanghai, China
²Department of Radiology, Changhai Hospital, The Navy Medical University, Shanghai, China
³School of Biomedical Engineering, ShanghaiTech University, Shanghai, China

Background: The objective of this study was twofold: firstly, to develop a convolutional neural network (CNN) for automatic segmentation of rectal cancer (RC) lesions, and secondly, to construct classification models to differentiate between different T-stages of RC. Additionally, it was attempted to investigate the potential benefits of rectal filling in improving the performance of deep learning (DL) models.

Methods: A retrospective study was conducted, including 317 consecutive patients with RC who underwent MRI scans. The datasets were randomly divided into a training set (n = 265) and a test set (n = 52). Initially, an automatic segmentation model based on T2-weighted imaging (T2WI) was constructed using nn-UNet. The performance of the model was evaluated using the dice similarity coefficient (DSC), the 95th percentile Hausdorff distance (HD95), and the average surface distance (ASD). Subsequently, three types of DL-models were constructed: Model 1 trained on the total training dataset, Model 2 trained on the rectal-filling dataset, and Model 3 trained on the non-filling dataset. The diagnostic values were evaluated and compared using receiver operating characteristic (ROC) curve analysis, confusion matrix, net reclassification index (NRI), and decision curve analysis (DCA).

Results: The automatic segmentation showed excellent performance. The rectal-filling dataset exhibited superior results in terms of DSC and ASD (p = 0.006 and 0.017). The DL-models demonstrated significantly superior classification performance to the subjective evaluation in predicting T-stages for all test datasets (all p < 0.05). Among the models, Model 1 showcased the highest overall performance, with an area under the curve (AUC) of 0.958 and an accuracy of 0.962 in the filling test dataset.

Conclusion: This study highlighted the utility of DL-based automatic segmentation and classification models for preoperative T-stage assessment of RC on T2WI, particularly in the rectal-filling dataset. Compared with subjective evaluation, the models exhibited superior performance, suggesting their noticeable potential for enhancing clinical diagnosis and treatment practices.

Background

Colorectal cancer (CRC) stands as the second most prevalent contributor to cancer-related mortality in the United States. Projections for the year 2023 indicate that approximately 153,020 individuals will receive a diagnosis of CRC, and regrettably, 52,550 individuals will succumb to the disease. This includes a concerning subset of 19,550 cases and 3,750 deaths among individuals below the age of 50 years old (1). Rectal cancer (RC) is a subset of CRC, a disease that poses a grave risk to people’s lives. Rectal magnetic resonance imaging (MRI) has witnessed widespread utilization in the comprehensive assessment of RC, assuming a vital role in treatment planning for patients by facilitating accurate preoperative tumor staging. Within clinical practice, high-resolution T2-weighted imaging (HR-T2WI) has gained unanimous acceptance as the optimal approach for preoperative staging of RC (2). The precise preoperative differentiation between T1-2 and T3-4 stages in RC holds immense significance for clinicians in guiding individualized treatment strategies. The ability to discern which patients should undergo total mesorectal excision (TME) or receive neoadjuvant treatment while minimizing the risks of both over-treatment and under-treatment has become paramount (3, 4). However, the traditional approach to MRI staging relies heavily on the expertise and subjective evaluation of radiologists, leading to diminished repeatability and accuracy rates. This reliance poses significant challenges in achieving an accurate preoperative T-stage diagnosis for RC (5). Furthermore, a contentious issue surrounds the use of rectal distension during rectal MRI, specifically regarding whether the rectal lumen should be filled with fluid or gel (2–4). While the primary objective of rectal filling is to optimize lesion visualization and improve T-stage assessment on MRI, the question of its routine application remains unresolved due to a lack of robust evidence demonstrating substantial improvements in lesion conspicuity (3–5).

In recent years, radiomics has emerged as a potential method for addressing diverse clinical challenges, surpassing traditional methods in several studies. By leveraging high-throughput analysis to extract a multitude of quantitative features from medical images, radiomics approaches have demonstrated promising potential in the field of digestive tumors (6–17). However, the predominant methodologies in this domain typically involve manually determining the volume of the entire primary tumor. This process is not only arduous and time-consuming but also heavily reliant on the operator’s expertise, demanding a high level of proficiency (9, 16, 17).

Previous study yielded an initial finding indicating the development of two distinct radiomics models utilizing rectal HR-T2WI, both with and without rectal filling. These models were devised to evaluate the T staging of RC. Notably, our results demonstrated the superior performance of the radiomics model incorporating rectal filling in effectively distinguishing between T1-2 and T3 stages. This promising outcome suggests that the utilization of this model could offer valuable support in clinical decision-making when evaluating T-stage in RC patients (6).

Meanwhile, the deep learning (DL)-based method, as a novel technology, could significantly improve lesion automatic localization and segmentation, tumor diagnosis, staging, and prognosis prediction to facilitate treatment strategy, and could even greatly help radiologists work more efficiently and reduce their burden (18–20). Despite the considerable significance of T staging in RC, there exists a notable research gap regarding the validation and comparative analysis of MRI-based DL approaches specifically tailored for T staging evaluation, taking into account the presence or absence of rectal filling.

Therefore, in this study, we initially attempted to construct a convolutional neural network for the automatic localization and segmentation of RC lesions. Subsequently, we developed DL networks for the assessment of RC T-staging. Of utmost importance was our exploration of whether rectal filling could prove beneficial in guiding clinical decision-making for RC T stage evaluation.

Methods

Participants

This study followed the Declaration of Helsinki and had approval from the Ethics Committees of Changhai Hospital. Written informed consent was waived from all patients.

This retrospective trial enrolled a total of 492 consecutive patients with RC who underwent radical resection at Changhai Hospital between January 2017 and May 2023. The study’s inclusion criteria comprised the following: (1) confirmation of rectal adenocarcinoma through postoperative pathological examination; (2) presence of a single lesion; (3) baseline rectal magnetic resonance (MR) examination conducted within 14 days prior to surgical resection. Exclusion criteria included: (1) receipt of any local or systemic treatment prior to surgical resection, such as neoadjuvant chemoradiotherapy (n = 108); (2) concurrent diagnosis of other malignancies (n = 7); (3) poor image quality (n = 25); (4) synchronous distant metastasis (n = 23); (5) palliative resection (n = 7); (6) history of previous pelvic surgery (n = 5). Consequently, a total of 317 cases were included in the final analysis, as depicted in Figure 1.

FIGURE 1

Figure 1. Flowchart of the study.

Clinicopathologic data

Patients’ demographic and clinicopathological data were retrospectively extracted from the clinicopathological databases. These data encompassed various factors, including sex, age, body mass index (BMI), histological differentiation, pathological T-stage, pathological N-stage, carcinoembryonic antigen (CEA) levels (with <5 ng/mL considered as negative), and carbohydrate antigen 19-9 (CA19-9) levels (with <37 U/mL considered as negative). These parameters were recorded concurrently with the baseline MRI examinations. Employing the criteria set forth by the National Comprehensive Cancer Network (NCCN) and American Joint Committee on Cancer (AJCC) staging system (21), the patients involved in the study were meticulously stratified into distinct cohorts, each characterized by their respective pathological T stages. Specifically, the T1-2 group encompassed individuals with tumors confined solely to the submucosal and muscularis propria layers. In contrast, the T3-4 group comprised patients with tumors that exhibited invasive growth beyond the confines of the muscularis propria.

Image acquisition and analysis

Prior to the study, baseline rectal MRI scans were performed using 3.0 T MR systems, including the Siemens 3.0 T MAGNETOM Skyra MRI System, GE 3.0 T Discovery MR 750w, and Signa HDX System, coupled with a specialized phased array coil for enhanced imaging sensitivity. To ensure optimal image quality, intestinal cleaning was meticulously carried out through the administration of a 20 mL glycerin enema. Considering the possibility of contraindications, the administration of raceanisodamine hydrochloride, a commonly used agent, was deliberately omitted. As part of the routine imaging procedure, oblique axial HR-T2WI was conducted with careful consideration of the orientation perpendicular to the long axis of the rectum, encompassing the region of interest (ROI). Notably, detailed information regarding the parameters employed for HR-T2WI, which played a pivotal role in the subsequent analysis, can be found in Supplementary Table S1.

Within the filling group, patients underwent a baseline MRI with rectal filling, involving the administration of warm ultrasound (US) transmission gel to achieve rectal distention. Prior to acquiring the oblique axial HR-T2W images, the volume of gel used for rectal filling was tailored based on the endoscopic evaluation of tumor location. Specifically, 60–80 mL of gel was administered for lesions situated in the lower and middle rectum, while 80–100 mL was utilized for lesions in the upper rectum (22). Conversely, in the non-filling group, rectal distention using US gel was omitted during the baseline MRI procedure.

Subjective evaluation and ROI delineation were performed by 3 radiologists with systematic training, including FS, HL, and YY with 15, 11, and 13 years of experience in MR diagnosis, respectively, who were blinded to pathological data. A subjective classification task was assigned to the experts, requiring them to categorize each lesion as either T1-2 or T3-4 based on the established TNM staging system. Interobserver agreement for MR T-staging among the three radiologists was calculated. To facilitate accurate lesion segmentation, the ROI encompassing the entire rectal lesion was manually delineated in a meticulous slice-by-slice manner on the T2WI using ITK-SNAP 4.0.0 software¹. The delineated borders, representing the ground truth (GT), were meticulously determined by consensus among the experts. In cases of any discrepancies or differences of opinion, a thorough discussion ensued until a consensus was reached, requiring the agreement of at least two experts.

Dataset and pre-processing

A dataset comprising 317 MRI scans of RC and their corresponding T-stage labels was extracted and subsequently divided into a training set (n = 265) and a test set (n = 52) using a random allocation in a ratio of approximately 5:1. For the segmentation task, we utilized the preprocessing pipeline of nn-UNet (23–25), which could select the suitable data fingerprint automatically. We adopted the data preprocessing strategy through data fingerprint information, including resampling strategy, cropping area size, gray value distribution, etc. information, thus forming a so-called “configuration plan.” While for the classification task, to ensure consistency, all images underwent preprocessing too from the “configuration plan,” including resampling to a target spacing of [0.36, 0.36, 0.36] mm. Additionally, the size of each imaging scan was adjusted by cropping or padding to achieve a uniform dimension of 384 × 384 × 64.

DL model construction

The U-Net architecture (23), introduced in 2015 as an Encoder–Decoder model (24), made a significant impact in the field of medical segmentation, generating widespread enthusiasm. Subsequent studies have primarily concentrated on maximizing the potential of U-Net and enhancing its performance through various modifications. Presently, UNet-like Encoder–Decoder architectures remain robust and highly regarded in the field. One notable variant, nn-UNet (25), exemplifies the remarkable qualities of U-Net as a self-configuring approach and pipeline for DL-based biomedical image segmentation, consistently delivering exceptional results.

Taking inspiration from these advancements, we incorporated a powerful and highly acclaimed network architecture, slightly modified by nn-UNet to be tailored for our rectal cancer data, as the backbone in Stage I of our study, where we trained it on a total of 265 cases. To adapt this network to our specific dataset and enable automatic segmentation of RC (Figure 2A), we rebuilt the training pipeline. We employed a larger dropout rate and more data augmentation strategy to prevent overfitting. In order to enhance the performance and generalizability of our model, we randomly divided the dataset with 5-fold cross-validation, implemented group normalization instead of batch normalization, and introduced larger convolution kernels. To evaluate the accuracy of the segmentation results, we calculated the dice similarity coefficient (DSC), the 95th percentile Hausdorff distance (HD95), and the average surface distance (ASD) between the automatically segmented images and GT images (26–28).

FIGURE 2

Figure 2. Structure diagram of the deep learning model. (A) The automated segmentation pipeline; (B) the classification pipeline. The first stage is to construct a segmentation flow chart to segment rectal cancer. The second stage is to build a classification flow chart and use the segmentation results of the first stage to do specific staging tasks.

In contrast to the segmentation sub-task, the classification sub-task focuses on feature extraction after the convolution stage without the need to restore the features to their original size, which is one of the differences between the classification and segmentation tasks. Thus, in Stage II of our study, we designed an appropriate encoder as the backbone, which is the encoder of the 3D UNet, to extract features after convolution. For the output layer, we incorporated a multi-layer perception network to classify the T stages of RC. The flowchart of the classification task is depicted in Figure 2B. To facilitate T-stage classification in Stage II, we introduced a simple and easily manageable padding-cropping strategy. This involved utilizing the segmentation results obtained from Stage I and treating them as the input for the classification task, following the process outlined in Figure 2B. To construct our DL models, we divided the dataset into three categories based on the rectal filling status: model 1 trained on the complete training set of 265 cases, model 2 trained exclusively on rectal-filling cases, and model 3 trained using the non-filling dataset. The construction details of these models are provided in Supplementary material.

Statistical analysis

To perform the statistical analysis, we employed two software tools: MedCalc (version 19.8, MedCalc Software, Mariakerke, Belgium) and the R package (version 4.1.3, Vienna, Austria). Normality testing of all continuous variables was conducted using the Kolmogorov–Smirnov test to assess their distribution. Categorical data were compared using either the Pearson Chi-square test or the Fisher’s exact test, depending on the expected cell counts. For continuous variables, presented as mean ± standard deviation, comparisons were made using either the Student’s t-test for normally distributed data or the Kruskal–Wallis H test for variables with non-normal distributions. To comprehensively evaluate the diagnostic performance of the T-staging classification models, we employed rigorous statistical techniques. Receiver operating characteristic (ROC) curve analysis and the confusion matrix were utilized to assess the models’ discriminative abilities in the independent test datasets. Essential performance measures, such as sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (PLR), and negative likelihood ratio (NLR), were determined to provide a comprehensive understanding of the models’ diagnostic values. Furthermore, to compare the classification models with subjective evaluation, we conducted net reclassification index (NRI) analysis. To gauge the clinical significance of the models, decision curve analysis (DCA) was performed, allowing us to calculate the net benefit. Statistical significance was established at a two-sided p-value less than 0.05, indicating strong evidence for significance.

Results

Patients’ characteristics

A comprehensive overview of patient demographic characteristics can be found in Table 1. After thorough evaluation, a total of 317 patients were included in the final analysis. Among them, 158 out of 317 cases (49.8%) underwent rectal filling, while the remaining 159 cases (50.2%) were in the non-filling group. Notably, there were no significant differences observed between these two cohorts of patient demographic characteristics (p > 0.05). The subsequent examination of the 52 test cases revealed an equal distribution of 26 cases each in both the filling and non-filling groups. Importantly, no statistically significant differences in T stage were detected between the filling and non-filling groups (T1-2/T3-4: 14/12 vs. 10/16, p = 0.404). Moreover, it is noteworthy that none of the cases exhibited positive circumferential resection margin (CRM) involvement.

TABLE 1

Table 1. Pathological characteristics of patients.

Automatic segmentation results

In Stage I, we developed a segmentation pipeline utilizing DL models, which can be succinctly referred to as nn-UNet. These automatic segmentation models exhibited exceptional performance in the test datasets, as illustrated in Figure 3. For the overall test dataset, the median values of DSC, HD95, and ASD were 0.835, 2.236 mm, and 0.647 mm, respectively. In the rectal-filling cases, the median values were 0.862 for DSC, 2.118 mm for HD95, and 0.584 mm for ASD. Conversely, in the non-filling cases, the median values were 0.807 for DSC, 3.000 mm for HD95, and 0.879 mm for ASD. Notably, the DSC and ASD values were higher in the rectal-filling dataset (p = 0.006 and p = 0.017, respectively). The detailed results of the automatic segmentation are presented in Table 2 and Supplementary Figure S1.

FIGURE 3

Figure 3. Representative diagram of automatic segmentation.

TABLE 2

Table 2. Automatic segmentation results.

Classification performance

In Stage II, concentrating on T-staging classification, the subjective evaluation yielded area under the curve (AUC) values of 0.735, 0.810, and 0.713 for the total, filling, and non-filling test datasets, respectively. The corresponding accuracies were 0.731, 0.808, and 0.692. Interobserver agreement for MR T-staging among the three radiologists is presented in Supplementary Table S2. Notably, the DL models outperformed the subjective evaluation across all test datasets. Model 1 achieved AUC values of 0.902, 0.958, and 0.900 for the total, filling, and non-filling test datasets, respectively. The accuracies were 0.846, 0.962, and 0.808, respectively. Model 2, trained on rectal-filling cases, exhibited an AUC of 0.946 and an accuracy of 0.885 in the filling test dataset. Model 3, trained exclusively on non-filling cases, demonstrated an AUC of 0.863 and an accuracy of 0.885 in the non-filling test dataset. Comprehensive ROC analyses are presented in Table 3, and the corresponding curves are depicted in Figure 4.

TABLE 3

Table 3. ROC curve analysis and comparison in the test dataset.

FIGURE 4

Figure 4. ROC curves in the test dataset. (A) Total test dataset; (B) rectal-filling test dataset; (C) non-filling test dataset.

Model comparison and clinical utility

Compared to subjective evaluation for RC T-staging, NRIs of DL-models were 0.167 to 0.310, demonstrating an improved clinical utility in all datasets (Table 3).

Considering the influence of rectal filling or non-filling, the confusion matrix highlighted the superior classification performance of Model 1 in the rectal-filling dataset compared to Model 1 in the non-filling dataset. Likewise, the performance of Model 2 in the filling dataset outperformed that of Model 3 in the non-filling dataset (Figure 5). The net clinical advantage of Model 1 over Model 2 in the rectal-filling dataset and Model 1 over Model 3 in the non-filling dataset is illustrated by the DCA chart (Figure 6). Overall, Model 1 in the rectal-filling dataset demonstrated notably improved diagnostic performance.

FIGURE 5

Figure 5. Confusion matrices.

FIGURE 6

Figure 6. DCA in filling and non-filling test datasets. Results for the rectal-filling dataset. (A) The net benefit analysis showed that for p probability thresholds ranging from 0.25 to 0.88 in the test dataset, Model 1 provided greater benefits compared to Model 2 assessment. Moreover, Model 1 exhibited larger net benefits when compared to all/no intervention methods. (B) Results for the non-filling dataset. The net benefit analysis demonstrated that for P probability thresholds ranging from 0.42 to 0.91 in the test dataset, Model 1 yielded additional benefits compared to Model 3 assessment. Furthermore, Model 1 showed larger net benefits when compared to all/no intervention methods.

Discussion

In this study, we developed an advanced and automated segmentation model based on nn-UNet to achieve precise segmentation of rectal adenocarcinomas from T2W images, particularly in the rectal-filling dataset. Subsequently, we constructed DL-based classification models that exhibited significantly improved performance in T-staging classification compared to subjective evaluation for RC cases. Notably, Model 1, trained on the total training dataset, demonstrated higher AUC and accuracy in the rectal-filling cohort. To the best of our knowledge, this is the first investigation into the impact of rectal filling on DL models, highlighting its influence on classification performance.

In the current landscape of medical practice, rectal MRI has been widely endorsed as the optimal approach for preoperative T-staging in RC. However, its diagnostic accuracy is compromised by the connective tissue hyperplastic response in the surrounding rectal mesenteric fat, leading to indistinct tumor boundaries. This limitation of traditional MRI techniques in distinguishing between T2 and T3 stages has been well-documented in previous studies (3–5), and our own results align with these observations. Our study conducted a comprehensive ROC analysis, revealing that the subjective discrimination of preoperative T-stage by radiologists was significantly inferior to the proposed DL model. The accuracies of radiologists’ assessments ranged from 69.2 to 80.8%. Additionally, the net reclassification index (NRI) analysis demonstrated improved classification performance achieved by employing DL approaches, while decision curve analysis (DCA) highlighted the favorable clinical usefulness of these models. These findings can be attributed to the inherent challenges faced by radiologists in accurately interpreting irregular tumor shapes and blurred boundaries. Therefore, the accurate identification and precise segmentation of lesions serve as crucial prerequisites for future research endeavors aimed at advancing preoperative evaluation and staging methodologies in RC.

The routine utilization of manual or semi-manual segmentation methods is often plagued by inherent challenges, including their arduous and time-consuming nature, as well as their heavy reliance on operator expertise (16, 17). In recent years, several studies have explored the application of 2D convolutional neural network (CNN) models for the discrimination of T2 and T3 stages using 2D MR images (29, 30). However, these approaches introduce an additional burden on radiologists, as they require manual selection of a representative slice (2D) from each MR volume (3D). This manual selection step adds complexity and potential subjectivity to the process. Hou et al. conducted a study where they developed a DL model using 3D T2W images, achieving an impressive AUC value of 0.869 (31). However, it is important to note that the segmentation process in their research was carried out manually, which may introduce subjectivity and potential variability. In a separate study by Wei et al., a multi-parametric MR image fusion model was employed, achieving an AUC of 0.854. This approach involved the manual determination of the location and size of a 3D bounding box containing the tumor (32).

In Stage I, we successfully developed an automatic segmentation model for rectal adenocarcinomas using a 3D nn-UNet architecture. As a standardized and dataset-agnostic framework, nnU-Net was proposed as a robust and powerful tool for medical image segmentation. The results demonstrated impressive performance, with median values of 0.807–0.862 for DSC, 2.118–3.000 for HD95, and 0.584–0.879 for ASD in the test dataset. To enhance the robustness and generalization of the model while avoiding overfitting, we employed a data augmentation strategy along with 5-fold cross-validation. Furthermore, two experienced radiologists carefully examined the visualizations of the segmentation results, and no noticeable segmentation errors were detected. The implementation of this method bears the potential to serve as a viable replacement for the prevailing manual segmentation method, which is notorious for its time-consuming nature and lack of reproducibility. Subsequently, we conducted additional evaluations on the total test dataset, as well as the rectal-filling and non-filling datasets within the test set. Our findings revealed that the DSC and ASD values were significantly better in the rectal-filling datasets compared to both the total datasets and the non-filling datasets (p = 0.006 and p = 0.017, respectively). These results suggest that the model exhibited a tendency towards better performance and metrics in rectal-filling cases.

In the Stage II of this study, we introduced a 3D CNN to classify RC lesions as T1-2 or T3-4 stages on HR-T2WI. For the classification models, we utilized widely-used UNet-like Encoder–Decoder architectures. We recognized that directly inputting the original images into the models would make it challenging to distinguish between T1-2 and T3-4 stages, as the models might concentrate on areas other than the cancer of interest. To address this concern, we devised a novel approach using the information from the automated segmentation results obtained in Stage I. We combined the original MRI of RC with its corresponding segmentation result, incorporating them as the complete input. This approach involved employing a region-of-interest cropping strategy, as mentioned earlier. Our initial experiments demonstrated the effectiveness and correctness of this approach compared to solely using the original MRI of RC with a center-cropping strategy. We believe that the center-cropping strategy may not accurately select the cancerous region, as the cancer might not always be located in the center of every image. Although expanding the cropping area could be considered, it would introduce redundant information that is not helpful. Therefore, our cropping strategy, as described above, represents a promising approach for precisely selecting the cancerous region in each original image.

A distinctive feature of our study was its groundbreaking exploration and validation of DL models for preoperative T staging in RC, with a particular focus on the influence of rectal filling. To the best of our knowledge, this was the first endeavor to address this specific aspect, shedding new light on the application of DL in this context.

Our study encompassed the evaluation of automatic segmentation models for rectal adenocarcinomas across three distinct datasets: the total dataset, the rectal-filling dataset, and the non-filling dataset. Following this, three DL models were trained using these datasets to explore their performance. Through a comprehensive analysis involving segmentation results, ROC evaluation, confusion metrics, and DCA, a noteworthy finding emerged: Model 1 exhibited superior performance specifically in the rectal-filling dataset. These results underscore the additional benefits conferred by the use of rectal contrast material in DL models. Previous research has already demonstrated the advantages of rectal filling, including improved lesion visualization and enhanced evaluation of tumor penetration on MRI (2). Furthermore, our previous study has corroborated the value of rectal-filling in accurately delineating rectal lesions and distinguishing them from normal rectal tissue, thus facilitating precise segmentation (6). This likely explains the higher performance observed in the DL model trained on the rectal-filling dataset compared to the non-filling dataset.

Despite the notable contributions of our study, several limitations warrant consideration. Firstly, our dataset consisted solely of HR-T2WI of RC, lacking the inclusion of other imaging modalities. Moreover, being a retrospective single-center study, potential selection biases may have influenced our findings. Thus, for further validation and to enhance the generalizability of our results, larger datasets and multi-center studies incorporating diverse imaging modalities are necessary. Secondly, it is crucial to acknowledge that factors, such as extramural vascular invasion (EMVI), lymph node metastasis (LNM), and mesorectal fascia (MRF) significantly influence the prognosis and survival of RC patients (2–4, 33). While the impact of rectal luminal distention on DL-models pertaining to MRF, EMVI, and LNM remains a topic of debate, further investigation is essential to elucidate these associations comprehensively. Thirdly, an important consideration is the generalizability of our findings to lesions that have undergone neoadjuvant treatment, as this aspect remains elusive and requires further clarification. Finally, the current investigation primarily employed CNN-based models, which are known to perform well with small datasets. However, we did not explore the use of transformer-based models, which are better suited for larger datasets. Therefore, future research should encompass the incorporation of transformer-based models to leverage their potential in handling larger datasets effectively.

Conclusion

Leveraging high-resolution rectal MR imaging, we developed a DL-based segmentation model to automatically extract the region of RC. Subsequently, we constructed DL-based classification models to explore an innovative approach for preoperative T-staging of RC using DL networks. Through a comprehensive comparison, we observed that the DL models exhibited superior predictive capabilities compared to subjective evaluation, particularly in distinguishing between T1-2 and T3-4 stages in the test dataset with rectal-filling. These findings strongly indicate that the DL model, augmented by rectal-filling, holds significant potential as an optimal tool for guiding clinical practice in the preoperative T-staging of RC patients.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Shanghai Changhai Hospital, Naval Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

CT: Data curation, Formal analysis, Writing – original draft. XM: Data curation, Methodology, Software, Writing – review & editing. HL: Data curation, Writing – review & editing. QW: Data curation, Writing – review & editing. CS: Conceptualization, Writing – review & editing. YY: Conceptualization, Formal analysis, Investigation, Methodology, Software, Supervision, Writing – review & editing. FS: Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Software, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported by the Guhai project of Changhai hospital (GH145-09) and the “Yi Yuan Xin Xing” young medical talents funding project of Shanghai (project’s number: N/A).

Acknowledgments

We thank Cheng Li and Xiaolin Meng for providing the DL algorithm support in the Research and Advanced Algorithm Department of HSW BU, Shanghai United Imaging Healthcare Co., Ltd., Shanghai, China.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2023.1326324/full#supplementary-material

Abbreviations

RC, Rectal cancer; TME, Total mesorectal excision; DL, Deep learning; T2WI, T2-weighted imaging; VOI, Volume of interest; ROC, Receiver operating characteristic; AUC, Area under the ROC curve; MRF, Mesorectal fascia; EMVI, Extramural vascular invasion; LNM, Lymph node metastasis; DSC, Dice similarity coefficient; HD95, 95th percentile Hausdorff distance; ASD, Average surface distance.

Footnotes

1. ^http://www.itksnap.org/

References

1. Siegel, RL, Miller, KD, Wagle, NS, and Jemal, A. Cancer statistics. CA Cancer J Clin. (2023) 73:17–48. doi: 10.3322/caac.21763

CrossRef Full Text | Google Scholar

2. Gollub, MJ, Lall, C, Lalwani, N, and Rosenthal, MH. Current controversy, confusion, and imprecision in the use and interpretation of rectal MRI. Abdom Radiol. (2019) 44:3549–58. doi: 10.1007/s00261-019-01996-3

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Horvat, N, Carlos Tavares Rocha, C, Clemente Oliveira, B, Petkovska, I, and Gollub, MJ. MRI of rectal cancer: tumor staging, imaging techniques, and management. Radiographics. (2019) 39:367–87. doi: 10.1148/rg.2019180114

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Nougaret, S, Reinhold, C, Mikhael, HW, Rouanet, P, Bibeau, F, and Brown, G. The use of MR imaging in treatment planning for patients with rectal carcinoma: have you checked the “DISTANCE”? Radiology. (2013) 268:330–44. doi: 10.1148/radiol.13121361

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Maas, M, Lambregts, DMJ, Lahaye, MJ, Beets, GL, Backes, W, Vliegen, RFA, et al. T-staging of rectal cancer: accuracy of 3.0 tesla MRI compared with 1.5 tesla. Abdom Imaging. (2012) 37:475–81. doi: 10.1007/s00261-011-9770-5

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Yuan, Y, Lu, H, Ma, X, Chen, F, Zhang, S, Xia, Y, et al. Is rectal filling optimal for MRI-based radiomics in preoperative T staging of rectal cancer? Abdominal Radiol. (2022) 47:1741–9. doi: 10.1007/s00261-022-03477-6

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Li, M, Zhang, J, Dan, Y, Yao, Y, Dai, W, Cai, G, et al. A clinical-radiomics nomogram for the preoperative prediction of lymph node metastasis in colorectal cancer. J Transl Med. (2020) 18:46. doi: 10.1186/s12967-020-02215-0

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Huang, YQ, Liang, CH, He, L, Tian, J, Liang, CS, Chen, X, et al. Development and validation of a radiomics nomogram for preoperative prediction of lymph node metastasis in colorectal cancer. J Clin Oncol. (2016) 34:2157–64. doi: 10.1200/JCO.2015.65.9128

CrossRef Full Text | Google Scholar

9. Ma, X, Shen, F, Jia, Y, Xia, Y, Li, Q, and Lu, J. MRI-based radiomics of rectal cancer: preoperative assessment of the pathological features. BMC Med Imaging. (2019) 19:86. doi: 10.1186/s12880-019-0392-7

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Liu, Z, Wang, Y, Shen, F, Zhang, Z, Gong, J, Fu, C, et al. Radiomics based on readout-segmented echo-planar imaging (RS-EPI) diffusion-weighted imaging (DWI) for prognostic risk stratification of patients with rectal cancer: a two-center, machine learning study using the framework of predictive, preventive, and personalized medicine [J]. EPMA J. (2022) 13:633–47. doi: 10.1007/s13167-022-00303-3

CrossRef Full Text | Google Scholar

11. Jing, G, Chen, Y, Ma, X, Li, Z, Lu, H, Xia, Y, et al. Predicting mismatch-repair status in rectal cancer using multiparametric mri-based radiomics models: a preliminary study. Biomed Res Int. (2022) 2022:1–11. doi: 10.1155/2022/6623574

CrossRef Full Text | Google Scholar

12. Li, Z, Li, S, Zang, S, Ma, X, Chen, F, Xia, Y, et al. Predicting treatment response to neoadjuvant chemoradiotherapy in rectal mucinous adenocarcinoma using an MRI-based radiomics nomogram. Front Oncol. (2021) 11:671636. doi: 10.3389/fonc.2021.671636

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Liu, M, Ma, X, Shen, F, Xia, Y, Jia, Y, and Lu, J. MRI-based radiomics nomogram to predict synchronous liver metastasis in primary rectal cancer patients. Cancer Med. (2020) 9:5155–63. doi: 10.1002/cam4.3185

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Dong, D, Fang, MJ, Tang, L, Shan, XH, Gao, JB, Giganti, F, et al. Deep learning radiomic nomogram can predict the number of lymph node metastasis in locally advanced gastric cancer: an international multicenter study. Ann Oncol. (2020) 31:912–20. doi: 10.1016/j.annonc.2020.04.003

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Wang, Y, Liu, W, Yu, Y, Liu, JJ, Xue, HD, Qi, YF, et al. CT radiomics nomogram for the preoperative prediction of lymph node metastasis in gastric cancer. Eur Radiol. (2020) 30:976–86. doi: 10.1007/s00330-019-06398-z

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Chen, Q, Zhang, L, Liu, S, You, J, Chen, L, Jin, Z, et al. Radiomics in precision medicine for gastric cancer: opportunities and challenges. Eur Radiol. (2022) 32:5852–68. doi: 10.1007/s00330-022-08704-8

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Qin, Y, Deng, Y, Jiang, H, Hu, N, and Song, B. Artificial intelligence in the imaging of gastric cancer: current applications and future direction. Front Oncol. (2021) 11:631686. doi: 10.3389/fonc.2021.631686

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Sahiner, B, Pezeshk, A, Hadjiiski, LM, Wang, X, Drukker, K, Cha, KH, et al. Deep learning in medical imaging and radiation therapy. Med Phys. (2019) 46:e1–e36. doi: 10.1002/mp.13264

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Chen, C, Qin, C, Qiu, H, Tarroni, G, Duan, J, Bai, W, et al. Deep learning for cardiac image segmentation: a review. Front Cardiovasc Med. (2020) 7:25. doi: 10.3389/fcvm.2020.00025

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Bandyk, MG, Gopireddy, DR, Lall, C, Balaji, KC, and Dolz, J. MRI and CT bladder segmentation from classical to deep learning based approaches: current limitations and lessons. Comput Biol Med. (2021) 134:104472. doi: 10.1016/j.compbiomed.2021.104472

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Amin, MB, Greene, FL, Edge, SB, Compton, CC, Gershenwald, JE, Brookland, RK, et al. The eighth edition AJCC cancer staging manual: continuing to build a bridge from a population-based to a more personalized approach to cancer staging. CA Cancer J Clin. (2017) 67:93–9. doi: 10.3322/caac.21388

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Kaur, H, Choi, H, You, YN, Rauch, GM, Jensen, CT, Hou, P, et al. MR imaging for preoperative evaluation of primary rectal cancer: practical considerations. Radiographics. (2012) 32:389–409. doi: 10.1148/rg.322115122

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Ronneberger, Olaf, Fischer, Philipp, and Brox, Thomas. U-net: convolutional networks for biomedical image segmentation medical image computing and computer-assisted intervention-MICCAI, (2015): 18th international conference, Munich, Germany, October 5–9, 2015, Springer International Publishing

Google Scholar

24. Badrinarayanan, V, Kendall, A, and Cipolla, R. SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell. (2017) 39:2481–95. doi: 10.1109/TPAMI.2016.2644615

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Isensee, F, Jaeger, PF, Kohl, SAA, Petersen, J, and Maier-Hein, KH. nnU-net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods. (2021) 18:203–11. doi: 10.1038/s41592-020-01008-z

CrossRef Full Text | Google Scholar

26. Krithika Alias AnbuDevi, M, and Suganthi, K. Review of semantic segmentation of medical images using modified architectures of UNET. Diagnostics. (2022) 12:64. doi: 10.3390/diagnostics12123064

CrossRef Full Text | Google Scholar

27. Taha, AA, and Hanbury, A. An efficient algorithm for calculating the exact Hausdorff distance. IEEE Trans Pattern Anal Mach Intell. (2015) 37:2153–63. doi: 10.1109/TPAMI.2015.2408351

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Yeghiazaryan, V, and Voiculescu, I. Family of boundary overlap metrics for the evaluation of medical image segmentation. J Med Imaging. (2018) 5:015006. doi: 10.1117/1.JMI.5.1.015006

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Wu, QY, Liu, SL, Sun, P, Li, Y, Liu, GW, Liu, SS, et al. Establishment and clinical application value of an automatic diagnosis platform for rectal cancer T-staging based on a deep neural network. Chin Med J. (2021) 134:821–8. doi: 10.1097/CM9.0000000000001401

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Jian, J, Xiong, F, Xia, W, Zhang, R, Gu, J, Wu, X, et al. Fully convolutional networks (FCNs)-based segmentation method for colorectal tumors on T2-weighted magnetic resonance images. Australas Phys Eng Sci Med. (2018) 41:393–401. doi: 10.1007/s13246-018-0636-9

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Hou, M, Zhou, L, and Sun, JH. Deep-learning-based 3D super-resolution MRI radiomics model: superior predictive performance in preoperative T-staging of rectal cancer [J]. Eur Radiol. (2023) 33:1–10. doi: 10.1007/s00330-022-08952-8

CrossRef Full Text | Google Scholar

32. Wei, Y, Wang, H, Chen, Z, et al. Deep learning-based multiparametric MRI model for preoperative T-stage in rectal cancer [published online ahead of print, 2023 Jun 27]. Magn Reson Imaging. (2023) 2:28856. doi: 10.1002/jmri.28856

CrossRef Full Text | Google Scholar

33. Gollub, MJ, Arya, S, Beets-Tan, RG, et al. Use of magnetic resonance imaging in rectal cancer patients: Society of Abdominal Radiology (SAR) rectal cancer disease-focused panel (DFP) recommendations 2017. Abdom Radiol. (2018) 43:2893–902. doi: 10.1007/s00261-018-1642-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: rectal cancer, T staging, MRI, deep learning, rectal filling

Citation: Tian C, Ma X, Lu H, Wang Q, Shao C, Yuan Y and Shen F (2023) Deep learning models for preoperative T-stage assessment in rectal cancer using MRI: exploring the impact of rectal filling. Front. Med. 10:1326324. doi: 10.3389/fmed.2023.1326324

Received: 23 October 2023; Accepted: 14 November 2023;
Published: 29 November 2023.

Edited by:

Jie-Zhi Cheng, Shanghai United Imaging Intelligence, Co., Ltd., China

Reviewed by:

Yi Xiao, Shanghai Changzheng Hospital, China
Zhang Shi, Fudan University, China
Yiqun Sun, Fudan University Shanghai Cancer Center, China

Copyright © 2023 Tian, Ma, Lu, Wang, Shao, Yuan and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuan Yuan, eXVhbnl1YW4xOTg3MDEwOEAxNjMuY29t; Fu Shen, c3NmZl81M0AxNjMuY29t

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Deep learning models for preoperative T-stage assessment in rectal cancer using MRI: exploring the impact of rectal filling

Background

Methods

Participants

Clinicopathologic data

Image acquisition and analysis

Dataset and pre-processing

DL model construction

Statistical analysis

Results

Patients’ characteristics

Automatic segmentation results

Classification performance

Model comparison and clinical utility

Discussion

Conclusion

Data availability statement

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

Abbreviations

Footnotes

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good