A computer-aided system improves the performance of endoscopists in detecting colorectal polyps: a multi-center, randomized controlled trial

Zhang, Heng; Wu, Qi; Sun, Jing; Wang, Jing; Zhou, Lei; Cai, Wei; Zou, Duowu

doi:10.3389/fmed.2023.1341259

CLINICAL TRIAL article

Front. Med., 24 January 2024

Sec. Gastroenterology

Volume 10 - 2023 | https://doi.org/10.3389/fmed.2023.1341259

Heng Zhang¹†

Qi Wu²†

Jing Sun³

Jing Wang²

Lei Zhou¹

Wei Cai⁴*

Duowu Zou³*

¹Department of Gastroenterology, The Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
²Endoscopy Center, Peking University Cancer Hospital and Institute, Beijing, China
³Department of Gastroenterology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
⁴Department of Gastrointestinal Surgery, The Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China

Background: Up to 45.9% of polyps are missed during colonoscopy, which is the major cause of post-colonoscopy colorectal cancer (CRC). Computer-aided detection (CADe) techniques based on deep learning might improve endoscopists’ performance in detecting polyps. We aimed to evaluate the effectiveness of the CADe system in assisting endoscopists in a real-world clinical setting.

Methods: The CADe system was trained to detect colorectal polyps, recognize the ileocecal region, and monitor the speed of withdrawal during colonoscopy in real-time. Between 17 January 2021 and 16 July 2021, we recruited consecutive patients aged 18–75 years from three centers in China. We randomized patients 1:1 to colonoscopy with the CADe system or to unassisted colonoscopy (control). The primary outcomes were the sensitivity and specificity of the endoscopists. We used subgroup analysis to examine the polyp detection rate (PDR) and the miss detection rate of endoscopists.

Results: A total of 1,293 patients were included. The sensitivity of the endoscopists in the experimental group was significantly higher than that of the control group (84.97 vs. 72.07%, p < 0.001), and the specificity of the endoscopists in these two groups was comparable (100.00 vs. 100.00%). In a subgroup analysis, the CADe system improved the PDR of the 6–9 mm polyps (18.04 vs. 13.85%, p < 0.05) and reduced the miss detection rate, especially between 10:00 and 12:00 (12.5 vs. 39.81%, p < 0.001).

Conclusion: The CADe system can potentially improve the sensitivity of endoscopists in detecting polyps, reduce the missed detection of polyps in colonoscopy, and reduce the risk of CRC.

Registration: This clinical trial was registered with the Chinese Clinical Trial Registry (Trial Registration Number: ChiCTR2100041988).

Clinical trial registration: website www.chictr.org.cn, identifier ChiCTR2100041988.

Introduction

Colorectal cancer (CRC) is the third most common tumor worldwide and one of the leading causes of cancer-related deaths (1). CRC has a long latent period with no obvious symptoms in the early stages, so the majority of patients are not diagnosed until the disease has reached an advanced stage. According to the 2020 colorectal cancer statistics, the 5-year survival rate of patients diagnosed with advanced colorectal cancer is only 12%. However, if CRC is diagnosed and treated at an early stage, the 5-year survival rate of patients is more than 90% (2). Early diagnosis and subsequent treatment can effectively reduce mortality. Colonoscopy is the main method for screening for colorectal neoplasia and precancerous lesions. However, a meta-analysis of 43 tandem colonoscopy studies showed that up to 25% of colorectal neoplasia is missed during colonoscopy screening (3), which is the most relevant cause of post-colonoscopy colorectal cancer (4, 5).

Adenomatous polyps are the most common colorectal precancerous lesions, and they can be detected and removed by endoscopic procedures to prevent the occurrence and development of CRC. However, studies have shown that 45.9% of polyps are missed during colonoscopy (6). The primary cause of the high miss rate may be incomplete exposure of the colonic mucosal surface: lesions may be hidden behind folds or food debris and not easily visualized. In addition, colonoscopy is technically demanding, requiring endoscopists to operate the instrument and make diagnoses at the same time. Less experienced endoscopists may easily overlook some lesions. Even experienced endoscopists may miss a non-obvious lesion due to a lapse in concentration or fatigue (7, 8).

Artificial intelligence (AI) is an emerging science and technology that enables machines to simulate specific human thought processes and behaviors (9, 10). The advantage of AI lies in its ability to store more information and quickly parse the available data to perform complex visual perception tasks (11, 12). In 2016, deep learning algorithms were applied to various medical images, starting with diabetic retinopathy and pulmonary nodules. They also play an indispensable role in assisting doctors to diagnose diseases (13–18). Colonoscopy video runs at 25–30 frames per second, and a lesion may appear in only a few frames, which is one of the main reasons why endoscopists fail to detect lesions (19). An AI system has an advantage here: it can process a large amount of image information in real time without fatigue and can detect subtle changes that are difficult for the human eye to distinguish. Some progress has been made in colonoscopy quality control and polyp detection (20–23). One study has shown that AI as an adjunct to colonoscopy can significantly improve the detection rate of colorectal neoplasia (24). Another meta-analysis showed that computer-aided detection (CADe) techniques based on AI significantly improved the adenoma detection rate over other techniques aimed at improving mucosal visualization, such as chromoendoscopy or narrow-band imaging (25). However, several key factors still need to be addressed before AI can be implemented clinically. One of them is that an AI model requires a sufficient number of annotated endoscopic images to achieve optimal performance and to generalize well, which may be challenging for colorectal polyp detection because of the diversity of polyps and the need for expert annotation.

In this study, we used a novel CADe system (EndoAngel, Wuhan ENDOANGEL Medical Technology Co., Ltd.), which is capable of detecting colorectal polyps, recognizing the ileocecal region, and monitoring the speed of withdrawal during colonoscopy in real-time. This multicenter, prospective, randomized, controlled study aimed to evaluate the sensitivity and specificity of endoscopists with and without the assistance of the CADe system in a real-world clinical setting.

Methods

Study design and participants

This parallel, randomized, multi-center study was conducted at three Chinese endoscopy centers. Patients were eligible if they were aged 18–75 years, needed diagnostic or screening colonoscopy, were able to sign written informed consent, and had full legal capacity. Exclusion criteria were contraindications to colonoscopy (history of acute myocardial infarction within 6 months, severe hepatic insufficiency, renal failure, and mental disorders), use of anticoagulants (aspirin, warfarin, etc.), known polyposis syndromes, familial polyposis, inflammatory bowel disease, known or highly suspected colorectal cancer, or colorectal surgery. Patients who were currently pregnant or participating in other clinical trials were also excluded. We obtained written informed consent from all patients before the colonoscopy. Our study followed the recommendations of the Consolidated Standards of Reporting Trials statement for reporting randomized controlled trials. This study was approved by the ethics committees of Ruijin Hospital of Shanghai Jiao Tong University School of Medicine, Beijing Cancer Hospital, and the Central Hospital of Wuhan. The study was registered under trial registration number ChiCTR2100041988 with the Chinese Clinical Trial Registry.¹

Randomization and masking

All eligible patients were randomly allocated (1:1) to receive either white light colonoscopy with the assistance of the CADe (experimental group) or without the assistance of the CADe (control group). We used computer-generated random numbers with no restrictions to determine each participant’s assignment. The randomization was done in blocks of four. The random assignment was blinded to the patients. The operating endoscopists were unaware of the overall study design and aims, but they were aware of the randomization status. Group allocations were concealed from data collectors and analysts.
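
As an illustration of 1:1 allocation in blocks of four, the following Python sketch builds a permuted-block sequence. The function name, the arm labels, and the fixed seed are assumptions made for the example; the text does not describe the software actually used to generate the trial's random numbers.

```python
import random


def permuted_block_sequence(n_patients, block_size=4, seed=2021):
    """Return a 1:1 allocation list built from independently shuffled blocks.

    Each block holds block_size / 2 'CADe' and block_size / 2 'control'
    assignments, so the two arms are balanced after every complete block.
    """
    if block_size % 2:
        raise ValueError("block_size must be even for 1:1 allocation")
    rng = random.Random(seed)  # the seed is illustrative only
    sequence = []
    while len(sequence) < n_patients:
        block = ["CADe", "control"] * (block_size // 2)
        rng.shuffle(block)
        sequence.extend(block)
    return sequence[:n_patients]


allocation = permuted_block_sequence(1360)
print(allocation[:8], allocation.count("CADe"), allocation.count("control"))
```

Because 1,360 is a multiple of four, every block is complete and the two arms receive the same number of assignments.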

Procedures

A novel deep learning-based system (EndoAngel, Wuhan ENDOANGEL Medical Technology Co., Ltd.) was used in this prospective study. The system was developed on a deep learning framework with the help of endoscopists and modelers. The details of training, validation, and testing of this CADe are presented in the Supplementary material. The CADe was connected to the endoscopy processor, receiving the digital image as input and outputting a blue box only when a suspected polyp was captured in the field of view. The CADe system was installed on a separate computer system, and the output of the system appeared on a second monitor that was connected to the primary monitor via a serial digital interface cable. During unassisted withdrawal, the second (CADe) monitor was turned off. During CADe-assisted withdrawal, the monitor was turned on.

The operating endoscopists were 21 staff members of the three endoscopy centers with colonoscopy experience of more than 1 year and a total volume of 100 colonoscopies. The endoscopes used in this study were manufactured by Olympus Optical. Before insertion, the operators were informed about the patient allocation.

Bowel preparation was assessed and graded on site by the endoscopists using the Boston Bowel Preparation Scale (BBPS); the BBPS score was recorded by an independent research assistant. BBPS scores from 0 to 3 were recorded for each of the three segments (descending colon, transverse colon, and ascending colon). After cecal intubation, the withdrawal time was measured in real time by the research assistant using a stopwatch. Cecal intubation was assessed by the endoscopists during the insertion procedure. The independent research assistant recorded whether polyps were detected and the location of polyps at each examination. If polyps were found, the routine diagnostic and treatment processes of each hospital were followed to decide whether to perform a polypectomy. The morphology of the colorectal polyps was classified according to the Paris Classification into protruding lesions [>2.5 mm elevated above the mucosal layer: pedunculated (0-Ip), sessile (0-Is), or semi-pedunculated (0-Isp)], superficial lesions [slightly elevated by <2.5 mm (0-IIa), flat (0-IIb), or slightly depressed (0-IIc)], and laterally spreading tumors (LSTs).

The raw videos from each examination were screened and further analyzed to generate a gold standard for polyp detection. An independent evaluation group was established and was in charge of the process. Two experts with colonoscopy experience of over 5 years and a total volume of over 3,000 colonoscopies independently reviewed all the raw videos and labeled whether an examination was a positive one (with polyp detected) or a negative one (no polyp detected). The number, size, and morphology of the polyps were recorded by the two experts by reviewing the raw videos. In case of disagreement between the two experts, a third expert with colonoscopy experience of over 8 years and a total volume of over 5,000 colonoscopies would arbitrate and perform the final diagnosis. The diagnostic performance of the CADe system was also evaluated. A research assistant recorded the diagnostic results of the system. The performance of the CADe system was evaluated against the gold standard.
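
The two-reader review with third-expert arbitration can be expressed as a small rule. The Python sketch below is only an illustration of that rule; the function name and the string labels are hypothetical and do not come from the study's software.

```python
def gold_standard_label(expert_a, expert_b, arbiter=None):
    """Combine two independent reads; a third expert settles disagreements."""
    if expert_a == expert_b:
        return expert_a
    if arbiter is None:
        raise ValueError("experts disagree: a third expert's read is required")
    return arbiter


# Agreement needs no arbitration; disagreement takes the arbiter's read.
print(gold_standard_label("positive", "positive"))
print(gold_standard_label("positive", "negative", arbiter="negative"))
```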

Outcomes

The primary outcomes were endoscopist sensitivity and specificity with and without the assistance of the CADe system. Sensitivity = true positive/(true positive + false negative); specificity = true negative/(true negative + false positive). Secondary outcomes were diagnostic coincidence rate, false positive rate, false negative rate, positive predictive value [PPV = true positive/(true positive + false positive)], negative predictive value [NPV = true negative/(true negative + false negative)], positive likelihood ratio, negative likelihood ratio (NLR), balanced F1 score, polyp detection rate (PDR), BBPS score, and withdrawal time.
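
The definitions above can be written out as a short Python sketch. The example counts are assembled from patient-level FAS figures reported in the Results for the experimental group (338 of 643 patients with polyps detected, 267 gold-standard negative patients, specificity of 100%); treating them as a single 2×2 table, and reading "diagnostic coincidence rate" as overall agreement with the gold standard, are assumptions made for illustration.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Compute diagnostic accuracy measures from 2x2 table counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "false_positive_rate": fp / (fp + tn),
        "false_negative_rate": fn / (fn + tp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        # The positive likelihood ratio is unbounded when specificity is 1.
        "positive_lr": (sensitivity / (1 - specificity)
                        if specificity < 1 else float("inf")),
        "negative_lr": (1 - sensitivity) / specificity,
        "f1": 2 * tp / (2 * tp + fp + fn),
        # Assumed meaning of "diagnostic coincidence rate": overall agreement.
        "coincidence_rate": (tp + tn) / (tp + fp + tn + fn),
    }


# Assumed table: 643 patients, 267 negative, so 376 positive, of whom 338 were detected.
metrics = diagnostic_metrics(tp=338, fp=0, tn=267, fn=376 - 338)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```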

Subgroup analysis of polyp detection

We further explored the PDR by stratifying the patients according to the location, size, and morphology of the polyps. Based on the gold standard, we evaluated the miss detection rate of endoscopists in the two groups stratified by the different time periods in a day.

Statistical analysis

Sample size

The sample size was calculated based on the evaluation of the primary outcomes. This study used a co-primary outcome design, and both primary outcomes had to be fulfilled. We determined the specificity and sensitivity indices in this study based on the literature of similar artificial intelligence products (22, 26). The superiority and non-inferiority margins were set according to the Chinese guidelines for the design of medical device clinical trials, combined with the characteristics of the products in this study. For the sensitivity of endoscopists in detecting polyps, the sensitivity with the help of CADe was estimated to be 0.94 and the sensitivity without CADe to be 0.80, with a superiority margin of 0.05; 217 polyps were required in each group. For the specificity of endoscopists in diagnosing polyps, both the CADe-assisted and the non-CADe-assisted specificity were estimated to be 0.95, with a non-inferiority margin of −0.05; 299 negative patients (no polyp detected) were required in each group. The PDR was estimated to be 45% based on our previous studies, so a total of 966 patients were needed for the sensitivity comparison and 1,088 for the specificity comparison. Taking the larger of the two and allowing for a 20% dropout rate, a total of 1,360 patients were invited to participate in this trial.
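
The arithmetic behind these figures can be traced with a short Python sketch. The text does not state the significance level, the power, or the formula used, so the one-sided α of 0.025, the power of 80%, and the normal-approximation formula below are assumptions; they are shown because they yield the per-group numbers quoted above, not because they are documented as the authors' method. Treating the 217 required polyps as 217 polyp-positive patients is likewise an assumption.

```python
from fractions import Fraction
from math import ceil, sqrt
from statistics import NormalDist


def n_per_group(p_new, p_ref, margin, alpha=0.025, power=0.80):
    """Per-group size for comparing two proportions against a margin.

    Normal approximation: the z_alpha term uses the pooled standard
    deviation and the z_beta term uses the unpooled one. alpha (one-sided)
    and power are assumed values, not taken from the article.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha)
    z_beta = NormalDist().inv_cdf(power)
    p_bar = (p_new + p_ref) / 2
    pooled_sd = sqrt(2 * p_bar * (1 - p_bar))
    unpooled_sd = sqrt(p_new * (1 - p_new) + p_ref * (1 - p_ref))
    effect = (p_new - p_ref) - margin  # distance from the margin
    return ceil(((z_alpha * pooled_sd + z_beta * unpooled_sd) / effect) ** 2)


n_positive = n_per_group(0.94, 0.80, margin=0.05)    # superiority (sensitivity)
n_negative = n_per_group(0.95, 0.95, margin=-0.05)   # non-inferiority (specificity)

pdr = Fraction(45, 100)       # expected share of polyp-positive patients
dropout = Fraction(20, 100)   # allowance for dropout

# Two groups; scale each requirement by the expected share of such patients.
total_for_sensitivity = 2 * ceil(n_positive / pdr)
total_for_specificity = 2 * ceil(n_negative / (1 - pdr))
invited = ceil(max(total_for_sensitivity, total_for_specificity) / (1 - dropout))

print(n_positive, n_negative, total_for_sensitivity, total_for_specificity, invited)
```

Exact fractions are used for the percentage steps so that the rounding up is not affected by floating-point error.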

Statistical analysis

Outcomes were analyzed in the FAS (full analysis set) and PPS (per-protocol set) populations. The FAS comprised eligible cases, including those who withdrew, but not excluded cases. Data from trials that were conducted and for which the primary outcome was available were entered into the FAS. The PPS included cases that met the study protocol, had good compliance, and completed all outcome evaluation indicators. The primary outcomes and metrics related to the miss detection rate were evaluated based on the gold standard generated by the expert panel. Other metrics were assessed based on the original data. Continuous variables were expressed as mean (SD) or median (IQR), according to their distribution, and categorical variables were expressed as n (%). Comparisons of proportions were done using the chi-square test and Fisher’s exact test. The Wilcoxon signed-rank test was used to compare the withdrawal time and BBPS score of the two groups. A negative binomial regression was used to compare the mean number of polyps in each patient. A two-tailed p-value of less than 0.05 was considered significant. Statistical analysis was performed using SAS 9.4.

Results

Patient enrollment and baseline data

Between 17 January 2021 and 16 July 2021, a total of 1,367 consecutive patients were recruited and assessed for eligibility (Figure 1). In total, 7 patients were excluded. Therefore, 1,360 patients were randomly allocated to either the experimental group (with the assistance of CADe) or the control group (without the assistance of CADe). A total of 1,293 patients were finally included in the FAS analysis (643 in the experimental group and 650 in the control group). Another 23 patients were further excluded, and 1,270 patients were finally included in the PP analysis.

Figure 1. The flow diagram of the eligibility of the patients.

The baseline information is presented in Table 1. There was no statistically significant difference between the two groups with respect to demographic information (age, height, weight, or sex) or other baseline information.

Table 1. Baseline characteristics.

Primary outcomes

Sensitivity comparison

The sensitivity of the endoscopists in detecting polyps with or without CADe was evaluated at the polyp level and the patient level, based on the gold standard generated by the expert panel. At the polyp level, a total of 1,011 and 1,110 polyps were detected in the experimental and control groups in the FAS analysis, respectively. The sensitivity of the endoscopists in the two groups was 84.97% (95% confidence interval [CI], 82.76–87.17%) and 72.07% (95% CI, 69.43–74.71%), respectively. The difference in sensitivity between the two groups was 12.89% (95% CI, 9.46–16.33%); the lower limit of this interval exceeded the 5% superiority margin (p < 0.001). In the PP analysis, a total of 993 and 1,093 polyps were detected in the experimental and control groups, respectively. The sensitivity of the endoscopists in the two groups was 84.99% (95% CI, 82.77–87.22%) and 72.37% (95% CI, 69.72–75.02%), p < 0.001. At the patient level, the sensitivity of the endoscopists in the two groups was 89.89% (95% CI, 86.85–92.94%) and 82.02% (95% CI, 78.28–85.76%) in the FAS analysis, and 89.67% (95% CI, 86.56–92.78%) and 82.32% (95% CI, 78.57–86.08%) in the PP analysis. In both the FAS and PPS analyses, and at both the polyp level and the patient level, the sensitivity of the endoscopists was significantly improved with the assistance of the CADe system, and superiority was established. The results are shown in Table 2.
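
The polyp-level intervals and the superiority check can be worked through in a few lines of Python. The numerators used here (859 of 1,011 and 800 of 1,110 polyps) are back-calculated from the reported percentages and are assumptions, as is the use of the simple Wald (normal-approximation) interval; the text does not state which interval method was applied.

```python
from math import sqrt
from statistics import NormalDist

Z = NormalDist().inv_cdf(0.975)  # two-sided 95% critical value


def wald_ci(successes, n):
    """Proportion with its Wald 95% confidence interval."""
    p = successes / n
    half_width = Z * sqrt(p * (1 - p) / n)
    return p, p - half_width, p + half_width


def difference_ci(x1, n1, x2, n2):
    """Difference of two independent proportions with its Wald 95% CI."""
    p1, p2 = x1 / n1, x2 / n2
    se = sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    diff = p1 - p2
    return diff, diff - Z * se, diff + Z * se


# Assumed counts, back-calculated from 84.97% of 1,011 and 72.07% of 1,110.
cade = wald_ci(859, 1011)
control = wald_ci(800, 1110)
diff, low, high = difference_ci(859, 1011, 800, 1110)

SUPERIORITY_MARGIN = 0.05
superiority_shown = low > SUPERIORITY_MARGIN

print([round(100 * v, 2) for v in cade])
print([round(100 * v, 2) for v in control])
print(round(100 * diff, 2), round(100 * low, 2), round(100 * high, 2), superiority_shown)
```

Superiority is judged on the lower limit of the interval for the difference, which must lie above the 0.05 margin.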

Table 2. Sensitivity and specificity of endoscopists with and without CADe.

Specificity comparison

The specificity of the endoscopists in detecting polyps with or without CADe was evaluated at the patient level in both the FAS and PP analyses. According to the gold standard, a total of 267 and 244 negative patients in the experimental and control groups, respectively, were included in the FAS analysis; the specificity of the endoscopists in these two groups was 100.00% (95% CI, 98.63–100.00%) and 100.00% (95% CI, 98.50–100.00%), respectively. In the PP analysis, 266 and 240 negative patients were included; the specificity of endoscopists in these two groups was 100.00% (95% CI, 98.62–100.00%) and 100.00% (95% CI, 98.47–100.00%), respectively. In both the FAS and PPS populations, the specificity of the endoscopists using the CADe system did not differ significantly from that of the control group; the difference in the lower limits of the 95% CI between the two groups was greater than −5%, and thus non-inferiority was established. The results are shown in Table 2.
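
When every negative patient is classified correctly, a normal-approximation interval collapses to a single point, so an exact binomial interval is the natural choice. The Python sketch below computes the Clopper-Pearson lower limit for that special case; the text does not name the interval method, so this is an assumption, offered because it agrees with the lower limits quoted above.

```python
def exact_lower_limit_all_correct(n, confidence=0.95):
    """Clopper-Pearson lower limit when all n cases are classified correctly.

    With x = n the lower limit solves p**n = alpha / 2, which gives
    (alpha / 2) ** (1 / n); the upper limit is 1.
    """
    alpha = 1 - confidence
    return (alpha / 2) ** (1 / n)


# Numbers of gold-standard negative patients reported for each analysis set.
for label, n in [("FAS, CADe", 267), ("FAS, control", 244),
                 ("PP, CADe", 266), ("PP, control", 240)]:
    print(f"{label}: n = {n}, lower limit = {100 * exact_lower_limit_all_correct(n):.2f}%")
```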

Secondary outcomes

At the polyp level, the sensitivity of the CADe system in the FAS and PPS was 99.25% (95% CI, 98.78–99.57%) and 99.28% (95% CI, 98.82–99.60%), respectively. At the patient level, the sensitivity of the CADe was 100.00%. Compared to the gold standard, the diagnostic coincidence rate, false positive rate, false negative rate, positive predictive value (PPV), positive likelihood ratio, and balanced F1 score of the CADe system were 60.48% (57.81, 63.14%), 100.00% (99.28, 100.00%), 0.00% (0.00, 0.47%), 60.48% (57.81, 63.14%), 1, and 76.03% (FAS analysis). The results are presented in Table 3.

Table 3. Computer-aided detection (CADe) system performance.

The BBPS scores of the experimental and control groups did not differ significantly in the FAS or PP analysis (7.19 [SD = 1.32] vs. 7.21 [SD = 1.37], p = 0.528; 7.19 [SD = 1.32] vs. 7.22 [SD = 1.37], p = 0.526). Details of the BBPS are presented in Tables 4 and 5. The withdrawal times of the two groups were 430.31 (SD = 111.06) s and 421.01 (SD = 100.83) s (p = 0.062) in the FAS analysis and 430.23 (SD = 111.62) s and 421.38 (SD = 101.72) s (p = 0.074) in the PPS analysis, with no significant difference in either. In the FAS analysis, the PDR was 52.57% (338/643) in the CADe-assisted group and 51.23% (333/650) in the control group, with no significant difference (p = 0.631). Similar results were found in the PPS analysis.
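
The PDR comparison can be checked from the reported counts. The Python sketch below uses a pooled two-proportion z-test, whose squared statistic equals the Pearson chi-square statistic (1 degree of freedom, no continuity correction) for the same 2×2 table; whether a continuity correction was applied in the original analysis is not stated, so its omission here is an assumption.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_p_value(x1, n1, x2, n2):
    """Two-sided p-value from the pooled two-proportion z-test."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    return 2 * (1 - NormalDist().cdf(abs(z)))


# Counts reported for the FAS analysis: 338/643 (CADe) vs. 333/650 (control).
p_value = two_proportion_p_value(338, 643, 333, 650)
print(f"PDR comparison p-value: {p_value:.3f}")
```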

Table 4. Boston bowel preparation score (BBPS) in detail.

Table 5. Boston bowel preparation score (BBPS) evaluation and comparison.

Subgroup analysis of polyp detection

In both the FAS and PPS analyses, the PDR of polyps sized 6–9 mm was significantly higher in the CADe-assisted group than in the control group (18.04 vs. 13.85%, p < 0.05). The results are shown in Table 6.

Table 6. Subgroup analysis of polyp detection rate.

In every time period of the day, the miss detection rate of the CADe-assisted group was significantly lower than that of the control group. The difference was most pronounced between 10:00 and 12:00 (12.5 vs. 39.81%, p < 0.001). The results are shown in Table 7.

Table 7. Analysis of miss detection rate stratified by time of day.

Adverse effects

One adverse event occurred in the experimental group: slight bleeding during the procedure.

Discussion

In this multi-center, parallel-controlled study, we evaluated the effect of CADe assistance on endoscopists' polyp detection. We analyzed the data in both the FAS and PPS populations. The results in both populations showed that the CADe could significantly improve the sensitivity of endoscopists in detecting polyps, confirming the effectiveness of the CADe in improving polyp detection during colonoscopy screening. The specificity in both the experimental and control groups was 100%, indicating that the performance of the endoscopists using the CADe system was not inferior to that of the endoscopists in the control group without CADe.

The incidence and mortality of colorectal cancer remain high. Adenomatous polyps are important precursors of this type of malignancy. However, the polyp miss rate in colonoscopy is still high (7). This may be due to the small size of early polyps and to slight mucosal changes that are difficult to detect with the naked eye. Detection is also affected by the patient’s bowel preparation and by the physician’s level of experience and fatigue. In recent years, AI has made significant progress in the field of endoscopy (20). Compared with endoscopists, AI has a strong ability to identify tiny mucosal features, is less prone to fatigue, and is not affected by the environment. It can localize and identify colon polyps in real time, which is expected to reduce the missed detection of polyps, thereby indirectly reducing the risk of CRC.

In our study, endoscopists using CADe achieved an absolute increase in sensitivity of more than 12 percentage points compared to the control group. There was no significant difference in overall PDR between the two groups. However, the PDR of medium-sized (6–9 mm) polyps in the CADe-assisted group was higher than that of the control group by more than 4.1 percentage points in both FAS and PPS analyses (P < 0.05). No statistical difference between the groups was found in the PDR of small polyps (≤5 mm) or large polyps (≥10 mm). Large polyps have a large mass and apparent mucosal lesions, so they may not be easily missed. For small polyps (≤5 mm), a larger sample size may be required to verify whether the PDR differs between the two groups. One of the main reasons for missed polyp detection is the difficulty in distinguishing suspicious lesions from normal colonic mucosa, which is also a cognitive challenge faced by endoscopists during colonoscopy. AI has unique advantages in identifying subtle features that can assist endoscopists in improving polyp detection.

We also calculated the polyp miss rate (PMR) for different time periods of the day. The overall PMR of the experimental group was 12% lower than in the control group, and the PMR in the experimental group was more than 10% lower than that of the control group in each time period. The effect was most marked in the 10:00–12:00 period, which is usually considered the most tiring work period for endoscopists: the PMR was 12.50% with the assistance of the CADe system versus 39.81% without it. This finding indicates that the AI system could offset some of the polyps missed because of fatigue.

In addition, in the experimental group, endoscopists’ specificity for polyps was not reduced by the potential effects of the CADe system, nor was the false positive rate for polyps increased. This suggests that the CADe system does not cause additional misdiagnosis by endoscopists. Interestingly, the average withdrawal time was 430.31 (SD = 111.06) s and 421.01 (SD = 100.83) s (p = 0.062) in the experimental and control groups, respectively, which both met the 6-min withdrawal time recommendation of international guidelines. The withdrawal time in the experimental group was longer; this may be due to the CADe system acting as a potential “supervisor” for the endoscopist. The lesion detection function with blue boxes shown on the colonoscopy monitor helped the endoscopist focus on suspicious lesions, thereby increasing the withdrawal time. This may also be one of the reasons for the increased sensitivity. However, the withdrawal times of the two groups showed no significant difference. Therefore, we are confident that the CADe system will improve the performance of the endoscopists without increasing their workload.

Most CRCs arise from traditional adenomas (including tubular adenomas, villous adenomas, and mixed tubular-villous adenomas) via the classic adenoma-carcinoma pathway (23). The detection and endoscopic resection of adenomas are essential for the prevention of colorectal cancer. Therefore, the adenoma detection rate (ADR) is an important criterion for assessing the quality of colonoscopy. However, recent studies have shown that 15–30% of sporadic CRCs develop through serrated lesions (24). Using the ADR as the evaluation index overlooks serrated lesions, whereas polyps include both traditional adenomas and serrated lesions. Improving the PDR therefore remains important for the long-term prevention of tumors in patients. For this reason, we focused on polyp-related indicators in this study to take a more comprehensive perspective.

There are several limitations to our study. First, our study was conducted in three major digestive endoscopy centers in China, and the results may not be broadly applicable. Further studies can be conducted in community hospitals and in hospitals in remote, underdeveloped areas. In addition, we cannot rule out subjective bias among the endoscopists because the operators could not be blinded. Operators tend to be more attentive when they know that they are being observed and may perform the procedure more carefully.

In conclusion, in this study, we evaluated the efficacy and safety of the CADe-assisted system in real-time colonoscopy in a natural clinical setting. The CADe system can potentially improve the sensitivity of endoscopists in detecting polyps, reduce the missed detection of polyps in colonoscopy, and reduce the risk of CRC.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ruijin Hospital Ethics Committee, Shanghai Jiao Tong University, School of Medicine. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

HZ: Methodology, Writing – original draft. QW: Methodology, Writing – original draft. JS: Data curation, Writing – original draft. JW: Data curation, Validation, Writing – original draft. LZ: Investigation, Writing – original draft. WC: Supervision, Writing – review and editing. DZ: Conceptualization, Supervision, Writing – review and editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This was a sponsor-initiated study, with research software and study funding provided by Wuhan ENDOANGEL Medical Technology Co., Ltd.

Acknowledgments

We thank the physicians in the Department of Gastroenterology of the three participating hospitals for patient screening. We are indebted to our patients for their willingness to participate in this study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2023.1341259/full#supplementary-material

Footnotes

¹ www.chictr.org.cn

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660

2. Siegel RL, Miller KD, Goding Sauer A, Fedewa SA, Butterly LF, Anderson JC, et al. Colorectal cancer statistics, 2020. CA Cancer J Clin. (2020) 70:145–64. doi: 10.3322/caac.21601

3. Zhao S, Wang S, Pan P, Xia T, Chang X, Yang X, et al. Magnitude, risk factors, and factors associated with adenoma miss rate of tandem colonoscopy: a systematic review and meta-analysis. Gastroenterology. (2019) 156:1661–1674.e11. doi: 10.1053/j.gastro.2019.01.260

4. Anderson R, Burr N, Valori R. Causes of post-colonoscopy colorectal cancers based on world endoscopy organization system of analysis. Gastroenterology. (2020) 158:1287–1299.e2. doi: 10.1053/j.gastro.2019.12.031

5. le Clercq CM, Bouwens MW, Rondagh EJ, Bakker CM, Keulen ET, de Ridder RJ, et al. Postcolonoscopy colorectal cancers are preventable: a population-based study. Gut. (2014) 63:957–63. doi: 10.1136/gutjnl-2013-304880

6. Wang P, Liu P, Glissen Brown J, Berzin T, Zhou G, Lei S, et al. Lower adenoma miss rate of computer-aided detection-assisted colonoscopy vs routine white-light colonoscopy in a prospective tandem study. Gastroenterology. (2020) 159:1252–1261.e5. doi: 10.1053/j.gastro.2020.06.023

7. Leufkens A, van Oijen M, Vleggaar F, Siersema P. Factors influencing the miss rate of polyps in a back-to-back colonoscopy study. Endoscopy. (2012) 44:470–5. doi: 10.1055/s-0031-1291666

8. van Rijn J, Reitsma J, Stoker J, Bossuyt P, van Deventer S, Dekker E. Polyp miss rate determined by tandem colonoscopy: a systematic review. Am J Gastroenterol. (2006) 101:343–50. doi: 10.1111/j.1572-0241.2006.00390.x

9. Rawat W, Wang Z. Deep convolutional neural networks for image classification: a comprehensive review. Neural Comput. (2017) 29:2352–449. doi: 10.1162/NECO_a_00990

10. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. (2015) 521:436–44. doi: 10.1038/nature14539

11. Guo Y, Liu Y, Oerlemans A, Lao S, Wu S, Lew MS, et al. Deep learning for visual understanding: a review. Neurocomputing. (2016) 187:27–48.

12. Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Commun ACM. (2017) 60:84–90.

13. Zakhem GA, Fakhoury JW, Motosko CC, Ho RS. Characterizing the role of dermatologists in developing artificial intelligence for assessment of skin cancer. J Am Acad Dermatol. (2021) 85:1544–56. doi: 10.1016/j.jaad.2020.01.028

14. Sechopoulos I, Teuwen J, Mann R. Artificial intelligence for breast cancer detection in mammography and digital breast tomosynthesis: state of the art. Semin Cancer Biol. (2021) 72:214–25. doi: 10.1016/j.semcancer.2020.06.002

15. Lu M, Raghu V, Mayrhofer T, Aerts H, Hoffmann U. Deep learning using chest radiographs to identify high-risk smokers for lung cancer screening computed tomography: development and validation of a prediction model. Ann Intern Med. (2020) 173:704–13. doi: 10.7326/M20-1868

16. Le Berre C, Sandborn W, Aridhi S, Devignes M, Fournier L, Smaïl-Tabbone M, et al. Application of artificial intelligence to gastroenterology and hepatology. Gastroenterology. (2020) 158:76–94.e2. doi: 10.1053/j.gastro.2019.08.058

17. Gulshan V, Peng L, Coram M, Stumpe M, Wu D, Narayanaswamy A, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. (2016) 316:2402–10. doi: 10.1001/jama.2016.17216

18. Bi W, Hosny A, Schabath M, Giger M, Birkbak N, Mehrtash A, et al. Artificial intelligence in cancer imaging: clinical challenges and applications. CA Cancer J Clin. (2019) 69:127–57. doi: 10.3322/caac.21552

19. Repici A, Badalamenti M, Maselli R, Correale L, Radaelli F, Rondonotti E, et al. Efficacy of real-time computer-aided detection of colorectal neoplasia in a randomized trial. Gastroenterology. (2020) 159:512–520.e7. doi: 10.1053/j.gastro.2020.04.062

20. Misawa M, Kudo S, Mori Y, Cho T, Kataoka S, Yamauchi A, et al. Artificial intelligence-assisted polyp detection for colonoscopy: initial experience. Gastroenterology. (2018) 154:2027–2029.e3. doi: 10.1053/j.gastro.2018.04.003

21. Hassan C, Wallace M, Sharma P, Maselli R, Craviotto V, Spadaccini M, et al. New artificial intelligence system: first validation study versus experienced endoscopists for colorectal polyp detection. Gut. (2020) 69:799–800. doi: 10.1136/gutjnl-2019-319914

22. Urban G, Tripathi P, Alkayali T, Mittal M, Jalali F, Karnes W, et al. Deep learning localizes and identifies polyps in real time with 96% accuracy in screening colonoscopy. Gastroenterology. (2018) 155:1069–1078.e8. doi: 10.1053/j.gastro.2018.06.037

23. Gong D, Wu L, Zhang J, Mu G, Shen L, Liu J, et al. Detection of colorectal adenomas with a real-time computer-aided system (ENDOANGEL): a randomised controlled study. Lancet Gastroenterol Hepatol. (2020) 5:352–61. doi: 10.1016/S2468-1253(19)30413-3

24. Hassan C, Spadaccini M, Iannone A, Maselli R, Jovani M, Chandrasekar V, et al. Performance of artificial intelligence in colonoscopy for adenoma and polyp detection: a systematic review and meta-analysis. Gastrointest Endosc. (2021) 93:77–85.e6. doi: 10.1016/j.gie.2020.06.059

25. Spadaccini M, Iannone A, Maselli R, Badalamenti M, Desai M, Chandrasekar V, et al. Computer-aided detection versus advanced imaging for detection of colorectal neoplasia: a systematic review and network meta-analysis. Lancet Gastroenterol Hepatol. (2021) 6:793–802. doi: 10.1016/S2468-1253(21)00215-6

26. Kudo S, Misawa M, Mori Y, Hotta K, Ohtsuka K, Ikematsu H, et al. Artificial intelligence-assisted system improves endoscopic identification of colorectal neoplasms. Clin Gastroenterol Hepatol. (2020) 18:1874–1881.e2. doi: 10.1016/j.cgh.2019.09.009

Keywords: computer-aided detection, artificial intelligence, colorectal polyps, colonoscopy, sensitivity

Citation: Zhang H, Wu Q, Sun J, Wang J, Zhou L, Cai W and Zou D (2024) A computer-aided system improves the performance of endoscopists in detecting colorectal polyps: a multi-center, randomized controlled trial. Front. Med. 10:1341259. doi: 10.3389/fmed.2023.1341259

Received: 20 November 2023; Accepted: 28 December 2023;
Published: 24 January 2024.

Edited by:

Jinhang Gao, Sichuan University, China

Reviewed by:

Jonathan Soldera, University of Caxias do Sul, Brazil
Xie Rui, Affiliated Hospital of Zunyi Medical University, China

Copyright © 2024 Zhang, Wu, Sun, Wang, Zhou, Cai and Zou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Duowu Zou, zduowu@163.com; Wei Cai, 76308501@qq.com

†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.