Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 17 August 2023
Sec. Gastrointestinal Cancers: Colorectal Cancer
This article is part of the Research Topic Colorectal Surgery and Proctology: Past, Present, and Future View all 9 articles

Development and validation of a web-based predictive model for preoperative diagnosis of localized colorectal cancer and colorectal adenoma

Yan Lu*Yan Lu*Haoyang GuoHaoyang GuoJinwen JiangJinwen Jiang
  • Clinical Laboratory, DongYang People’s Hospital, Dongyang, Zhejiang, China

Background: Localized colorectal cancer (LCC) has obscure clinical signs, which are difficult to distinguish from colorectal adenoma (CA). This study aimed to develop and validate a web-based predictive model for preoperative diagnosis of LCC and CA.

Methods: We conducted a retrospective study that included data from 500 patients with LCC and 980 patients with CA who were admitted to Dongyang People’s Hospital between November 2012 and June 2022. Patients were randomly divided into the training (n=1036) and validation (n=444) cohorts. Univariate logistic regression, least absolute shrinkage and selection operator regression, and multivariate logistic regression were used to select the variables for predictive models. The area under the curve (AUC), calibration curve, decision curve analysis (DCA), and clinical impact curve (CIC) were used to evaluate the performance of the model.

Results: The web-based predictive model was developed, including nine independent risk factors: age, sex, drinking history, white blood cell count, lymphocyte count, red blood cell distribution width, albumin, carcinoembryonic antigen, and fecal occult blood test. The AUC of the prediction model in the training and validation cohorts was 0.910 (0.892–0.929) and 0.894 (0.862–0.925), respectively. The calibration curve showed good consistency between the outcome predicted by the model and the actual diagnosis. DCA and CIC showed that the predictive model had a good clinical application value.

Conclusion: This study first developed a web-based preoperative prediction model, which can discriminate LCC from CA and can be used to quantitatively assess the risks and benefits in clinical practice.

1 Introduction

Colorectal cancer (CRC) is the second leading cause of cancer-related death (1). However, for patients with localized colorectal cancer (LCC), the 5-year survival rate after timely surgical treatment can reach 90% (2). Generally, endoscopy predicts potential malignant tumors based on the size and shape of colorectal tumors and ultimately guides tumor treatment (3). LCC and colorectal adenoma (CA) are local lesions that require different treatment approaches. Patients with LCC should undergo laparoscopic or open surgery as soon as it is practicable (4) and may require chemotherapy and adjuvant radiation before surgery (5). On the contrary, patients with CA can be treated using selective endoscopic removal based on their preference (6). However, LCC has obscure clinical signs, and is difficult to distinguish from CA, which depends on biopsy and pathological evaluation (7).

Pathological diagnosis is the gold standard for differentiating between benign and malignant colorectal tumors. However, an endoscopic biopsy is an invasive examination that can lead to complications, such as bleeding, perforation, and infection; thus, it is limited to the patient’s willingness and compliance (8). Due to the advancements in molecular diagnostic technology, DNA (9) and microRNA (10) are now being used as CRC biomarkers. However, they are expensive with unstable diagnostic performance limiting their use in clinical settings. Therefore, fecal occult blood test (FOBT) (11) and serum carcinoembryonic antigen (CEA) (12) detection are preferred as CRC biomarkers because of their accessibility and affordability. However, for CRC, particularly in the early stage, a single detection biomarker has limited sensitivity and a high probability of misdiagnosis. To select the appropriate treatment method and reduce the rate of misdiagnosis of LCC, it is crucial to create a diagnostic prediction model with excellent diagnostic performance using readily accessible and affordable markers.

In previous studies, there are many potential predictive biomarkers for CRC preoperative diagnosis models, such as platelet-related parameters (13), red blood cell distribution width (RDW) (14), and hemoglobin (15). However, some of those studies had a small sample size (16) and almost all of them included patients with advanced CRC (17, 18). To our knowledge, patients with advanced CRC present with more pronounced clinical symptoms; as such, less sensitive indicators have an exaggerated role in the prediction model, resulting in the reduced diagnostic performance of the LCC prediction model.

In this single-center retrospective study, we used clinical and laboratory data of patients from 2012 to 2022 to develop and validate the first web-based predictive model for preoperative diagnosis of LCC and CA.

2 Materials and methods

The design and reporting of this study were guided by transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) (19).

2.1 Study population

Demographic and clinical data of 2070 patients were collected from the medical records at the Dongyang People’s Hospital between November 2012 and June 2022. The inclusion criteria were adult patients (≥ 18 years) who were diagnosed with CRC or CA using histopathology. The exclusion criteria were (1): preoperative anti-tumor treatment (2), patients diagnosed with advanced CRC (Tumor-Node-Metastasis [TNM] stage III-IV) according to TNM staging of the eighth edition of the American Joint Committee on Cancer (3), patients with other primary malignant tumors, and (4) patients with clinical data loss rate >10%. According to the inclusion and exclusion criteria, 500 patients with LCC and 980 patients with CA were finally included in this study (Figure 1).

FIGURE 1
www.frontiersin.org

Figure 1 Flow chart of the study.

The study was approved by the ethics committee of Dongyang People’s Hospital. The data analysis was anonymous. Informed consent was waived since this is a retrospective study and the diagnosis as well as treatment of the patients were not affected.

2.2 Data collection

Preoperative data of eligible patients were extracted, including age, sex, routine blood parameters, FOBT, serum albumin, serum glucose, and tumor biomarkers. Routine blood parameters included white blood cell (WBC), neutrophil, and lymphocyte counts, as well as hemoglobin, RDW, platelet count, and mean platelet volume. Selected tumor biomarkers were CEA and carbohydrate antigen 199. FOBT-positive patients were defined as those with a positive immunochemical test and guaiac-based test ≥ 1+. When the proportion of missing values in the variables was less than 10%, the missing data were filled using multiple imputations in the two cohorts respectively (20). For outliers, which were defined as values other than 1-99% in continuous variables winsorized by 1% on both sides (21), artificial discrimination was used in categorical variables. In addition, data on smoking, drinking, diabetes, and hypertension statuses of the eligible patients were collected. A history of smoking or drinking was noted if smoking or the use of alcohol was reported in a patient’s medical record.

2.3 Statistical analysis

Included patients were randomly divided into the training (70%) and validation (30%) cohorts. Data were analyzed using Stata version 14.0 (Stata Corp LP, College Station, TX, USA) and R software version 4.1.0. Continuous variables are expressed as means and standard deviations or medians and interquartile intervals as appropriate. A Wilcox test or Student’s t-test was used to assess between-group differences. Categorical variables were compared using the chi-square test and are presented as quantities (percentages).

Data from the training cohort were used for the development of the prediction model. Univariable logistic regression analysis was used preliminarily to screen candidate variables. The least absolute shrinkage and selection operator (LASSO) regression analysis was then used to remove collinear independent variables to prevent overfitting (22). Subsequently, indicators with coefficients that were not zero were included in a multivariable logistic regression to complete the final selection of variables. Variables with statistical significance (P<0.05) were included in the prediction model. The prediction probability of the model was calculated using the following formula:

Probability=11+exp((β0+β1χ1+β2χ2+...+βnχn))

A web-based prediction model was developed based on the “DynNom” and “rsconnect” packages in R software (23). The prediction probability can be automatically calculated after inputting the expression of each variable.

The performance of the model in the training and validation cohort was assessed. After data were normalized, the discriminative ability was expressed using the area under the receiver operating characteristic (ROC) curve (AUC). The Youden index was used to determine the best cut-off value, and the corresponding sensitivity, specificity, accuracy, positive prediction value (PPV), negative prediction value (NPV), positive likelihood ratio (PLR), as well as negative effect ratio (NLR), were calculated. To examine the consistency between the actual risk of LCC and the probability predicted using the new model, calibration curves were plotted and the Hosmer–Lemeshow goodness of fit test was performed. The clinical utility of the decision curve analysis (DCA) and clinical impact curve (CIC) was used to demonstrate the clinical utility of the prediction model (24).

3 Results

3.1 Patient characteristics

Demographic and clinical data were extracted from medical records of 2070 patients between November 2012 and June 2022. After data were excluded based on our criteria, records of 1480 patients (LCC: n=500 and CA: n=980) were finally included in this study (Figure 1). Among LCC patients, five (1.0%), 161 (32.2%), and 334 (66.8%) patients had TNM stage 0, I, and II disease, respectively. The patients were then randomly divided into the training (n=1036) and validation (n=444) cohorts at a ratio of 7:3. There was no statistical difference between the training and validation cohorts in each variable (Table 1).

TABLE 1
www.frontiersin.org

Table 1 Preoperative baseline characteristics of patients in the training and validation cohorts.

3.2 Variable selection and development of a web-based predictive model

As shown in Table 2, in the training cohort, there were significant differences in 16 clinical parameters between LCC and CA groups using univariable logistic regression. Subsequently, 15 potential predictors without multicollinearity were identified using LASSO regression analysis (Table 2, Supplementary Figure 1). Finally, age, sex, drinking history, WBC, lymphocyte count, RDW, albumin, CEA, and FOBT were considered independent predictors of LCC in patients according to the results of multivariate logistic regression (Table 3). The probability of LCC can be calculated according to the following formula: Probability (LCC) = 1/(1 + exp (− (-1.703 + 0.024 × age -1.035 × sex (male=1, female=0) + 0.482 × drinking history (yes=1, no=0) + 0.215 × WBC - 0.694 × lymphocyte count + 0.334 × RDW - 0.131 × albumin + 0.203 × CEA + 2.357 × FOBT (positive=1, negative=0)))).

TABLE 2
www.frontiersin.org

Table 2 Univariate logistic analysis and LASSO regression analysis in the training cohort.

TABLE 3
www.frontiersin.org

Table 3 Multivariate logistic analysis in the training cohort.

The web-based dynamic prediction model developed using the selected variables can be used through the following link: https://ly11219.shinyapps.io/dynnomapp/.The interface of this webpage is shown in Figure 2. Figure 2A displays the input nomograph interface, in which users can adjust the expression of each item. Figure 2B shows a graphical summary of the LCC probability and 95% confidence interval (CI) predicted for the three patients according to the nomogram. The page also provides a numerical summary, shown in Figure 2C.

FIGURE 2
www.frontiersin.org

Figure 2 Web interface for distinguishing localized colorectal cancer (LCC) from colorectal adenoma (CA). (A) The area where the expression of each item can be adjusted by the user. (B) Graphical summary of the LCC probability and 95% confidence interval (CI) predicted by the prediction model. (C) Numerical summary of LCC probability and 95% CI.

3.3 Evaluation of the performance of the prediction model

The AUC in the training and validation cohorts was 0.910 (0.892–0.929) and 0.894 (0.862–0.925), respectively (Figure 3A). There was no significant difference in the diagnostic performance of the prediction model between the two cohorts (P=0.379). The optimal cut-off value of the probatility of the prediction model was 26.41%. The result of sensitivity, specificity, PPV, NPV, PLR, and NLR used to distinguish between LCC and CA was 86.4%, 80.1%, 82.2%, 69.1%, 91.9%, 4.342, and 0.170 in the training cohort, as well as 85.1%, 79.1%, 81.1%, 67.0%, 91.4%, 4.07 and 0.188 in the validation cohort, respectively (Table 4). The results of the Hosmer–Lemeshow goodness of fit test showed good consistency between the outcome predicted by the model and the actual diagnosis, which was reflected in both the training and validation cohorts (Figures 3B, C).

FIGURE 3
www.frontiersin.org

Figure 3 Evaluation of the performance of the prediction model to distinguish localized colorectal cancer (LCC) and colorectal adenoma (CA). (A) The receiver operating characteristic (ROC) curve of the prediction model in the training and validation cohorts. (B) The calibration curve of the prediction model in the training cohort. (C) The calibration curve of the prediction model in the validation cohort.

TABLE 4
www.frontiersin.org

Table 4 Predictive performance of the models used to estimate the risk of LCC.

When drawing the DCA to reflect the advantages of the new model, we added a comparison between the new model and the two common CRC screening indicators, CEA and FOBT. In the training and validation cohorts, the threshold probability was between 0.05 and 1.00. The performance of the prediction model was better than that of the CEA, FOBT, and two extreme cases (treat-none and treat-all) as shown in Figures 4A, B. For example, when the risk threshold was set to 0.3 (i.e. if the LCC probability of the patient was >30%), the patient would receive further treatment. In the training cohort, the net benefit of the new prediction model was 0.25, which was higher than that of of the FOBT (0.20), CEA (0.10), treat-all (0.05), and treat-none (0.00).

FIGURE 4
www.frontiersin.org

Figure 4 DCA and CIC of the prediction model. (A) DCA of the training cohort. (B) DCA of the validation cohort. (C) CIC of the training cohort. (D) CIC of the validation cohort. DCA, decision curve analysis; CIC, clinical impact curve; CEA, carcinoembryonic antigen; FOBT, fecal occult blood test.

CIC results also showed a similar number of high-risk LCC patients predicted by the new model and the actual number of LCC patients in both the training and validation cohorts (Figures 4C, D). For instance, when the risk threshold was set to 0.4, almost 400 out of every 1000 persons in the training cohort were deemed at high risk, and approximately 350 of them were diagnosed with LCC.

4 Discussion

CA is a benign tumor in the colorectal region, of which only 5% will eventually advance to CRC, and the overall tumor progression is slow (25). Preoperative differentiation between LCC and CA is helpful to reduce unnecessary treatment and promote the early detection of CRC. In most cases, the differentiation between LCC and CA patients depends on invasive colonoscopy (26). In this study, we found that age, sex, drinking history, WBC, lymphocyte count, RDW, albumin, CEA, and FOBT were independent predictors of LCC in patients, and successfully developed a web-based prediction model. Through evaluating calibration and validation, we believe that our prediction model has a good discrimination performance and clinical application value.

To predict LCC and CA, a wide range of variables were considered in this model when selecting preoperative markers. Age and sex play a significant role in the diagnosis of many tumors, including CRC (27) and lung cancer (28). In this study, being female and older were considered risk factors for LCC. Previous studies reported smoking and drinking as risk factors for CRC, and these were associated with a poor prognosis (2932). The current study also showed that individuals with a drinking history were at a higher risk of being diagnosed with LCC, but there was no association between LCC and smoking. Routine blood cell parameters are often used as inflammatory markers to reflect the patient’s inflammatory immune status. Various studies have shown that higher neutrophil-lymphocyte and platelet-lymphocyte ratios or a high systemic immune inflammatory index (platelet count × neutrophil count/lymphocyte count) were associated with a higher tumor stage, worse differentiation level, and worse prognosis of CRC (3336). Our prediction model used single blood cell parameters, such as WBC, lymphocyte count, or RDW, instead of the ratio between parameters, which reduces the steps of numerical conversion and simplifies the calculation process. Serum albumin not only reflects the nutritional status of patients but also has a negative correlation with the inflammatory reaction in vivo. It is an independent predictor of the prognosis of CRC (37). In this study, patients with lower albumin levels had a higher probability of LCC diagnosis. CEA and FOBT are currently widely used non-invasive markers for screening CRC (12, 38), which played a significant role in this prediction model and had better diagnostic performance than being used individually.

Previous studies attempted to use molecular detection for early diagnosis of CRC; however, this is expensive and incomparable to colonoscopy or fecal immunochemical tests (3941). Some studies have used markers of the systemic inflammatory response as diagnostic tools for CRC (18, 42). However, the results of these trials have limited diagnostic performance in individuals with early CRC because a large number of patients with advanced CRC were included. Additionally, the static nomograph model requires the manual calculation of the prediction probability corresponding to the total score, which is less intuitive. This study included nine easily available and inexpensive preoperative variables and developed a web-based prediction model. As a preoperative prediction tool, the model is not only easy to popularize and use, but also has a high prediction accuracy and good discrimination characteristics.

Our study has some limitations. First, since this is a single-center retrospective study, it is necessary to validate our model with data from external prospective studies. Second, neither the diagnostic information of patients with advanced CRC nor the prognosis was included in this study. In most cases, the model was mainly used as a tool for early screening of LCC and CA. Third, other indicators with potential predictive value, such as gene expression and coagulation markers, were not included due to restrictions posed by retrospective data as well as the feasibility of sample collection and costs. To validate the accuracy of our findings, a multicenter prospective investigation is required.

5 Conclusion

A web-based preoperative prediction model incorporating nine preoperative variables was developed. The model can directly and quantitatively assess the risks and benefits in clinical practice and has strong performance in recognizing LCC and CA.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee of Dongyang People’s Hospital. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

YL contributed to the research concept and design. HG and JJ conducted data collection and analysis. All authors participated in the first draft of this manuscript. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1199868/full#supplementary-material

Abbreviations

CRC, Colorectal cancer; LCC, localized colorectal cancer; CA, colorectal adenoma; FOBT, fecal occult blood test; CEA, carcinoembryonic antigen; TNM, Tumor-Node-Metastasis; WBC, white blood cell; RDW, red blood cell distribution width; LASSO, least absolute shrinkage and selection operator; ROC, receiver operating characteristic; AUC, arear under the receiver operating characteristic curve; PPV, positive prediction value; NPV, negative prediction value; PLR, positive likelihood ratio; NLR, negative effect ratio; DCA, decision curve analysis; CIC, clinical impact curve; CI, confidence interval.

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Miller KD, Nogueira L, Devasia T, Mariotto AB, Yabroff KR, Jemal A, et al. Cancer treatment and survivorship statistics, 2022. CA: Cancer J Clin (2022) 72(5):409–36. doi: 10.3322/caac.21731

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Bronzwaer MES, Depla A, van Lelyveld N, Spanier BWM, Oosterhout YH, van Leerdam ME, et al. Quality assurance of colonoscopy within the Dutch national colorectal cancer screening program. Gastrointestinal Endoscopy (2019) 89(1):1–13. doi: 10.1016/j.gie.2018.09.011

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Sheng S, Zhao T, Wang X. Comparison of robot-assisted surgery, laparoscopic-assisted surgery, and open surgery for the treatment of colorectal cancer: A network meta-analysis. Medicine (2018) 97(34):e11817. doi: 10.1097/md.0000000000011817

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Benson AB, Venook AP, Al-Hawary MM, Arain MA, Chen YJ, Ciombor KK, et al. NCCN guidelines insights: rectal cancer, version 6.2020. J Natl Compr Cancer Network JNCCN (2020) 18(7):806–15. doi: 10.6004/jnccn.2020.0032

CrossRef Full Text | Google Scholar

6. Aarons CB, Shanmugan S, Bleier JI. Management of Malignant colon polyps: current status and controversies. World J Gastroenterol (2014) 20(43):16178–83. doi: 10.3748/wjg.v20.i43.16178

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Das V, Kalita J, Pal M. Predictive and prognostic biomarkers in colorectal cancer: A systematic review of recent advances and challenges. Biomedicine pharmacotherapy = Biomedecine pharmacotherapie (2017) 87:8–19. doi: 10.1016/j.biopha.2016.12.064

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Kim SY, Kim HS, Park HJ. Adverse events related to colonoscopy: Global trends and future challenges. World J Gastroenterol (2019) 25(2):190–204. doi: 10.3748/wjg.v25.i2.190

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Müller D, Győrffy B. DNA methylation-based diagnostic, prognostic, and predictive biomarkers in colorectal cancer. Biochim Biophys Acta Rev Cancer (2022) 1877(3):188722. doi: 10.1016/j.bbcan.2022.188722

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Toiyama Y, Okugawa Y, Fleshman J, Richard Boland C, Goel A. MicroRNAs as potential liquid biopsy biomarkers in colorectal cancer: A systematic review. Biochim Biophys Acta Rev Cancer (2018) 1870(2):274–82. doi: 10.1016/j.bbcan.2018.05.006

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Li JN, Yuan SY. Fecal occult blood test in colorectal cancer screening. J Digestive Dis (2019) 20(2):62–4. doi: 10.1111/1751-2980.12712

CrossRef Full Text | Google Scholar

12. Rao H, Wu H, Huang Q, Yu Z, Zhong Z. Clinical value of serum CEA, CA24-2 and CA19-9 in patients with colorectal cancer. Clin Lab (2021) 67(4). doi: 10.7754/Clin.Lab.2020.200828

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Wu YY, Zhang X, Qin YY, Qin JQ, Lin FQ. Mean platelet volume/platelet count ratio in colorectal cancer: a retrospective clinical study. BMC Cancer (2019) 19(1):314. doi: 10.1186/s12885-019-5504-9

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Song Y, Huang Z, Kang Y, Lin Z, Lu P, Lin Q, et al. Clinical usefulness and prognostic value of red cell distribution width in colorectal cancer. BioMed Res Int (2018) 2018:9858943. doi: 10.1155/2018/9858943

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Li H, Lin J, Xiao Y, Zheng W, Zhao L, Yang X, et al. Colorectal cancer detected by machine learning models using conventional laboratory test data. Technol Cancer Res Treat (2021) 20:15330338211058352. doi: 10.1177/15330338211058352

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Záhorec R, Marek V, Waczulikova I, Veselovský T, Palaj J, Kečkéš Š, et al. Predictive model using hemoglobin, albumin, fibrinogen, and neutrophil-to-lymphocyte ratio to distinguish patients with colorectal cancer from those with benign adenoma. Neoplasma (2021) 68(6):1292–300. doi: 10.4149/neo_2021_210331N435

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Huang J, Xu P, Chen Y. A retrospective study from a single center to identify hematological factors that distinguish between patients with colorectal carcinoma and colorectal adenoma. Med Sci monitor Int Med J Exp Clin Res (2022) 28:e936745. doi: 10.12659/msm.936745

CrossRef Full Text | Google Scholar

18. Hernandez-Ainsa M, Velamazan R, Lanas A, Carrera-Lasfuentes P, Piazuelo E. Blood-cell-based inflammatory markers as a useful tool for early diagnosis in colorectal cancer. Front Med (2022) 9:843074. doi: 10.3389/fmed.2022.843074

CrossRef Full Text | Google Scholar

19. Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Internal Med (2015) 162(1):W1–73. doi: 10.7326/m14-0698

CrossRef Full Text | Google Scholar

20. Austin PC, White IR, Lee DS, van Buuren S. Missing data in clinical research: A tutorial on multiple imputation. Can J Cardiol (2021) 37(9):1322–31. doi: 10.1016/j.cjca.2020.11.010

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Sharma S, Chatterjee S. Winsorization for robust bayesian neural networks. Entropy (Basel Switzerland) (2021) 23(11). doi: 10.3390/e23111546

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Friedman J, Hastie T, TibshIrani R. Regularization paths for generalized linear models via coordinate descent. J Stat Software (2010) 33(1):1–22.

Google Scholar

23. Jalali A, Alvarez-Iglesias A, Roshan D, Newell J. Visualising statistical models using dynamic nomograms. PloS One (2019) 14(11):e0225253. doi: 10.1371/journal.pone.0225253

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Kerr KF, Brown MD, Zhu K, Janes H. Assessing the clinical impact of risk prediction models with decision curves: guidance for correct interpretation and appropriate use. J Clin Oncol Off J Am Soc Clin Oncol (2016) 34(21):2534–40. doi: 10.1200/jco.2015.65.5654

CrossRef Full Text | Google Scholar

25. Keum N, Giovannucci E. Global burden of colorectal cancer: emerging trends, risk factors and prevention strategies. Nat Rev Gastroenterol Hepatol (2019) 16(12):713–32. doi: 10.1038/s41575-019-0189-8

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Chen H, Li N, Ren J, Feng X, Lyu Z, Wei L, et al. Participation and yield of a population-based colorectal cancer screening programme in China. Gut (2019) 68(8):1450–7. doi: 10.1136/gutjnl-2018-317124

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Erichsen R, Baron JA, Stoffel EM, Laurberg S, Sandler RS, Sørensen HT. Characteristics and survival of interval and sporadic colorectal cancer patients: a nationwide population-based cohort study. Am J Gastroenterol (2013) 108(8):1332–40. doi: 10.1038/ajg.2013.175

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Liu D, Sun X, Liu A, Li L, Li S, Li J, et al. Predictive value of a novel Asian lung cancer screening nomogram based on artificial intelligence and epidemiological characteristics. Thorac Cancer (2021) 12(23):3130–40. doi: 10.1111/1759-7714.14140

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Klarich DS, Brasser SM, Hong MY. Moderate alcohol consumption and colorectal cancer risk. Alcoholism Clin Exp Res (2015) 39(8):1280–91. doi: 10.1111/acer.12778

CrossRef Full Text | Google Scholar

30. Walter V, Jansen L, Ulrich A, Roth W, Bläker H, Chang-Claude J, et al. Alcohol consumption and survival of colorectal cancer patients: a population-based study from Germany. Am J Clin Nutr (2016) 103(6):1497–506. doi: 10.3945/ajcn.115.127092

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Gong J, Hutter C, Baron JA, Berndt S, Caan B, Campbell PT, et al. A pooled analysis of smoking and colorectal cancer: timing of exposure and interactions with environmental factors. Cancer epidemiology Biomarkers Prev Publ Am Assoc Cancer Research cosponsored by Am Soc Prev Oncol (2012) 21(11):1974–85. doi: 10.1158/1055-9965.epi-12-0692

CrossRef Full Text | Google Scholar

32. Walter V, Jansen L, Hoffmeister M, Ulrich A, Chang-Claude J, Brenner H. Smoking and survival of colorectal cancer patients: population-based study from Germany. Int J Cancer (2015) 137(6):1433–45. doi: 10.1002/ijc.29511

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Yang J, Xu H, Guo X, Zhang J, Ye X, Yang Y, et al. Pretreatment inflammatory indexes as prognostic predictors for survival in colorectal cancer patients receiving neoadjuvant chemoradiotherapy. Sci Rep (2018) 8(1):3044. doi: 10.1038/s41598-018-21093-7

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Lin SQ, Xie HL, Ge YZ, Ruan GT, Zhang Q, Song MM, et al. Association between systemic inflammation and water composition and survival in colorectal cancer. Front Oncol (2022) 12:896160. doi: 10.3389/fonc.2022.896160

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Song Y, Yang Y, Gao P, Chen X, Yu D, Xu Y, et al. The preoperative neutrophil to lymphocyte ratio is a superior indicator of prognosis compared with other inflammatory biomarkers in resectable colorectal cancer. BMC Cancer (2017) 17(1):744. doi: 10.1186/s12885-017-3752-0

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Chan JC, Chan DL, Diakos CI, Engel A, Pavlakis N, Gill A, et al. The lymphocyte-to-monocyte ratio is a superior predictor of overall survival in comparison to established biomarkers of resectable colorectal cancer. Ann Surg (2017) 265(3):539–46. doi: 10.1097/sla.0000000000001743

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Zhang N, Ning F, Guo R, Pei J, Qiao Y, Fan J, et al. Prognostic values of preoperative inflammatory and nutritional markers for colorectal cancer. Front Oncol (2020) 10:585083. doi: 10.3389/fonc.2020.585083

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Lisi D, Hassan C, Crespi M. Participation in colorectal cancer screening with FOBT and colonoscopy: an Italian, multicentre, randomized population study. Digestive liver Dis Off J Ital Soc Gastroenterol Ital Assoc Study Liver (2010) 42(5):371–6. doi: 10.1016/j.dld.2009.07.019

CrossRef Full Text | Google Scholar

39. Heiss JA, Brenner H. Epigenome-wide discovery and evaluation of leukocyte DNA methylation markers for the detection of colorectal cancer in a screening setting. Clin Epigenet (2017) 9:24. doi: 10.1186/s13148-017-0322-x

CrossRef Full Text | Google Scholar

40. Peterse EFP, Meester RGS, de Jonge L, Omidvari AH, Alarid-Escudero F, Knudsen AB, et al. Comparing the cost-effectiveness of innovative colorectal cancer screening tests. J Natl Cancer Institute (2021) 113(2):154–61. doi: 10.1093/jnci/djaa103

CrossRef Full Text | Google Scholar

41. Jin P, You P, Fang J, Kang Q, Gu F, Cai Y, et al. Comparison of performance of two stool DNA tests and a fecal immunochemical test in detecting colorectal neoplasm: A multicenter diagnostic study. Cancer epidemiology Biomarkers Prev Publ Am Assoc Cancer Research cosponsored by Am Soc Prev Oncol (2022) 31(3):654–61. doi: 10.1158/1055-9965.epi-21-0991

CrossRef Full Text | Google Scholar

42. Kilincalp S, Çoban Ş, Akinci H, Hamamcı M, Karaahmet F, Coşkun Y, et al. Neutrophil/lymphocyte ratio, platelet/lymphocyte ratio, and mean platelet volume as potential biomarkers for early detection and monitoring of colorectal adenocarcinoma. Eur J Cancer Prev Off J Eur Cancer Prev Organisation (ECP) (2015) 24(4):328–33. doi: 10.1097/cej.0000000000000092

CrossRef Full Text | Google Scholar

Keywords: colorectal cancer, colorectal adenoma, predictive model, preoperative diagnosis, dynamic nomogram

Citation: Lu Y, Guo H and Jiang J (2023) Development and validation of a web-based predictive model for preoperative diagnosis of localized colorectal cancer and colorectal adenoma. Front. Oncol. 13:1199868. doi: 10.3389/fonc.2023.1199868

Received: 28 April 2023; Accepted: 03 August 2023;
Published: 17 August 2023.

Edited by:

Sharon R. Pine, University of Colorado Anschutz Medical Campus, United States

Reviewed by:

Sergey Achkasov, Ryzhikh National Medical Research Centre of Coloproctology, Russia
Dipendra Khadka, Wonkwang University School of Medicine, Republic of Korea
Qisong Zhang, Guangxi University, China

Copyright © 2023 Lu, Guo and Jiang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yan Lu, bHV5YW4xMTIxOUBob3RtYWlsLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.