AUTHOR=Zhu Jiang , Zheng Jinxin , Li Longfei , Huang Rui , Ren Haoyu , Wang Denghui , Dai Zhijun , Su Xinliang 

TITLE=Application of Machine Learning Algorithms to Predict Central Lymph Node Metastasis in T1-T2, Non-invasive, and Clinically Node Negative Papillary Thyroid Carcinoma

JOURNAL=Frontiers in Medicine

VOLUME=Volume 8 - 2021

YEAR=2021

URL=https://www.frontiersin.org/journals/medicine/articles/10.3389/fmed.2021.635771

DOI=10.3389/fmed.2021.635771

ISSN=2296-858X

ABSTRACT=<p><bold>Purpose:</bold> While there are no clear indications of whether central lymph node dissection is necessary in patients with T1-T2, non-invasive, clinically uninvolved central neck lymph nodes papillary thyroid carcinoma (PTC), this study seeks to develop and validate models for predicting the risk of central lymph node metastasis (CLNM) in these patients based on machine learning algorithms.</p><p><bold>Methods:</bold> This is a retrospective study comprising 1,271 patients with T1-T2 stage, non-invasive, and clinically node negative (cN0) PTC who underwent surgery at the Department of Endocrine and Breast Surgery of The First Affiliated Hospital of Chongqing Medical University from February 1, 2016, to December 31, 2018. We applied six machine learning (ML) algorithms, including Logistic Regression (LR), Gradient Boosting Machine (GBM), Extreme Gradient Boosting (XGBoost), Random Forest (RF), Decision Tree (DT), and Neural Network (NNET), coupled with preoperative clinical characteristics and intraoperative information to develop prediction models for CLNM. Among all the samples, 70% were randomly selected to train the models while the remaining 30% were used for validation. Indices like the area under the receiver operating characteristic (AUROC), sensitivity, specificity, and accuracy were calculated to test the models' performance.</p><p><bold>Results:</bold> The results showed that ~51.3% (652 out of 1,271) of the patients had pN1 disease. In multivariate logistic regression analyses, gender, tumor size and location, multifocality, age, and Delphian lymph node status were all independent predictors of CLNM. In predicting CLNM, six ML algorithms posted AUROC of 0.70–0.75, with the extreme gradient boosting (XGBoost) model standing out, registering 0.75. Thus, we employed the best-performing ML algorithm model and uploaded the results to a self-made online risk calculator to estimate an individual's probability of CLNM (<ext-link ext-link-type="uri" xlink:href="https://jin63.shinyapps.io/ML_CLNM/" xmlns:xlink="http://www.w3.org/1999/xlink">https://jin63.shinyapps.io/ML_CLNM/</ext-link>).</p><p><bold>Conclusions:</bold> With the incorporation of preoperative and intraoperative risk factors, ML algorithms can achieve acceptable prediction of CLNM with Xgboost model performing the best. Our online risk calculator based on ML algorithm may help determine the optimal extent of initial surgical treatment for patients with T1-T2 stage, non-invasive, and clinically node negative PTC.</p>