AUTHOR=Prusty Sashikanta , Patnaik Srikanta , Dash Sujit Kumar TITLE=SKCV: Stratified K-fold cross-validation on ML classifiers for predicting cervical cancer JOURNAL=Frontiers in Nanotechnology VOLUME=4 YEAR=2022 URL=https://www.frontiersin.org/journals/nanotechnology/articles/10.3389/fnano.2022.972421 DOI=10.3389/fnano.2022.972421 ISSN=2673-3013 ABSTRACT=

Cancer is the unregulated development of abnormal cells in the human body system. Cervical cancer, also known as cervix cancer, develops on the cervix’s surface. This causes an overabundance of cells to build up, eventually forming a lump or tumour. As a result, early detection is essential to determine what effective treatment we can take to overcome it. Therefore, the novel Machine Learning (ML) techniques come to a place that predicts cervical cancer before it becomes too serious. Furthermore, four common diagnosis testing namely, Hinselmann, Schiller, Cytology, and Biopsy have been compared and predicted with four common ML models, namely Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (K-NNs), and Extreme Gradient Boosting (XGB). Additionally, to enhance the better performance of ML models, the Stratified k-fold cross-validation (SKCV) method has been implemented over here. The findings of the experiments demonstrate that utilizing an RF classifier for analyzing the cervical cancer risk, could be a good alternative for assisting clinical specialists in classifying this disease in advance.