AUTHOR=Ivanov Deyan , Dremin Viktor , Genova Tsanislava , Bykov Alexander , Novikova Tatiana , Ossikovski Razvigor , Meglinski Igor TITLE=Polarization-Based Histopathology Classification of Ex Vivo Colon Samples Supported by Machine Learning JOURNAL=Frontiers in Physics VOLUME=9 YEAR=2022 URL=https://www.frontiersin.org/journals/physics/articles/10.3389/fphy.2021.814787 DOI=10.3389/fphy.2021.814787 ISSN=2296-424X ABSTRACT=

In biophotonics, novel techniques and approaches are constantly being sought to assist medical doctors and to increase both the sensitivity and the specificity of existing diagnostic methods. In this context, tissue polarimetry holds promise as a valuable optical diagnostic technique because it is sensitive to tissue alterations caused by benign and malignant formations. In our studies, multiple Mueller matrices were recorded for formalin-fixed, human, ex vivo colon specimens containing healthy and tumor zones. The data were pre-processed to filter noise and experimental errors, and all Mueller matrices were then decomposed to derive polarimetric quantities sensitive to malignant formations in tissue. In addition, the experimental results were represented on the Poincaré sphere and plotted in the depolarization spaces defined by the canonical and natural indices of polarimetric purity. Feature selection, together with statistical analysis and normalization of the available data, was used to create a polarimetric model for colon cancer assessment with strong predictors. Both unsupervised (principal component analysis) and supervised (logistic regression, random forest, and support vector machines) machine learning algorithms were used to extract particular features from the model and to classify the samples. The logistic regression results made it possible to identify the best polarimetric quantities for tumor detection, while random forest yielded the highest accuracy values. Attention was paid to the correlation between the predictors in the model, as well as to the losses and the relative risk of misclassification. Beyond the mathematical interpretation of the polarimetric quantities, the presented polarimetric model supported the physical interpretation of results from previous studies and related them to the health condition of the samples.
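
As an illustration of the indices of polarimetric purity mentioned above, the following minimal sketch computes them from the four eigenvalues of the covariance (coherency) matrix associated with a Mueller matrix. It assumes the commonly used definitions P1 = l1 - l2, P2 = l1 + l2 - 2*l3, P3 = l1 + l2 + l3 - 3*l4 for trace-normalized eigenvalues sorted in descending order. The abstract does not state which formulas or software the authors used, so the function names and the input format here are hypothetical.

```python
import math


def polarimetric_purity_indices(eigenvalues):
    """Return (P1, P2, P3) from four covariance-matrix eigenvalues.

    Assumption: the eigenvalues are real and non-negative, as expected
    for a physically realizable Mueller matrix. They are normalized by
    their sum and sorted in descending order before use.
    """
    if len(eigenvalues) != 4:
        raise ValueError("exactly four eigenvalues are required")
    total = sum(eigenvalues)
    if total <= 0:
        raise ValueError("the eigenvalue sum must be positive")
    lam = sorted((v / total for v in eigenvalues), reverse=True)
    p1 = lam[0] - lam[1]
    p2 = lam[0] + lam[1] - 2 * lam[2]
    p3 = lam[0] + lam[1] + lam[2] - 3 * lam[3]
    return p1, p2, p3


def depolarization_index(p1, p2, p3):
    """Overall depolarization index expressed through P1, P2, P3.

    Uses the standard relation
    P_delta^2 = (2*P1^2 + (2/3)*P2^2 + (1/3)*P3^2) / 3.
    """
    return math.sqrt((2 * p1**2 + (2 / 3) * p2**2 + (1 / 3) * p3**2) / 3)


if __name__ == "__main__":
    # Illustrative eigenvalues only, not measured data.
    p1, p2, p3 = polarimetric_purity_indices([0.7, 0.2, 0.07, 0.03])
    print(p1, p2, p3, depolarization_index(p1, p2, p3))
```

With these definitions the indices satisfy 0 <= P1 <= P2 <= P3 <= 1. A non-depolarizing sample gives (1, 1, 1) and an ideal depolarizer gives (0, 0, 0), so each measurement maps to a point in a bounded purity space that can then serve as a feature for classification.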