AUTHOR=Liang Jinglun, Ye Guoliang, Guo Jianwen, Huang Qifan, Zhang Shaohui TITLE=Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets JOURNAL=Frontiers in Public Health VOLUME=9 YEAR=2021 URL=https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2021.671070 DOI=10.3389/fpubh.2021.671070 ISSN=2296-2565 ABSTRACT=

Malignant pulmonary nodules are one of the main manifestations of lung cancer in early CT screening. Because lung cancer may show no obvious symptoms in its early stages, it is important to develop a computer-aided detection (CAD) system that assists doctors in detecting malignant pulmonary nodules during early CT diagnosis. Following the recent success of deep learning in image processing, a growing number of researchers have applied it to the diagnosis of pulmonary nodules. However, the ratio of nodule to non-nodule samples in the training and testing datasets usually differs from the ratio encountered in clinical practice, so CAD classification systems built on such imbalanced datasets can easily produce more false positives. This work introduces a filtering step that removes irrelevant images from the dataset; the results show that false positives can be reduced and that accuracy can exceed 98%. Nodule detection proceeds in two steps. First, the images containing pulmonary nodules are screened from the patient's whole-lung CT images. Second, the exact locations of the pulmonary nodules are detected using Faster R-CNN. The final results show that this method can effectively detect pulmonary nodules in CT images and can therefore potentially assist doctors in the early diagnosis of lung cancer.
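
The two-step flow described in the abstract (screen the slices first, then localize nodules only in the slices that pass) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function `two_stage_detect`, the callables `screen_score` and `locate_nodules`, the 0.5 threshold, and the toy data are all hypothetical stand-ins. In a real system the screening step would be a trained classifier and the localization step a Faster R-CNN detector.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# A bounding box as (x_min, y_min, x_max, y_max); the layout is an assumption.
Box = Tuple[float, float, float, float]


def two_stage_detect(
    slices: Sequence[Sequence[float]],
    screen_score: Callable[[Sequence[float]], float],
    locate_nodules: Callable[[Sequence[float]], List[Box]],
    threshold: float = 0.5,  # hypothetical cut-off, not taken from the paper
) -> Dict[int, List[Box]]:
    """Run the detector only on slices that pass the screening step.

    Returns a mapping from slice index to the boxes found in that slice.
    Slices rejected by the screen never reach the detector, so they cannot
    contribute false-positive boxes.
    """
    results: Dict[int, List[Box]] = {}
    for index, ct_slice in enumerate(slices):
        # Step 1: discard slices the screening model considers irrelevant.
        if screen_score(ct_slice) < threshold:
            continue
        # Step 2: localize nodules in the remaining slices.
        boxes = locate_nodules(ct_slice)
        if boxes:
            results[index] = boxes
    return results


# Toy stand-ins so the sketch runs without any model or CT data.
def toy_screen(ct_slice: Sequence[float]) -> float:
    return max(ct_slice)  # pretend the brightest value is a nodule score


def toy_detector(ct_slice: Sequence[float]) -> List[Box]:
    return [(0.0, 0.0, 1.0, 1.0)] if max(ct_slice) > 0.8 else []


toy_slices = [[0.1, 0.2], [0.9, 0.3], [0.6, 0.1]]
found = two_stage_detect(toy_slices, toy_screen, toy_detector)
```

In this toy run the first slice is rejected by the screen, the third passes the screen but yields no boxes, and only the second slice produces a detection. Keeping the two steps as separate callables means the screening threshold can be tuned independently of the detector.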