The final, formatted version of the article will be published soon.
ORIGINAL RESEARCH article
Front. Phys.
Sec. Radiation Detectors and Imaging
Volume 12 - 2024 |
doi: 10.3389/fphy.2024.1526412
This article is part of the Research Topic Multi-Sensor Imaging and Fusion: Methods, Evaluations, and Applications, Volume III View all articles
Characters Interested Binary-like Image Learning for Text Image Demoir éing
Provisionally accepted- Harbin Institute of Technology, Shenzhen, Shenzhen, China
Despite the fact that the text image based Optical Character Recognition (OCR) methods have been applied to wide applications, they do suffer from performance degradation when the image is contaminated with moir é patterns for the sake of interference between the display screen and the camera. To tackle this problem, in this paper, we propose a novel network for text image demoir éing. Specifically, to encourage our study on text images, a dataset including an amount of pairs of images with/without moir é patterns is collected, which is specific for text image demoir éing. In addition, due to the statistical difference among various channels on moir é patterns, a multi-channel strategy is proposed which roughly extracts the information associated with moir é patterns and subsequently contributes to moir é removal. Besides, our purpose on the text image is to increase the OCR accuracy, while other background pixels are insignificant. Instead of restoring all pixels like that in natural images, a character attention module is conducted, allowing the network to pay more attention on the optical character associated pixels and also achieving a consistent image style. Being beneficial from this strategy, characters can be more easily detected and more accurately recognized. Dramatic experimental results on our conducted dataset demonstrate the significance of our study and the superiority of our proposed method compared with state-of-the-art image restoration approaches. Specifically, the metrics of Recall and F1-measure on recognition are increased from 56.32%/70.18% to 85.34%/89.36%.
Keywords: Multi-sensor imaging, deep learning, Text image, Demoir éing, multi-channel, Moir é pattern, Optical character recognition (OCR)
Received: 11 Nov 2024; Accepted: 19 Nov 2024.
Copyright: © 2024 Zhang, Liang, Ren, Fan, Li and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Mu Li, Harbin Institute of Technology, Shenzhen, Shenzhen, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.