Journal of Signal and Information Processing

Volume 10, Issue 1 (February 2019)

ISSN Print: 2159-4465   ISSN Online: 2159-4481

Google-based Impact Factor: 1.19  Citations  

Pre-Processing Images of Public Signage for OCR Conversion

HTML  XML Download Download as PDF (Size: 1075KB)  PP. 1-11  
DOI: 10.4236/jsip.2019.101001    1,510 Downloads   3,135 Views  

ABSTRACT

In this paper, we propose a novel method to enhance the OCR (Optical Character Recognition) readability of public signboards captured by smart-phone cameras—both outdoors and indoors, and subject to various lighting conditions. A distinct feature of our technique is the detection of these signs in the HSV (Hue, Saturation and Value) color space, done in order to filter out the signboard from the background, and correctly interpret the textual details of each signboard. This is then binarized using a thresholding technique that is optimized for text printed on contrasting backgrounds, and passed through the Tesseract engine to detect individual characters. We test out our technique on a dataset of over 200 images taken in and around the campus of our college, and are successful in attaining better OCR results in comparison to traditional methods. Further, we suggest the utilization of a method to automatically assign ROIs (Regions Of Interest) to detected signboards, for better recognition of textual information.

Share and Cite:

Khan, A. , Nida Usmani, M. , Rahman, N. and Prasad, D. (2019) Pre-Processing Images of Public Signage for OCR Conversion. Journal of Signal and Information Processing, 10, 1-11. doi: 10.4236/jsip.2019.101001.

Cited by

No relevant information.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.