Document Image Binarization Using Retinex and Global Thresholding

Marian Wagdy, Ibrahima Faye, Dayang Rohaya

Abstract

Document images are usually degraded in the course of photocopying, faxing, printing, or scanning. Such degradation may seem negligible to the human eye, but it can cause an abrupt decline in the accuracy of the current generation of optical character recognition (OCR) systems. In this paper, we present a binarization method based on retinex theory followed by a global threshold. Compared with previous work, the method produces high-quality results in terms of both visual criteria and OCR performance.
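
The pipeline named in the abstract (retinex enhancement, then a single global threshold) can be illustrated with a minimal sketch. The abstract does not state which retinex variant or which thresholding rule the authors use, so the code below assumes single-scale retinex (log of the image minus log of a Gaussian-blurred copy) and Otsu's threshold. The function names, the `sigma` value, and the NumPy-only Gaussian blur are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def gaussian_blur(img, sigma):
    """Separable Gaussian blur with edge padding (NumPy only)."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1, dtype=np.float64)
    kernel = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    kernel /= kernel.sum()
    padded = np.pad(img, radius, mode="edge")

    def smooth(v):
        return np.convolve(v, kernel, mode="valid")

    rows = np.apply_along_axis(smooth, 1, padded)  # blur along each row
    return np.apply_along_axis(smooth, 0, rows)    # then along each column


def single_scale_retinex(gray, sigma=15.0):
    """Assumed retinex variant: log(I) - log(blur(I)), rescaled to 0..255."""
    img = gray.astype(np.float64) + 1.0            # +1 avoids log(0)
    r = np.log(img) - np.log(gaussian_blur(img, sigma))
    lo, hi = r.min(), r.max()
    if hi == lo:                                   # flat page: treat as blank
        return np.full(gray.shape, 255, dtype=np.uint8)
    return (255.0 * (r - lo) / (hi - lo)).astype(np.uint8)


def otsu_threshold(gray_u8):
    """Assumed global threshold: Otsu's between-class variance maximum."""
    hist = np.bincount(gray_u8.ravel(), minlength=256).astype(np.float64)
    prob = hist / hist.sum()
    omega = np.cumsum(prob)                        # weight of class <= t
    mu = np.cumsum(prob * np.arange(256))          # cumulative mean
    num = (mu[-1] * omega - mu) ** 2
    denom = omega * (1.0 - omega)
    valid = denom > 1e-12
    sigma_b = np.zeros(256)
    sigma_b[valid] = num[valid] / denom[valid]
    return int(np.argmax(sigma_b))


def binarize(gray, sigma=15.0):
    """Retinex enhancement followed by one global threshold.

    Returns a uint8 image with 0 for ink and 255 for background.
    """
    enhanced = single_scale_retinex(gray, sigma)
    t = otsu_threshold(enhanced)
    return np.where(enhanced > t, 255, 0).astype(np.uint8)
```

The reason for the retinex step in this sketch is that dividing out a smooth illumination estimate makes dark strokes on a shaded page comparable to dark strokes on a bright page, so one threshold can serve the whole image. The value of `sigma` should be large relative to the stroke width; 15 is only a placeholder for small test images.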

Keywords

Thresholding; binarization; retinex theory; degradation; lightness; digitization; OCR systems.

Copyright (c) 2015 Marian Wagdy, Ibrahima Faye, Dayang Rohaya