A multi-scale framework for adaptive binarization of degraded document images

Reza Farrahi Moghaddam,Mohamed Cheriet
DOI: https://doi.org/10.1016/j.patcog.2009.12.024
IF: 8
2010-06-01
Pattern Recognition
Abstract:In this work, a multi-scale binarization framework is introduced, which can be used along with any adaptive threshold-based binarization method. This framework is able to improve the binarization results and to restore weak connections and strokes, especially in the case of degraded historical documents. This is achieved thanks to localized nature of the framework on the spatial domain. The framework requires several binarizations on different scales, which is addressed by introduction of fast grid-based models. This enables us to explore high scales which are usually unreachable to the traditional approaches. In order to expand our set of adaptive methods, an adaptive modification of Otsu's method, called AdOtsu, is introduced. In addition, in order to restore document images suffering from bleed-through degradation, we combine the framework with recursive adaptive methods. The framework shows promising performance in subjective and objective evaluations performed on available datasets.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?