Premium
Text binarization in color documents
Author(s) -
Badekas Efthimios,
Nikolaou Nikos,
Papamarkos Nikos
Publication year - 2006
Publication title -
international journal of imaging systems and technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.359
H-Index - 47
eISSN - 1098-1098
pISSN - 0899-9457
DOI - 10.1002/ima.20092
Subject(s) - artificial intelligence , computer science , pixel , binary image , computer vision , grayscale , pattern recognition (psychology) , property (philosophy) , color balance , block (permutation group theory) , image (mathematics) , color image , binary number , image processing , mathematics , arithmetic , philosophy , geometry , epistemology
This article presents a new method for the binarization of color document images. Initially, the colors of the document image are reduced to a small number using a new color reduction technique. Specifically, this technique estimates the dominant colors and then assigns the original image colors to them in order that the background and text components to become uniform. Each dominant color defines a color plane in which the connected components ( CC s) are extracted. Next, in each color plane a CC filtering procedure is applied which is followed by a grouping procedure. At the end of this stage, blocks of CC s are constructed which are next redefined by obtaining the direction of connection (DOC) property for each CC . Using the DOC property, the blocks of CC s are classified as text or nontext. The identified text blocks are binarized properly using suitable binarization techniques, considering the rest of the pixels as background. The final result is a binary image which contains always black characters in white background independently of the original colors of each text block. The proposed document binarization approach can also be used for binarization of noisy color (or gray‐scale) document images. Several experiments that confirm the effectiveness of the proposed technique are presented. © 2007 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 16, 262–274, 2006