z-logo
open-access-imgOpen Access
A Framework for the Selection of Binarization Techniques on Palm Leaf Manuscripts Using Support Vector Machine
Author(s) -
Rapeeporn Chamchong,
Chun Che Fung
Publication year - 2015
Publication title -
advances in decision sciences
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.178
H-Index - 13
eISSN - 2090-3367
pISSN - 2090-3359
DOI - 10.1155/2015/925935
Subject(s) - computer science , artificial intelligence , support vector machine , pattern recognition (psychology) , grayscale , selection (genetic algorithm) , process (computing) , benchmarking , oversampling , segmentation , image (mathematics) , feature (linguistics) , feature selection , feature extraction , pixel , data mining , computer vision , computer network , linguistics , philosophy , bandwidth (computing) , marketing , business , operating system
Challenges for text processing in ancient document images are mainly due to the high degree of variations in foreground and background. Image binarization is an image segmentation technique used to separate the image into text and background components. Although several techniques for binarizing text documents have been proposed, the performance of these techniques varies and depends on the image characteristics. Therefore, selecting binarization techniques can be a key idea to achieve improved results. This paper proposes a framework for selecting binarizing techniques of palm leaf manuscripts using Support Vector Machines (SVMs). The overall process is divided into three steps: (i) feature extraction: feature patterns are extracted from grayscale images based on global intensity, local contrast, and intensity; (ii) treatment of imbalanced data: imbalanced dataset is balanced by using Synthetic Minority Oversampling Technique as to improve the performance of prediction; and (iii) selection: SVM is applied in order to select the appropriate binarization techniques. The proposed framework has been evaluated with palm leaf manuscript images and benchmarking dataset from DIBCO series and compared the performance of prediction between imbalanced and balanced datasets. Experimental results showed that the proposed framework can be used as an integral part of an automatic selection process

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom