z-logo
open-access-imgOpen Access
Semi-supervised Persian font recognition
Author(s) -
Maryam Bahojb Imani,
Mohammad Reza Keyvanpour,
Reza Azmi
Publication year - 2011
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2010.12.057
Subject(s) - computer science , artificial intelligence , classifier (uml) , pattern recognition (psychology) , training set , labeled data , supervised learning , machine learning , optical character recognition , test data , artificial neural network , image (mathematics) , programming language
Font recognition is one of the fundamental tasks in document recognition, because it is an important factor in optical character recognition. Classical supervised methods need lot of labeled data to train a classifier. Since it is very costly and time consuming to label large amounts of data, it is useful to use data sets without labels. So many different semi-supervised learning methods have been studied recently. Among the semi-supervised methods, self-training is one of the important learning algorithms that classify the unlabeled samples with small amount of labeled ones and add the most confident samples to the training set. In this paper, we apply majority vote approach to classify the unlabeled data to reliable and unreliable classes. Then, we add the reliable data to training set and classify the remaining data including unreliable data in iterative process. We test this method on the extracted features of ten common Persian fonts. Experimental result indicates that proposed method improves the classification performance and it’s effective

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom