z-logo
open-access-imgOpen Access
Word Extraction and Recognition in Arabic Handwritten Text
Author(s) -
Nabil Aouadi,
Afef Kacem Echi
Publication year - 2016
Publication title -
international journal of computing and information sciences
Language(s) - English
Resource type - Journals
eISSN - 1708-0479
pISSN - 1708-0460
DOI - 10.21700/ijcis.2016.103
Subject(s) - arabic , natural language processing , word (group theory) , computer science , artificial intelligence , word recognition , speech recognition , linguistics , reading (process) , philosophy
Segmenting arabic manuscripts into text-lines and words is an important step to make recognition systems more efficient and accurate. The major problem making this task crucial is the word extraction process: first, words are often a succession of sub-words where the space value between these sub-words do not respect any rules. Second, the presence of connections even between non adjacent sub-words in the same text-line, makes word’s parts identification and the entire word extraction difficult. This work proposes an automatic system for arabic handwritten word extraction and recognition based on 1) localizing and segmenting touching characters, 2) extracting real subwords and structural features from word images and 3) recognizing them by a Markovian classifier. The performance of the proposed system is tested using samples extracted from historical handwritten documents. The obtained results are encouraging. We achieved an average rate of recognition of 87%.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom