Text Line Segmentation of Historical Arabic Documents | Zendy

Abderrazak  Zahour | Zendy; Laurence  Likforman-Sulem | Zendy; Wafa  Boussellaa | Zendy; Bruno  Taconet | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Text Line Segmentation of Historical Arabic Documents

Author(s) -

Abderrazak Zahour,

Laurence Likforman-Sulem,

Wafa Boussellaa,

Bruno Taconet

Publication year - 2007

Publication title -

ninth international conference on document analysis and recognition (icdar 2007)

Language(s) - English

DOI - 10.1109/icdar.2007.245

This paper presents a text line segmentation method for printed or handwritten historical Arabic documents. Documents are first classified into 2 classes using a K-means scheme. These classes correspond to document complexity (easy or not easy to segment). Then, a document which includes overlapping and touching characters, is divided into vertical strips. The extracted text blocks obtained by horizontal projection are classified into three categories: small, average and large text blocks. After segmenting the large text blocks, the lines are obtained by matching adjacent blocks within two successive strips using spatial relationship. The document without overlapping or touching characters is segmented by making abstraction on the segmentation module of the large text blocks. The text line segmentation method has a 96% accuracy on a collection of 100 historical documents

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research