An Arabic CCG approach for determining constituent types from Arabic Treebank | Zendy

Ahmed I. El-taher | Zendy; Hitham M. Abo Bakr | Zendy; Ibrahim Zidan | Zendy; Khaled Shaalan | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

An Arabic CCG approach for determining constituent types from Arabic Treebank

Author(s) -

Ahmed I. El-taher,

Hitham M. Abo Bakr,

Ibrahim Zidan,

Khaled Shaalan

Publication year - 2014

Publication title -

journal of king saud university - computer and information sciences

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.617

H-Index - 33

eISSN - 2213-1248

pISSN - 1319-1578

DOI - 10.1016/j.jksuci.2014.06.005

Subject(s) - treebank , computer science , combinatory categorial grammar , natural language processing , arabic , artificial intelligence , preprocessor , identification (biology) , process (computing) , annotation , linguistics , rule based machine translation , programming language , tree adjoining grammar , botany , context free grammar , philosophy , biology

Converting a treebank into a CCGbank opens the respective language to the sophisticated tools developed for Combinatory Categorial Grammar (CCG) and enriches cross-linguistic development. The conversion is primarily a three-step process: determining constituents’ types, binarization, and category conversion. Usually, this process involves a preprocessing step to the Treebank of choice for correcting brackets and normalizing tags for any changes that were introduced during the manual annotation, as well as extracting morpho-syntactic information that is necessary for determining constituents’ types. In this article, we describe the required preprocessing step on the Arabic Treebank, as well as how to determine Arabic constituents’ types. We conducted an experiment on parts 1 and 2 of the Penn Arabic Treebank (PATB) aimed at converting the PATB into an Arabic CCGbank. The performance of our algorithm when applied to ATB1v2.0 & ATB2v2.0 was 99% identification of head nodes and 100% coverage over the Treebank data

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research