z-logo
open-access-imgOpen Access
Author Attribution of Arabic Texts Using Extended Probabilistic Context Free Grammar Language Model
Author(s) -
Ibrahim S. I. Abuhaiba,
Mohammad F. Eltibi
Publication year - 2016
Publication title -
international journal of intelligent systems and applications
Language(s) - English
Resource type - Journals
eISSN - 2074-9058
pISSN - 2074-904X
DOI - 10.5815/ijisa.2016.06.04
Subject(s) - computer science , natural language processing , artificial intelligence , punctuation , discriminative model , classifier (uml) , grammar , probabilistic logic , word error rate , arabic , language model , linguistics , philosophy
Author attribution is the problem of assigning an author to an unknown text. We propose a new approach to solve such a problem using an extended version of the probabilistic context free grammar language model, supplied by more informative lexical and syntactic features. In addition to the probabilities of the production rules in the generated model, we add probabilit ies to terminals, non-terminals, and punctuation marks. Also, the new model is augmented with a scoring function which assigns a score for each p roduction rule. Since the new model contains different features, optimu m weights, found using a genetic algorithm, are added to the model to govern how each feature participates in the classification. The advantage of using many features is to successfully capture the different writ ing styles of authors. Also, using a scoring function identifies the most discriminative ru les. Using optimu m weights supports capturing different authors' styles, which increases the classifier's performance. The new model is tested over nine authors, 20 Arabic documents per author, where the training and testing are done using the leave-one-out method. The initial error rate of the system is 20.6%. Using the optimu m weights for features reduces the error rate to 12.8%.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom