z-logo
open-access-imgOpen Access
Author Based Rank Vector Coordinates (ARVC) Model for Authorship Attribution
Author(s) -
N. Raju,
Vikram Kumar,
O. Srinivasa Rao
Publication year - 2016
Publication title -
international journal of image graphics and signal processing
Language(s) - English
Resource type - Journals
eISSN - 2074-9082
pISSN - 2074-9074
DOI - 10.5815/ijigsp.2016.05.06
Subject(s) - singular value decomposition , computer science , cosine similarity , rank (graph theory) , authorship attribution , vector space model , identification (biology) , ambiguity , similarity (geometry) , information retrieval , dimension (graph theory) , matrix (chemical analysis) , field (mathematics) , data mining , artificial intelligence , pattern recognition (psychology) , mathematics , botany , materials science , composite material , combinatorics , biology , pure mathematics , image (mathematics) , programming language
Authorship attribution is one of the important problem, with many applications of practical use in the real-world. Authorship identification determines the likelihood of a piece of writing produced by a particular author by examining the other writings of that author. Most of the research in this field is carried out by using instance based model. One of the disadvantages of this model is that it treats the different documents of each author differently. It produces a matrix per each document of the author, thus creating a huge number of matrices per author, i.e. the dimensionality is very high. This paper presents authorship identification using Author based Rank Vector Coordinates (ARVC) model. The advantage of the proposed ARVC model is that it integrates all the author's profile documents into a single integrated profile document (IPD) and thus overcomes the above disadvantage. To overcome the ambiguity created by common words of authors ARVC model removes the common words based on a threshold. Singular value decomposition (SVD) is used on IPD after removing the common words. To reduce the overall dimension of the matrix, without affecting its semantic meaning a rank- based vector coordinates are derived. The eigenvector features are derived on ARVC model. The present paper used cosine similarity measure for author attribution and carries out authorship attribution on English poems and editorial documents Index Terms—Threshold, Common words, Integration, ARVC model, SVD technique.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom