A method for detecting the profile of an author
Author(s) - Jesus Silva, Silvia García, María Alejandra Binda, F. González, Rosio Barrios, Bellanit Leon Castro, Ligia Inés García Castro
Publication year - 2020
Publication title - Procedia Computer Science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2020.03.101
Subject(s) - computer science, profiling (computer programming), natural language processing, competence (human resources), training set, artificial intelligence, software, information retrieval, programming language, psychology, social psychology
This paper presents a method for detecting two elements of an author’s profile: gender and age. The method is based on a set of dialogues written in English and Spanish, provided for the Author Profiling competition of the evaluation forum "Uncovering Plagiarism, Authorship, and Social Software Misuse" (PAN 2018). Counts of lexical, semantic, and syntactic features are used to build a two-phase classification system that first classifies gender and then age. The results show that, with the amount of data available, both the age and gender of an author can be identified with an accuracy greater than 50%. These values could, however, be improved with more evidence in the training data.
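The two-phase idea in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the four toy features in `count_features`, the `NearestCentroid` stand-in classifier, the `TwoPhaseProfiler` class, and in particular the choice to train one age classifier per gender so that the second phase depends on the first. The abstract does not specify the feature set, the learning algorithm, or how the two phases are coupled.

```python
from collections import defaultdict
import math
import re


def count_features(text):
    """Toy lexical and punctuation counts (hypothetical feature set)."""
    tokens = re.findall(r"\w+", text.lower())
    n = max(len(tokens), 1)  # avoid division by zero on empty text
    return [
        len(tokens),                         # number of word tokens
        len(set(tokens)) / n,                # type-token ratio
        sum(len(t) for t in tokens) / n,     # mean word length
        text.count("!") + text.count("?"),   # expressive punctuation marks
    ]


class NearestCentroid:
    """Stand-in classifier: picks the label whose mean feature vector is closest."""

    def fit(self, X, y):
        groups = defaultdict(list)
        for x, label in zip(X, y):
            groups[label].append(x)
        # One centroid per label: the column-wise mean of its feature vectors.
        self.centroids = {
            label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in groups.items()
        }
        return self

    def predict_one(self, x):
        return min(self.centroids, key=lambda lab: math.dist(x, self.centroids[lab]))


class TwoPhaseProfiler:
    """Phase 1 predicts gender; phase 2 predicts age with a gender-specific model."""

    def fit(self, texts, genders, ages):
        X = [count_features(t) for t in texts]
        self.gender_clf = NearestCentroid().fit(X, genders)
        self.age_clf = {}
        for g in set(genders):
            idx = [i for i, gi in enumerate(genders) if gi == g]
            self.age_clf[g] = NearestCentroid().fit(
                [X[i] for i in idx], [ages[i] for i in idx]
            )
        return self

    def predict(self, text):
        x = count_features(text)
        gender = self.gender_clf.predict_one(x)       # phase 1
        age = self.age_clf[gender].predict_one(x)     # phase 2
        return gender, age
```

Calling `TwoPhaseProfiler().fit(texts, genders, ages).predict(new_text)` returns a `(gender, age)` pair. Conditioning the age model on the predicted gender means a phase-1 error propagates into phase 2; training a single age model independently of gender is an equally plausible reading of "first classifies gender and then age".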