z-logo
open-access-imgOpen Access
Algorithm for psycholinguistic analysis of social networks texts using the Big Five Personality Traits
Author(s) -
N.G. Yarushkina,
Vadim С. Moshkin,
I.A. Andreev
Publication year - 2022
Publication title -
ontologiâ proektirovaniâ
Language(s) - English
Resource type - Journals
eISSN - 2313-1039
pISSN - 2223-9537
DOI - 10.18287/2223-9537-2022-12-1-82-92
Subject(s) - computer science , natural language processing , artificial intelligence , machine learning , respondent , social network (sociolinguistics) , support vector machine , information retrieval , social media , world wide web , political science , law
The paper presents an approach to determining the psychological characteristics of a user of social networks through the analysis of text messages in social networks. The proposed approach includes the user's texts classification using machine learning. The results of the analysis of user surveys in accordance with the Big Five model, as well as a set of author's text data from social network pages, are used as training data. The questionnaire contains paired statements, and the respondent determines the degree of their own agreement with one or another statement on a scale from 0 to 4. Natural language text processing (NLP) methods were applied to the text resources used as input data for the classifier, as well as the RuWordNet linguistic ontology, in order to level out a number of features of social network texts, for ex-ample, the presence of grammatical errors and emoticons that complicate the process. semantic analysis. Two models were used as classifiers: the support vector machine and the random forest method. The area under the error curve (AUC ROC) metric was used to evaluate performance. The experiments used open text data of more than 1000 users of social networks.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here