Gender Identification via Reposting Behaviors in Social Media
Author(s) -
Dongxu Li,
Yongjun Li,
Wenli Ji
Publication year - 2017
Publication title -
ieee access
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.587
H-Index - 127
ISSN - 2169-3536
DOI - 10.1109/access.2017.2785813
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Social media has drawn scientists’ attention due to their potential values for health and marketing, among other activities. To effectively exploit such huge online resources by using incomplete user profiles, which are very common on social media sites and can be caused by user privacy settings, it is meaningful to explore user profile identification methods. In this paper, we focus on gender identification using reposting behaviors on social networks. Whereas most existing works rely on pure statistical methods, we propose a scheme that is underpinned by homophily and four intuitive methods based on it by combining knowledge of statistics and sociology. For our data set, which was obtained from Sina Weibo and contained 1039 test samples and 528k user profiles, our methods perform with 86.7% accuracy. We explore the sensitivity of our methods on the scale of the data set and find surprisingly competitive results surpassing the binary classification baseline. Finally, we further suggest possible extensions to our methods.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom