z-logo
open-access-imgOpen Access
User Profiling for CSDN: Keyphrase Extraction, User Tagging and User Growth Value Prediction: First-place Entry for User Profiling Technology Evaluation Campaign in SMP Cup 2017
Author(s) -
Guoliang Xing,
Hao Gao,
Qi Cao,
Yue Xinyu,
Bingbing Xu,
Keting Cen,
Huawei Shen
Publication year - 2019
Publication title -
data intelligence
Language(s) - English
Resource type - Journals
eISSN - 2096-7004
pISSN - 2641-435X
DOI - 10.1162/dint_a_00015
Subject(s) - computer science , profiling (computer programming) , classifier (uml) , softmax function , artificial intelligence , generality , machine learning , data mining , artificial neural network , operating system , psychology , psychotherapist
The Chinese Software Developer Network (CSDN) is one of the largest information technology communities and service platforms in China. This paper describes the user profiling for CSDN, an evaluation track of SMP Cup 2017. It contains three tasks: (1) user document keyphrase extraction, (2) user tagging and (3) user growth value prediction. In the first task, we treat keyphrase extraction as a classification problem and train a Gradient-Boosting-Decision-Tree model with comprehensive features. In the second task, to deal with class imbalance and capture the interdependency between classes, we propose a two-stage framework: (1) for each class, we train a binary classifier to model each class against all of the other classes independently; (2) we feed the output of the trained classifiers into a softmax classifier, tagging each user with multiple labels. In the third task, we propose a comprehensive architecture to predict user growth value. Our contributions in this paper are summarized as follows: (1) we extract various types of features to identify the key factors in user value growth; (2) we use the semi-supervised method and the stacking technique to extend labeled data sets and increase the generality of the trained model, resulting in an impressive performance in our experiments. In the competition, we achieved the first place out of 329 teams.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom