Open Access
Churners Prediction Based on Mining the Content of Social Network Taxonomy
Author(s) - Naser Alzubaidi, Eman S. Al-Shamery
Publication year - 2019
Publication title - International Journal of Recent Technology and Engineering
Language(s) - English
Resource type - Journals
ISSN - 2277-3878
DOI - 10.35940/ijrte.b1056.0982s1019
Subject(s) - computer science, data mining, cluster analysis, leverage (statistics), centrality, boosting (machine learning), raw data, outlier, data science, artificial intelligence, machine learning, statistics, mathematics, programming language
Customer churn is one of the most important and difficult problems facing large companies because it directly affects revenue, especially in the telecom domain, where companies are searching for advanced strategies to predict whether a customer will churn. This research focuses on constructing a predictive model that classifies each customer as a churner or non-churner and yields additional insight into service consumers. The main contribution is to overcome the limitation of relying solely on data mining strategies by deriving network metrics, such as centrality and connectivity between customers, and incorporating network mining into traditional data mining. Social network measurements, e.g. leverage, flow betweenness, PageRank, clustering coefficient, and eccentricity, are joined with the other attributes in the original network dataset to enhance the performance of the proposed methodology. The approach predicts churn risk by extensively cleaning the raw data for churn modeling, dividing customers into clusters using Gower distance and the k-medoids algorithm to help understand and predict churners, building a classification model with Extreme Gradient Boosting (XGBoost), and assessing model performance after computing the centrality metrics as new attributes appended to the original network dataset. Experiments conducted on a telecom dataset show an average accuracy, across all statistics, of not lower than 98.27%, while the average accuracy for the original dataset with its clusters does not exceed 0.97%.
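The clustering step named above (Gower distance with k-medoids) can be illustrated with a minimal sketch. It is not the authors' implementation: the column layout (monthly minutes, SMS count, plan type), the function names, and the simple alternating k-medoids update with a deterministic start are all assumptions made for illustration.

```python
from typing import List, Optional, Sequence, Tuple, Union

Value = Union[int, float, str]


def column_ranges(rows: Sequence[Sequence[Value]]) -> List[Optional[float]]:
    """Range (max - min) per numeric column; None marks a categorical column."""
    ranges: List[Optional[float]] = []
    for col in zip(*rows):
        if all(isinstance(v, (int, float)) for v in col):
            ranges.append(float(max(col) - min(col)))
        else:
            ranges.append(None)
    return ranges


def gower_distance(a: Sequence[Value], b: Sequence[Value],
                   ranges: Sequence[Optional[float]]) -> float:
    """Average per-column dissimilarity: range-scaled absolute difference for
    numeric columns, 0/1 mismatch for categorical columns."""
    total = 0.0
    for x, y, r in zip(a, b, ranges):
        if r is None:
            total += 0.0 if x == y else 1.0
        elif r > 0:
            total += abs(x - y) / r
        # a constant numeric column (r == 0) contributes nothing
    return total / len(ranges)


def k_medoids(dist: List[List[float]], k: int,
              max_iter: int = 100) -> Tuple[List[int], List[int]]:
    """Alternating k-medoids on a precomputed distance matrix.
    Assumption: the first k rows serve as the initial medoids."""
    n = len(dist)
    medoids = list(range(k))

    def assign(meds: List[int]) -> List[int]:
        return [min(range(k), key=lambda c: dist[i][meds[c]]) for i in range(n)]

    for _ in range(max_iter):
        labels = assign(medoids)
        new_medoids = []
        for c in range(k):
            members = [i for i in range(n) if labels[i] == c]
            if not members:  # empty cluster: keep the previous medoid
                new_medoids.append(medoids[c])
                continue
            # new medoid: the member with the smallest total distance to its cluster
            new_medoids.append(
                min(members, key=lambda m: sum(dist[m][j] for j in members)))
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids, assign(medoids)


# Hypothetical customers: (monthly minutes, SMS count, plan type)
customers = [
    (120.0, 3, "prepaid"),
    (130.0, 4, "prepaid"),
    (900.0, 40, "postpaid"),
    (950.0, 42, "postpaid"),
]
ranges = column_ranges(customers)
matrix = [[gower_distance(a, b, ranges) for b in customers] for a in customers]
medoids, labels = k_medoids(matrix, k=2)
```

Gower distance is used here because it handles numeric and categorical attributes in one measure, and k-medoids works directly on such a precomputed distance matrix. The resulting cluster labels could then be appended to each customer record as an extra attribute.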
The proposed churner detection method, which combines social influence and network content on top of clustering, significantly improved prediction accuracy on the telecom dataset compared with prediction from call log details and network information without clustering. This validates the hypothesis that combining social network attributes with users' Call/SMS information can yield substantially improved customer churn prediction.
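The idea of appending social network attributes to per-customer usage data can be sketched as follows. This is an illustrative assumption rather than the paper's code: it computes only degree, local clustering coefficient, and eccentricity (the abstract also names leverage, flow betweenness, and PageRank), and the call list, record fields, and function names are hypothetical.

```python
from collections import deque
from typing import Dict, Iterable, Set, Tuple

Graph = Dict[str, Set[str]]


def build_graph(calls: Iterable[Tuple[str, str]]) -> Graph:
    """Undirected call graph: an edge links two customers who called or texted."""
    graph: Graph = {}
    for a, b in calls:
        if a == b:
            continue  # ignore self-loops
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def clustering_coefficient(graph: Graph, node: str) -> float:
    """Fraction of pairs of a node's neighbours that are themselves connected."""
    nbrs = list(graph[node])
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(1 for i in range(k) for j in range(i + 1, k)
                if nbrs[j] in graph[nbrs[i]])
    return 2.0 * links / (k * (k - 1))


def eccentricity(graph: Graph, node: str) -> int:
    """Largest shortest-path length from node, within its connected component."""
    dist = {node: 0}
    queue = deque([node])
    while queue:
        u = queue.popleft()
        for v in graph[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return max(dist.values())


def add_network_features(records: Dict[str, dict],
                         calls: Iterable[Tuple[str, str]]) -> Dict[str, dict]:
    """Return copies of the records with network metrics added as new columns."""
    graph = build_graph(calls)
    enriched = {}
    for customer, attrs in records.items():
        row = dict(attrs)
        in_graph = customer in graph
        row["degree"] = len(graph[customer]) if in_graph else 0
        row["clustering"] = clustering_coefficient(graph, customer) if in_graph else 0.0
        row["eccentricity"] = eccentricity(graph, customer) if in_graph else 0
        enriched[customer] = row
    return enriched


# Hypothetical call log and usage records
calls = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")]
records = {"a": {"sms": 12}, "c": {"sms": 7}, "e": {"sms": 0}}
features = add_network_features(records, calls)
```

The enriched rows would then be passed to a classifier such as XGBoost, as the abstract describes; that step is omitted here to keep the sketch free of external dependencies.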
