z-logo
open-access-imgOpen Access
Providing a Link Prediction Model based on Structural and Homophily Similarity in Social Networks
Author(s) -
Alireza Eshaghpoor,
Mostafa Salehi,
Vahid Ranjbar
Publication year - 2020
Publication title -
signal and data processing
Language(s) - English
Resource type - Journals
eISSN - 2538-421X
pISSN - 2538-4201
DOI - 10.29252/jsdp.16.4.45
Subject(s) - homophily , similarity (geometry) , computer science , metric (unit) , graph , social network (sociolinguistics) , network science , data mining , field (mathematics) , data science , machine learning , complex network , artificial intelligence , theoretical computer science , social media , mathematics , world wide web , image (mathematics) , operations management , combinatorics , pure mathematics , economics
In recent years, with the growing number of online social networks, these networks have become one of the best markets for advertising and commerce, so studying these networks is very important. Forecasting new edges in online social networks can give us a better understanding of the growth of these networks. There have been many studies of link prediction in the field of engineering and humanities. Scientists attribute the existence of a new relationship between two individuals for two reasons: 1) Proximity to the graph (structure) 2) Similar properties of the two individuals (Homophile law). However, studying the impact of the two approaches working together to create new edges remains an open problem. Similarity metrics can also be divided into two categories; Neighborhood-based and path-based. So far, the above two theoretical approaches (proximity and homophile) have not been found together in the neighborhood-based metrics. In this paper, we first attempt to provide a solution to determine importance of the proximity to the graph and similar features in the connectivity of the graphs. Then obtained weights are assigned to both proximity and homophile. Then the best similarity metric in each approach are obtained. Finally, the selected metric of homophily similarity and structural similarity are combined with the obtained weights. The results of this study were evaluated on two datasets; Zanjan University Graduate School of Social Sciences and Pokec online Social Network. The first data set was collected for this study and then the questionnaires and data collection methods were filled out. Since this dataset is one of the few Iranian datasets that has been compiled with its users' specifications, it can be of great value. In this paper, we have been able to increase the accuracy of Neighborhood-based similarity metric by using two proximity in graph and homophily approaches.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom