Link Definition Ameliorating Community Detection in Collaboration Networks
Author(s) -
Saharnaz Dilmaghani,
Matthias R. Brust,
Apivadee Piyatumrong,
Grégoire Danoy,
Pascal Bouvry
Publication year - 2019
Publication title -
frontiers in big data
Language(s) - English
Resource type - Journals
ISSN - 2624-909X
DOI - 10.3389/fdata.2019.00022
Subject(s) - centrality , computer science , set (abstract data type) , data mining , cluster analysis , quality (philosophy) , clustering coefficient , data science , network analysis , linkage (software) , machine learning , mathematics , statistics , engineering , philosophy , biochemistry , chemistry , epistemology , electrical engineering , gene , programming language
Collaboration networks are defined as a set of individuals who come together and collaborate on particular tasks such as publishing a paper. The analysis of such networks permits to extract knowledge on the structure and patterns of communities. The link definition and network extraction have a high impact on the analysis of collaboration networks. Previous studies model the connectivity in a network considering it as a binomial problem with respect to the existence of a collaboration between individuals. However, such a data consists of a high diversity of features that describe the quality of the interaction such as the contribution amount of each individual. In this paper, we have determined a solution to extract collaboration networks using corresponding features in a dataset. We define collaboration score to quantify the collaboration between collaborators. In order to validate our proposed method, we benefit from a scientific research institute dataset in which researchers are co–authors who are involved in the production of papers, prototypes, and intellectual properties (IP). We evaluated the generated networks, produced through different thresholds of collaboration score , by employing a set of network analysis metrics such as clustering coefficient, network density, and centrality measures. We investigated more the obtained networks using a community detection algorithm to further discuss the impact of our model on community detection. The outcome shows that the quality of resulted communities on the extracted collaboration networks can differ significantly based on the choice of the linkage threshold.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom