z-logo
Premium
Modeling user reputation in wikis
Author(s) -
Javanmardi Sara,
Lopes Cristina,
Baldi Pierre
Publication year - 2010
Publication title -
statistical analysis and data mining: the asa data science journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.381
H-Index - 33
eISSN - 1932-1872
pISSN - 1932-1864
DOI - 10.1002/sam.10070
Subject(s) - reputation , computer science , parsing , quality (philosophy) , classifier (uml) , world wide web , information retrieval , data science , data mining , artificial intelligence , social science , philosophy , epistemology , sociology
Collaborative systems available on the Web allow millions of users to share information through a growing collection of tools and platforms such as wikis, blogs, and shared forums. By their very nature, these systems contain resources and information with different quality levels. The open nature of these systems, however, makes it difficult for users to determine the quality of the available information and the reputation of its providers. Here, we first parse and mine the entire English Wikipedia history pages in order to extract detailed user edit patterns and statistics. We then use these patterns and statistics to derive three computational models of a user's reputation. Finally, we validate these models using ground‐truth Wikipedia data associated with vandals and administrators. When used as a classifier, the best model produces an area under the receiver operating characteristic (ROC) curve (AUC) of 0.98. Furthermore, we assess the reputation predictions generated by the models on other users, and show that all three models can be used efficiently for predicting user behavior in Wikipedia. Copyright © 2010 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 3: 126‐139, 2010

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here