z-logo
open-access-imgOpen Access
Automatically Determining Versions of Scholarly Articles
Author(s) -
Daniel Rothchild,
Stuart M. Shieber
Publication year - 2017
Publication title -
scholarly and research communication
Language(s) - English
Resource type - Journals
ISSN - 1923-0702
DOI - 10.22230/src.2017v8n1a268
Subject(s) - pairwise comparison , computer science , information retrieval , simple (philosophy) , data science , data mining , artificial intelligence , philosophy , epistemology
Background: Repositories of scholarly articles should provide authoritative information about the materials they distribute and should distribute those materials in keeping with pertinent laws. To do so, it is important to have accurate information about the versions of articles in a collection. Analysis: This article presents a simple statistical model to classify articles as author manuscripts or versions of record, with parameters trained on a collection of articles that have been hand-annotated for version. The algorithm achieves about 94 percent accuracy on average (cross-validated). Conclusion and implications: The average pairwise annotator agreement among a group of experts was 94 percent, showing that the method developed in this article displays performance competitive with human experts.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom