Open Access
Extracting and matching authors and affiliations in scholarly documents
Author(s) -
Huy Hoang Nhat,
Muthu Kumar Chandrasekaran,
Philip S. Cho,
MinYen Kan
Publication year - 2013
Publication title -
CiteSeerX (The Pennsylvania State University)
Language(s) - English
Resource type - Conference proceedings
ISSN - 2575-7865
DOI - 10.1145/2467696.2467703
Subject(s) - benchmark (surveying), computer science, conditional random field, matching (statistics), set (abstract data type), field (mathematics), digital library, support vector machine, information retrieval, data science, world wide web, artificial intelligence, geography, programming language, statistics, mathematics, art, poetry, geodesy, literature, pure mathematics
We introduce Enlil, an information extraction system that discovers the institutional affiliations of authors in scholarly papers. Enlil works in two steps: first, a conditional random field identifies authors and affiliations; second, a support vector machine connects authors to their affiliations. We benchmark Enlil in three separate experiments drawn from three different sources: the ACL Anthology Corpus, the ACM Digital Library, and a set of cross-disciplinary scientific journal articles acquired by querying Google Scholar. Against a state-of-the-art production baseline, Enlil reports a statistically significant improvement in F_1 of nearly 10% (p << 0.01). In the case of multidisciplinary articles from Google Scholar, Enlil is benchmarked over both clean input (F_1 > 90%) and automatically acquired input (F_1 > 80%). We have deployed Enlil in a case study involving Asian genomics research publication patterns to understand how government-sponsored collaborative links evolve. Enlil has enabled our team to construct and validate new metrics to quantify the facilitation of research as opposed to direct publication.
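The abstract reports its results as F_1 scores. As a rough illustration only, the sketch below shows one way F_1 could be computed over author-affiliation links. It assumes that a link counts as correct only when the (author, affiliation) pair matches the gold standard exactly; the abstract does not state the paper's matching criterion. The function name `pair_f1`, the `Pair` type, and the toy data are all hypothetical and are not taken from the paper.

```python
from typing import Set, Tuple

# Hypothetical representation of one extracted link: (author, affiliation).
Pair = Tuple[str, str]


def pair_f1(predicted: Set[Pair], gold: Set[Pair]) -> float:
    """Return F_1 over exact-match (author, affiliation) pairs.

    Assumption: a predicted pair is correct only if it appears
    verbatim in the gold set.
    """
    true_positives = len(predicted & gold)
    if true_positives == 0:
        # Covers empty inputs and the no-overlap case without dividing by zero.
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    # F_1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Toy data invented purely for illustration (not from the paper).
gold = {
    ("A. Author", "University X"),
    ("B. Author", "University X"),
    ("C. Author", "Institute Y"),
}
predicted = {
    ("A. Author", "University X"),
    ("B. Author", "Institute Y"),  # wrong link: counts against precision and recall
    ("C. Author", "Institute Y"),
}

score = pair_f1(predicted, gold)
print(f"F_1 = {score:.3f}")
```

In this toy case two of the three predicted links are correct and two of the three gold links are recovered, so precision and recall are both 2/3. A looser criterion, such as partial string overlap on affiliation names, would give different numbers.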
