Bringing Order to Digital Libraries: From Keyphrase Extraction to Index Term Assignment
Author(s) -
Nicolai Erbs,
Iryna Gurevych,
Marc Rittberger
Publication year - 2013
Publication title -
d-lib magazine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.208
H-Index - 52
ISSN - 1082-9873
DOI - 10.1045/september2013-erbs
Subject(s) - term (time) , index (typography) , computer science , order (exchange) , information retrieval , world wide web , business , physics , finance , quantum mechanics
Collections of topically related documents held by digital libraries are valuable resources for users; however, as collections grow, it becomes more difficult to search them for specific information. Structure needs to be introduced to facilitate searching. Assigning index terms is helpful, but it is a tedious task even for professional indexers, requiring knowledge about the collection in general, and the document in particular. Automatic index term assignment (ITA) is considered to be a great improvement. In this paper we present a hybrid approach to index term assignment, using a combination of keyphrase extraction and multi-label classification. Keyphrase extraction efficiently assigns infrequently used
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom