Leveraging metadata to recommend keywords for academic papers | Zendy

Blank Ido | Zendy; Rokach Lior | Zendy; Shani Guy | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Leveraging metadata to recommend keywords for academic papers

Author(s) -

Blank Ido,

Rokach Lior,

Shani Guy

Publication year - 2016

Publication title -

journal of the association for information science and technology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.903

H-Index - 145

eISSN - 2330-1643

pISSN - 2330-1635

DOI - 10.1002/asi.23571

Subject(s) - computer science , metadata , citation , listing (finance) , information retrieval , set (abstract data type) , world wide web , finance , economics , programming language

Users of research databases, such as CiteS eer X , G oogle S cholar, and M icrosoft A cademic, often search for papers using a set of keywords. Unfortunately, many authors avoid listing sufficient keywords for their papers. As such, these applications may need to automatically associate good descriptive keywords with papers. When the full text of the paper is available this problem has been thoroughly studied. In many cases, however, due to copyright limitations, research databases do not have access to the full text. On the other hand, such databases typically maintain metadata, such as the title and abstract and the citation network of each paper. In this paper we study the problem of predicting which keywords are appropriate for a research paper, using different methods based on the citation network and available metadata. Our main goal is in providing search engines with the ability to extract keywords from the available metadata. However, our system can also be used for other applications, such as for recommending keywords for the authors of new papers. We create a data set of research papers, and their citation network, keywords, and other metadata, containing over 470 K papers with and more than 2 million keywords. We compare our methods with predicting keywords using the title and abstract, in offline experiments and in a user study, concluding that the citation network provides much better predictions.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research