Topical Word Importance for Fast Keyphrase Extraction
Author(s) -
Lucas Sterckx,
Thomas Demeester,
Johannes Deleu,
Chris Develder
Publication year - 2015
Publication title -
ghent university academic bibliography (ghent university)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2740908.2742730
Subject(s) - computer science , pagerank , topic model , word (group theory) , random walk , artificial intelligence , natural language processing , information retrieval , statistics , mathematics , linguistics , philosophy
We propose an improvement on a state-of-the-art keyphrase extraction algorithm, Topical PageRank (TPR), incorporating topical information from topic models. While the original algorithm requires a random walk for each topic in the topic model being used, ours is independent of the topic model, computing but a single PageRank for each text regardless of the amount of topics in the model. This increases the speed drastically and enables it for use on large collections of text using vast topic models, while not altering performance of the original algorithm
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom