Evaluating topic representations for exploring document collections



Author(s) - Aletras Nikolaos, Baldwin Timothy, Lau Jey Han, Stevenson Mark

Publication year - 2017

Publication title - Journal of the Association for Information Science and Technology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.903

H-Index - 145

eISSN - 2330-1643

pISSN - 2330-1635

DOI - 10.1002/asi.23574

Subject(s) - computer science, information retrieval, phrase, topic model, term (time), representation (politics), visualization, task (project management), natural language processing, artificial intelligence, physics, management, quantum mechanics, politics, political science, law, economics

Topic models have been shown to be a useful way of representing the content of large document collections, for example, via visualization interfaces (topic browsers). These systems enable users to explore collections by way of latent topics. A standard way to represent a topic is a term list; that is, the top-n words with the highest conditional probability within the topic. Other topic representations, such as textual and image labels, have also been proposed. However, there has been no comparison of these alternative representations. In this article, we compare 3 different topic representations in a document retrieval task. Participants were asked to retrieve relevant documents based on predefined queries within a fixed time limit, with topics presented in one of the following modalities: (a) lists of terms, (b) textual phrase labels, and (c) image labels. Results show that textual labels are easier for users to interpret than are term lists and image labels. Moreover, the precision of retrieved documents for textual and image labels is comparable to the precision achieved by representing topics using term lists, demonstrating that labeling methods are an effective alternative topic representation.
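As an illustration of two notions the abstract relies on, the following minimal Python sketch shows how a term list could be derived from a topic's word distribution and how the precision of a set of retrieved documents could be computed. It is not the authors' implementation: the function names `top_n_terms` and `precision`, the toy probabilities, and the document identifiers are all hypothetical and chosen only for this example.

```python
from typing import Dict, List, Set


def top_n_terms(word_probs: Dict[str, float], n: int = 10) -> List[str]:
    """Return the n words with the highest P(word | topic)."""
    # Sort by probability (descending); break ties alphabetically for determinism.
    ranked = sorted(word_probs.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:n]]


def precision(retrieved: List[str], relevant: Set[str]) -> float:
    """Fraction of retrieved documents that are relevant (0.0 if none retrieved)."""
    if not retrieved:
        return 0.0
    hits = sum(1 for doc_id in retrieved if doc_id in relevant)
    return hits / len(retrieved)


# Hypothetical toy topic: these P(word | topic) values are invented for illustration.
toy_topic = {"election": 0.12, "vote": 0.09, "party": 0.07, "poll": 0.03, "tax": 0.01}
term_list = top_n_terms(toy_topic, n=3)

# Hypothetical retrieval outcome for one participant and one query.
retrieved_docs = ["d1", "d2", "d3", "d4"]
relevant_docs = {"d1", "d3", "d9"}
p = precision(retrieved_docs, relevant_docs)

print(term_list)
print(p)
```

In this sketch the term list is simply the highest-probability words of a single topic, and precision is the share of a participant's retrieved documents that are judged relevant to the query. How the study actually collected relevance judgments and aggregated precision across participants is not stated in the abstract.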
