z-logo
open-access-imgOpen Access
Investigating the performance of automatic new topic identification across multiple datasets
Author(s) -
Özmutlu H. Cenk,
Cavdur Fatih,
Spink Amanda,
Ozmutlu Seda
Publication year - 2006
Publication title -
proceedings of the american society for information science and technology
Language(s) - English
Resource type - Journals
eISSN - 1550-8390
pISSN - 0044-7870
DOI - 10.1002/meet.1450430129
Subject(s) - identification (biology) , computer science , search engine , information retrieval , continuation , artificial neural network , data mining , sample (material) , data science , machine learning , botany , biology , chemistry , chromatography , programming language
Recent studies on automatic new topic identification in Web search engine user sessions demonstrated that neural networks are successful in automatic new topic identification. However most of this work applied their new topic identification algorithms on data logs from a single search engine. In this study, we investigate whether the application of neural networks for automatic new topic identification are more successful on some search engines than others. Sample data logs from the Norwegian search engine FAST (currently owned by Overture) and Excite are used in this study. Findings of this study suggest that query logs with more topic shifts tend to provide more successful results on shift‐based performance measures, whereas logs with more topic continuations tend to provide better results on continuation‐based performance measures.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom