z-logo
open-access-imgOpen Access
Word Sieve: A Method for Real-Time Context Extraction
Author(s) -
Travis Bauer,
David Leake
Publication year - 2001
Publication title -
lecture notes in computer science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
ISBN - 3-540-42379-6
DOI - 10.1007/3-540-44607-9_3
Subject(s) - computer science , search engine indexing , information retrieval , context (archaeology) , automatic indexing , word (group theory) , tf–idf , task (project management) , information extraction , term (time) , index (typography) , document retrieval , world wide web , paleontology , linguistics , philosophy , physics , management , economics , biology , quantum mechanics
In order to be useful, intelligent information retrieval agents must provide their users with context-relevant information. This paper presents WordSieve, an algorithm for automatically extracting information about the context in which documents are consulted during web browsing. Using information extracted from the stream of documents consulted by the user, WordSieve automatically builds context profiles which differentiate sets of documents that users tend to access in groups. These profiles are used in a research-aiding system to index documents consulted in the current context and pro-actively suggest them to users in similar future contexts. In initial experiments on the capability to match documents to the task contexts in which they were consulted, WordSieve indexing outperformed indexing based on Term Frequency/Inverse Document Frequency, a common document indexing approach for intelligent agents in information retrieval.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom