z-logo
open-access-imgOpen Access
Automatic Phrase Recognition and Extraction from Text
Author(s) -
Fergus Kelledy,
Alan F. Smeaton
Publication year - 1997
Publication title -
electronic workshops in computing
Language(s) - English
Resource type - Conference proceedings
ISSN - 1477-9358
DOI - 10.14236/ewic/ir1997.3
Subject(s) - computer science , phrase , natural language processing , artificial intelligence , search engine indexing , vocabulary , noun phrase , field (mathematics) , lexicon , automatic indexing , information retrieval , process (computing) , linguistics , philosophy , mathematics , noun , pure mathematics , operating system
One of the problems facing researchers in the field of Information Retrieval (IR) is that the search criteria used during retrieval (the query) contains terms which are very ambiguous and common. By this we mean that terms can have multiple meanings and occur in a large percentage of the documents in a text collection. Many approaches to addressing this problem have been tried with varying degrees of success. One approach to this problem is to attempt to make the vocabulary used by the IR system less ambiguous by using terms which occur only infrequently. In our case this is achieved through an automatic process of phrase recognition and the incorporation of these phrases into the lexicon of the indexing mechanism used. Unlike previous phrase recognition approaches based on NLP, our work requires no linguistic processing of the text in order to extract phrases but is comparable to what is called 'statistical phrases'. In this paper we describe experiments where we evaluate our phrase recognition on the TREC-4 and TREC-5 collections.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom