New Term Weighting Formulas for the Vector Space Method in Information Retrieval
Author(s) -
E. Chisholm,
Tamara G. Kolda
Publication year - 1999
Publication title -
osti oai (u.s. department of energy office of scientific and technical information)
Language(s) - English
Resource type - Reports
DOI - 10.2172/5698
Subject(s) - weighting , term (time) , vector space model , computer science , space (punctuation) , information retrieval , vector space , data mining , scheme (mathematics) , term discrimination , function (biology) , mathematics , search engine , web search query , concept search , physics , quantum mechanics , operating system , medicine , mathematical analysis , geometry , evolutionary biology , biology , radiology
The goal in information retrieval is to enable users to automatically and accurately find data relevant to their queries. One possible approach to this problem i use the vector space model, which models documents and queries as vectors in the term space. The components of the vectors are determined by the term weighting scheme, a function of the frequencies of the terms in the document or query as well as throughout the collection. We discuss popular term weighting schemes and present several new schemes that offer improved performance
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom