CLIR Experiments at Maryland for TREC-2002: Evidence Combination for Arabic-English Retrieval
Author(s) -
Kareem Darwish,
Douglas W. Oard
Publication year - 2003
Publication title -
digital repository at the university of maryland (university of maryland college park)
Language(s) - English
Resource type - Reports
DOI - 10.21236/ada452814
Subject(s) - arabic , natural language processing , computer science , information retrieval , artificial intelligence , linguistics , philosophy
The focus of the experiments reported in this paper was techniques for combining evidence for crosslanguage retrieval, searching Arabic documents using English queries. Evidence from multiple sources of translation knowledge was combined to estimate translation probabilities, and four techniques for estimating query-language term weights from document-language evidence were tried. A new technique that exploits translation probability information was found to outperform a comparable technique in which that information was not used. Comparative results for three variants of Arabic “light” stemming are also presented. A simple variant of an existing stemming algorithm was found to result in significantly better retrieval effectiveness.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom