
Processing queries with metrical constraints in XML‐based IR systems
Author(s) -
Klein Shmuel T.
Publication year - 2007
Publication title -
journal of the american society for information science and technology
Language(s) - English
Resource type - Journals
eISSN - 1532-2890
pISSN - 1532-2882
DOI - 10.1002/asi.20734
Subject(s) - computer science , xml , information retrieval , document structure description , xml validation , efficient xml interchange , streaming xml , precision and recall , xml database , xml schema editor , xml schema (w3c) , xml encryption , database , world wide web
XML documents combine features from classical IR systems allowing free text, with explicit structures as in databases. Many query languages have been specially designed for IR applications on XML documents. This work concentrates on a special type of language for which the problem of processing queries including metrical constraints is investigated. The main question is how to define the distance between terms in different locations of the XML tree in an intuitively justifiable way, without jeopardizing the ability to get good retrieval results in terms of recall and precision. A new definition is given and its usefulness is shown on several examples from the INEX collection.