A Group Testing Framework for Similarity Search in High-dimensional Spaces
Author(s) -
Miaojing Shi,
Teddy Furon,
Hervé Jeǵou
Publication year - 2014
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2647868.2654895
Subject(s) - computer science , search engine indexing , curse of dimensionality , exploit , nearest neighbor search , set (abstract data type) , similarity (geometry) , information retrieval , group (periodic table) , encode , fraction (chemistry) , vector space , group testing , data mining , theoretical computer science , artificial intelligence , mathematics , image (mathematics) , biochemistry , organic chemistry , gene , programming language , combinatorics , chemistry , geometry , computer security
International audienceThis paper introduces a group testing framework for detecting large similarities between high-dimensional vectors, such as descriptors used in state-of-the-art description of multimedia documents. At the crossroad of multimedia information retrieval and signal processing, we produce a set of group representations that jointly encode several vectors into a single one, in the spirit of group testing approaches. By comparing a query vector to several of these intermediate representations, we screen the large values taken by the similarities between the query and all the vectors, at a fraction of the cost of exhaustive similarity calculation. Unlike concurrent indexing methods that suffer from the curse of dimensionality, our method exploits the properties of high-dimensional spaces. It therefore complements other strategies for approximate nearest neighbor search. Our preliminary experiments demonstrate the potential of group testing for searching large databases of multimedia objects represented by vectors. We obtain a large improvement in terms of the theoretical complexity, at the cost of a small or negligible decrease of accuracy. We hope that this preliminary work will pave the way to subsequent works for multimedia retrieval with limited resources
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom