Optimal Tag Sets for Automatic Image Annotation
Author(s) -
Seán Moran,
Victor Lavrenko
Publication year - 2011
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5244/c.25.1
Subject(s) - computer science , kernel (algebra) , precision and recall , set (abstract data type) , artificial intelligence , pattern recognition (psychology) , image retrieval , feature (linguistics) , image (mathematics) , relevance (law) , representation (politics) , tree (set theory) , automatic image annotation , data mining , mathematics , mathematical analysis , linguistics , philosophy , combinatorics , politics , political science , law , programming language
In this paper we introduce a new form of the Continuous Relevance Model (the BSCRM) that captures the correlation between tags in a formal and consistent manner. We apply a beam search algorithm to find a near optimal set of mutually correlated tags for an image in a time that is linear in the depth of the search tree. We conduct an examination of the model performance under different kernels for the representation of the image feature distributions and suggest a method of adapting the kernel to the dataset. BS-CRM with a Minkowski kernel is found to significantly increase recall by 42% and precision by 38% over the original CRM model and outperforms more recent baselines on the standard Corel 5k dataset.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom