z-logo
open-access-imgOpen Access
Document dependent fusion in multimodal music retrieval
Author(s) -
Zhonghua Li,
Bingjun Zhang,
Ye Wang
Publication year - 2011
Publication title -
proceedings of the 30th acm international conference on multimedia
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2072298.2071949
Subject(s) - computer science , fusion , information retrieval , process (computing) , modalities , sensor fusion , data mining , artificial intelligence , social science , philosophy , linguistics , sociology , operating system
In this paper, we propose a novel multimodal fusion framework, document dependent fusion (DDF), which derives the optimal combination strategy for each individual document in the fusion process. For each document, we derive a document weight vector by estimating the descriptive abilities of its different modalities. The document weight vector also enables our framework to be easily integrated with existing multimodal fusion schemes, and achieve a better combination strategy for each document given a query. Experiments are conducted on a 17174-song music database to compare the retrieval accuracy of traditional query independent fusion and query dependent fusion approaches, and that obtained after integrating DDF with them. Experimental results indicate that DDF can significantly improve the retrieval performance of current fusion approaches.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom