z-logo
open-access-imgOpen Access
Sparse and structured decomposition of audio signals on hybrid dictionaries using musical priors
Author(s) -
Hélène Papadopoulos,
Matthieu Kowalski
Publication year - 2013
Publication title -
the journal of the acoustical society of america
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.619
H-Index - 187
eISSN - 1520-8524
pISSN - 0001-4966
DOI - 10.1121/1.4807821
Subject(s) - computer science , prior probability , interpretability , audio signal , speech recognition , chord (peer to peer) , sparse approximation , polyphony , sound quality , signal (programming language) , representation (politics) , orthonormal basis , audio analyzer , pattern recognition (psychology) , artificial intelligence , audio signal processing , acoustics , speech coding , bayesian probability , distributed computing , physics , political science , law , programming language , quantum mechanics , politics
This paper investigates the use of musical priors for sparse expansion of audio signals of music, on an overcomplete dual-resolution dictionary taken from the union of two orthonormal bases that can describe both transient and tonal components of a music audio signal. More specifically, chord and metrical structure information are used to build a structured model that takes into account dependencies between coefficients of the decomposition, both for the tonal and for the transient layer. The denoising task application is used to provide a proof of concept of the proposed musical priors. Several configurations of the model are analyzed. Evaluation on monophonic and complex polyphonic excerpts of real music signals shows that the proposed approach provides results whose quality measured by the signal-to-noise ratio is competitive with state-of-the-art approaches, and more coherent with the semantic content of the signal. A detailed analysis of the model in terms of sparsity and in terms of interpretability of the representation is also provided and shows that the model is capable of giving a relevant and legible representation of Western tonal music audio signals.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom