
An Efficient Time‐Frequency Representation for Parametric‐Based Audio Object Coding
Author(s) -
Beack Seungkwon,
Lee Taejin,
Kim Minje,
Kang Kyeongok
Publication year - 2011
Publication title -
ETRI Journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.295
H-Index - 46
eISSN - 2233-7326
pISSN - 1225-6463
DOI - 10.4218/etrij.11.0211.0007
Subject(s) - sound quality , coding (social sciences) , computer science , interactivity , parametric statistics , object based , digital audio , sub band coding , mpeg 4 , speech coding , speech recognition , audio signal , representation (politics) , object (grammar) , multimedia , artificial intelligence , mathematics , statistics , politics , political science , law
Object‐based audio coding can bring interactivity to new music applications. To compress many target audio objects efficiently, MPEG Spatial Audio Object Coding (SAOC) adopts a subband‐based parametric coding scheme. This letter investigates the time‐frequency (T/F) subband analysis structure of that scheme and proposes a reconfigured T/F structure that improves the rendering of sound scenes such as ‘karaoke’ and ‘solo’ play in interactive music scenarios. Experimental results confirm that the proposed scheme substantially improves both the SNR and the sound quality.
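As a rough illustration of the ideas the abstract names, the sketch below shows how per-band object powers could drive a ‘karaoke’ gain on a mono downmix, and how an SNR in dB could be computed between a reference and a rendered signal. It is not the authors' method or the MPEG SAOC algorithm: the function names `karaoke_gains` and `snr_db`, the Wiener-style gain, and the assumption of mutually uncorrelated objects are all illustrative choices not taken from the letter.

```python
import math


def snr_db(reference, estimate):
    """Signal-to-noise ratio in dB between a reference and an estimate.

    Hypothetical helper: treats (reference - estimate) as the noise.
    """
    signal_power = sum(r * r for r in reference)
    noise_power = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    if noise_power == 0.0:
        return math.inf          # perfect reconstruction
    if signal_power == 0.0:
        return -math.inf         # silent reference, non-zero error
    return 10.0 * math.log10(signal_power / noise_power)


def karaoke_gains(object_powers, vocal_index):
    """Per-band gains that suppress one object in a mono downmix.

    object_powers[i][b] is the power of object i in parameter band b.
    Assumes the objects are mutually uncorrelated (a simplification),
    so the band power of the downmix is the sum of the object powers.
    """
    num_bands = len(object_powers[0])
    gains = []
    for b in range(num_bands):
        total = sum(p[b] for p in object_powers)
        vocal = object_powers[vocal_index][b]
        # Wiener-style gain: share of the band power that is NOT vocal.
        gains.append(0.0 if total == 0.0 else (total - vocal) / total)
    return gains
```

In this simplified picture, a finer T/F grid means more bands (and more frequent parameter updates) in `object_powers`, so each gain describes a smaller tile of the downmix. A ‘solo’ scene would use the complementary gain, that is, the vocal share of each band.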