Performance Evaluation of Selected Cost Functions in Non Negative Matrix Factorization Based Decomposition of Acoustic Mixture
Author(s) - Adeoluwawale Adewusi, Kamoli Akinwale Amusa, Aliu S. Are
Publication year - 2019
Publication title - FUOYE Journal of Engineering and Technology
Language(s) - English
Resource type - Journals
eISSN - 2579-0625
pISSN - 2579-0617
DOI - 10.46792/fuoyejet.v4i1.320
Subject(s) - non negative matrix factorization, monaural, matrix decomposition, divergence (linguistics), source separation, computer science, euclidean distance, matrix (chemical analysis), blind signal separation, speech recognition, mathematics, algorithm, pattern recognition (psychology), artificial intelligence, channel (broadcasting), computer network, linguistics, eigenvalues and eigenvectors, physics, philosophy, materials science, composite material, quantum mechanics
When several audio sources are active simultaneously, their acoustic signals interact, and co-occurring sounds disturb the estimation of any individual source. Data decomposition is therefore one of the core tasks in monaural source separation. In the semi-supervised learning approach in particular, a viable means of achieving it is Non-negative Matrix Factorization (NMF). Owing to a paucity of information on the application of this method, especially in speech systems, this study evaluates selected cost functions in NMF-based monaural speech decomposition. A generalized gradient descent algorithm is derived for the minimization, and three cost functions (Euclidean distance, Kullback-Leibler divergence and Itakura-Saito divergence) are applied to the derived NMF separation algorithm. The cost functions are evaluated on experimental data, and the performance of each is assessed by its cost values and convergence rate. The Itakura-Saito divergence outperforms the other two cost functions for a given number of iterations and number of channels.
Keywords - cost functions, non-negative matrix factorization, speech separation, evaluation
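To make the comparison described in the abstract concrete, the following is a minimal sketch of NMF under the three named cost functions. It is not the authors' derived algorithm: it assumes the widely used multiplicative update rules for the beta-divergence family (beta = 2 for Euclidean distance, beta = 1 for Kullback-Leibler, beta = 0 for Itakura-Saito), which can be read as gradient descent with an adaptive step size. The names `beta_divergence`, `nmf`, `rank`, `n_iter` and `eps` are hypothetical, and the input here is a random non-negative matrix standing in for a magnitude spectrogram, not the paper's experimental data.

```python
import numpy as np


def beta_divergence(V, V_hat, beta):
    """Sum of element-wise divergences between V and its approximation V_hat."""
    if beta == 2:  # squared Euclidean distance (with the usual 1/2 factor)
        return 0.5 * np.sum((V - V_hat) ** 2)
    if beta == 1:  # generalized Kullback-Leibler divergence
        return np.sum(V * np.log(V / V_hat) - V + V_hat)
    if beta == 0:  # Itakura-Saito divergence
        ratio = V / V_hat
        return np.sum(ratio - np.log(ratio) - 1.0)
    raise ValueError("beta must be 0, 1, or 2")


def nmf(V, rank, beta, n_iter=100, seed=0, eps=1e-12):
    """Factorize a strictly positive matrix V as W @ H (assumed update rules).

    Returns W, H and the cost recorded after every iteration, so that
    cost values and convergence behaviour can be inspected per cost function.
    """
    rng = np.random.default_rng(seed)
    n_rows, n_cols = V.shape
    W = rng.random((n_rows, rank)) + eps
    H = rng.random((rank, n_cols)) + eps
    costs = []
    for _ in range(n_iter):
        # Update H with W fixed.
        V_hat = W @ H + eps
        H *= (W.T @ (V * V_hat ** (beta - 2))) / (W.T @ V_hat ** (beta - 1) + eps)
        # Update W with the new H fixed.
        V_hat = W @ H + eps
        W *= ((V * V_hat ** (beta - 2)) @ H.T) / (V_hat ** (beta - 1) @ H.T + eps)
        costs.append(beta_divergence(V, W @ H + eps, beta))
    return W, H, costs


if __name__ == "__main__":
    # Synthetic stand-in for a magnitude spectrogram (frequency bins x frames).
    V = np.random.default_rng(1).random((64, 40)) + 0.1
    for name, beta in [("Euclidean", 2), ("Kullback-Leibler", 1), ("Itakura-Saito", 0)]:
        _, _, costs = nmf(V, rank=4, beta=beta, n_iter=50)
        print(f"{name}: first cost {costs[0]:.4f}, last cost {costs[-1]:.4f}")
```

The three cost values are on different scales, so comparing them directly says little; the per-iteration cost trace is what allows convergence behaviour to be compared across cost functions.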