

Open Access

Outcome measures based on classification performance fail to predict the intelligibility of binary-masked speech

Author(s) - Abigail A. Kressner, Tobias May, Christopher J. Rozell

Publication year - 2016

Publication title - The Journal of the Acoustical Society of America

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.619

H-Index - 187

eISSN - 1520-8524

pISSN - 0001-4966

DOI - 10.1121/1.4952439

Subject(s) - intelligibility (philosophy), binary number, computer science, word error rate, speech recognition, false alarm, constant false alarm rate, algorithm, artificial intelligence, mathematics, arithmetic, philosophy, epistemology

To date, the most commonly used outcome measure for assessing ideal binary mask estimation algorithms is based on the difference between the hit rate and the false alarm rate (H-FA). Recently, the error distribution has been shown to substantially affect intelligibility. However, H-FA treats each mask unit independently and does not take into account how errors are distributed. Alternatively, algorithms can be evaluated with the short-time objective intelligibility (STOI) metric using the reconstructed speech. This study investigates the ability of H-FA and STOI to predict intelligibility for binary-masked speech using masks with different error distributions. The results demonstrate the inability of H-FA to predict the behavioral intelligibility and also illustrate the limitations of STOI. Since every estimation algorithm will make errors that are distributed in different ways, performance evaluations should not be made solely on the basis of these metrics.
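
As a rough illustration of why H-FA cannot see how errors are distributed, the sketch below computes the hit rate, the false alarm rate, and their difference for two estimated masks measured against the same ideal mask. It assumes the usual definitions (hit rate: fraction of ideal-mask 1s that the estimate also labels 1; false alarm rate: fraction of ideal-mask 0s that the estimate labels 1). The function name `hit_minus_false_alarm` and the 4x4 toy masks are hypothetical and are not taken from the study.

```python
def hit_minus_false_alarm(ideal, estimated):
    """Return (H, FA, H - FA) for two binary masks of the same shape.

    Masks are lists of rows (frequency channels) of 0/1 values (time frames).
    Each mask unit is counted on its own, so the position of an error
    has no influence on the result.
    """
    hits = misses = false_alarms = correct_rejections = 0
    for ideal_row, est_row in zip(ideal, estimated):
        for i, e in zip(ideal_row, est_row):
            if i == 1 and e == 1:
                hits += 1
            elif i == 1:
                misses += 1
            elif e == 1:
                false_alarms += 1
            else:
                correct_rejections += 1
    if hits + misses == 0 or false_alarms + correct_rejections == 0:
        raise ValueError("ideal mask must contain both 1s and 0s")
    h = hits / (hits + misses)
    fa = false_alarms / (false_alarms + correct_rejections)
    return h, fa, h - fa


# Hypothetical toy masks: 8 target-dominated units (1) and 8 noise-dominated units (0).
ideal = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

# Two misses and two false alarms, each pair in adjacent units.
clustered = [
    [0, 0, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 1, 1],
]

# The same number of misses and false alarms, spread across the mask.
scattered = [
    [0, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 1, 1, 0],
]

print(hit_minus_false_alarm(ideal, clustered))  # (0.75, 0.25, 0.5)
print(hit_minus_false_alarm(ideal, scattered))  # (0.75, 0.25, 0.5)
```

Both estimated masks receive the same H-FA score even though their errors are arranged differently, which is the property the abstract identifies as a limitation of the measure.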
