Open Access
Evaluating Low-Level Speech Features Against Human Perceptual Data
Author(s) - Caitlin Richter, Naomi H. Feldman, Harini Salgado, Aren Jansen
Publication year - 2017
Publication title - Transactions of the Association for Computational Linguistics
Language(s) - English
Resource type - Journals
ISSN - 2307-387X
DOI - 10.1162/tacl_a_00071
Subject(s) - normalization (sociology), computer science, perception, speech recognition, speech perception, speech processing, artificial intelligence, natural language processing, psychology, neuroscience, sociology, anthropology
We introduce a method for measuring the correspondence between low-level speech features and human perception, using a cognitive model of speech perception implemented directly on speech recordings. We evaluate two speaker normalization techniques using this method and find that in both cases, speech features that are normalized across speakers predict human data better than unnormalized speech features, consistent with previous research. Results further reveal differences across normalization methods in how well each predicts human data. This work provides a new framework for evaluating low-level representations of speech on their match to human perception, and lays the groundwork for creating more ecologically valid models of speech perception.
