Author(s) - Yen-Liang Shue, Patricia Keating, Chad Vicenik
Publication year - 2009
Publication title - The Journal of the Acoustical Society of America
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.619
H-Index - 187
eISSN - 1520-8524
pISSN - 0001-4966
DOI - 10.1121/1.3248865
Subject(s) - formant, MATLAB, computer science, harmonic, fast Fourier transform, speech recognition, amplitude, cepstrum, acoustics, energy (signal processing), algorithm, mathematics, physics, programming language, statistics, optics, vowel
VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millisecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT TextGrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...
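As an illustration of what an uncorrected H1-H2 measure involves, here is a minimal Python sketch. It is not VOICESAUCE's implementation: the function names, the three-pitch-period frame length, and the use of a single-frequency DFT at the harmonic frequencies are all assumptions made for this example. It takes a frame spanning a whole number of pitch periods, estimates the amplitudes at F0 and 2·F0, and reports their difference in dB. The formant-based corrections of Iseli et al. are not applied.

```python
import math


def harmonic_magnitude_db(frame, sample_rate, freq_hz):
    """Return the magnitude in dB of a single-frequency DFT at freq_hz.

    Hypothetical helper for illustration; amplitude is scaled so that a
    unit-amplitude sinusoid over whole periods yields about 0 dB.
    """
    n = len(frame)
    step = 2.0 * math.pi * freq_hz / sample_rate
    real = sum(x * math.cos(step * i) for i, x in enumerate(frame))
    imag = -sum(x * math.sin(step * i) for i, x in enumerate(frame))
    amplitude = 2.0 * math.hypot(real, imag) / n
    return 20.0 * math.log10(max(amplitude, 1e-12))  # floor avoids log(0)


def h1_minus_h2(frame, sample_rate, f0_hz):
    """Uncorrected H1-H2: level at F0 minus level at 2*F0, in dB."""
    h1 = harmonic_magnitude_db(frame, sample_rate, f0_hz)
    h2 = harmonic_magnitude_db(frame, sample_rate, 2.0 * f0_hz)
    return h1 - h2


# Synthetic check signal (assumed values): F0 = 200 Hz at 16 kHz,
# second harmonic at half the amplitude of the first.
SAMPLE_RATE = 16000
F0_HZ = 200.0
PERIOD_SAMPLES = int(SAMPLE_RATE / F0_HZ)  # 80 samples per pitch period
frame = [
    math.sin(2.0 * math.pi * F0_HZ * i / SAMPLE_RATE)
    + 0.5 * math.sin(2.0 * math.pi * 2.0 * F0_HZ * i / SAMPLE_RATE)
    for i in range(3 * PERIOD_SAMPLES)  # frame of three pitch periods
]

difference_db = h1_minus_h2(frame, SAMPLE_RATE, F0_HZ)
```

Because the frame covers a whole number of pitch periods, the two harmonics do not leak into each other's estimates, so for this synthetic signal the difference comes out close to 20·log10(2), roughly 6 dB. On real speech the F0 estimate is imperfect, and a practical tool would typically search for the spectral peak near each expected harmonic rather than evaluate at the exact multiple.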