z-logo
open-access-imgOpen Access
Machine-Aided Formant Determination for Speech Synthesis
Author(s) -
William J. Strong
Publication year - 1966
Publication title -
the journal of the acoustical society of america
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.619
H-Index - 187
eISSN - 1520-8524
pISSN - 0001-4966
DOI - 10.1121/1.2143463
Subject(s) - formant , sequence (biology) , band pass filter , speech synthesis , computer science , filter (signal processing) , range (aeronautics) , digital filter , trigonometric functions , section (typography) , operator (biology) , speech recognition , acoustics , mathematics , electronic engineering , engineering , physics , vowel , repressor , aerospace engineering , chemistry , biology , genetics , operating system , biochemistry , geometry , transcription factor , computer vision , gene
HIS paper describes a method for speech analysis that is an outgrowth of an attempt to do speech synthesis by rule using a terminal analog synthesizer. In speech synthesis by rule, the rules accept an input string of phonemes and, based on the input string, generate control parameters that can then be used to control a speech synthesizer. The synthetic speech produced by the synthesizer can be subjected to various measures to determine its validity, but the ear must be the final criterion. However, the ear does not indicate in an explicit way wherein the control parameters may be improved. We feel, therefore, that being able to compare control data generated by rules with those extracted from the real speech of some talker can be a useful guide. Our approach is similar to that of Holmes, Mattingly, and Shearme, • who started with a set rules and then modified these rules, guided by spectrographic analysis and listening. However, a basic difference between our approach and that of Holmes et al. is that, in their case, one knows the general bounds of the control parameters and tries to write and modify rules that will generate satisfactory control parameters within these bounds; whereas in our case, one uses analysis to derive detailed control parameters and tries to write rules that will generate these parameters. (The latter scheme has the characteristic of being closely related to the particular speaker whose speech is analyzed.) To implement our scheme, it was necessary to perform some extensive

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom