A Flexible Scheme for Assigning Timing and Pitch To Synthetic Speech
Author(s) -
Ian H. Witten
Publication year - 1977
Publication title -
language and speech
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.713
H-Index - 52
eISSN - 1756-6053
pISSN - 0023-8309
DOI - 10.1177/002383097702000306
Subject(s) - prosody , computer science , scheme (mathematics) , utterance , speech recognition , simple (philosophy) , string (physics) , speech synthesis , point (geometry) , natural (archaeology) , mathematics , mathematical analysis , philosophy , geometry , archaeology , epistemology , mathematical physics , history
A flexible scheme for assigning timing and pitch to synthetic speech is described. The scheme is designed to provide sufficiently adequate and natural control of the prosody of utterances for people to be able to use the speech which it generates as a simple but effective tool for man-machine interaction. The basic prosodic features of the utterance must be marked in the phonemic input string, while more global features are defined by parameters which can be changed as and when desired. This combines a simple input format with the capability of synthesizing many of the nuances encountered when connected passages are spoken in a rich and natural way. Since the scheme has been implemented on a computer, many decisions about precise details of pitch and timing have had to be made, usually in the absence of proper experimental justification. Each point of detail is discussed in the paper, so that the procedure is described sufficiently fully to enable it to be replicated in other research laboratories.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom