TTS-Driven Synthetic Behaviour-Generation Model for Artificial Bodies
Author(s) -
Izidor Mlakar,
Zdravko Kačič,
Matej Rojc
Publication year - 2013
Publication title -
international journal of advanced robotic systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.394
H-Index - 46
eISSN - 1729-8814
pISSN - 1729-8806
DOI - 10.5772/56870
Subject(s) - naturalness , computer science , embodied cognition , gesture , conversation , intonation (linguistics) , perception , natural (archaeology) , movement (music) , robot , humanoid robot , speech synthesis , motion (physics) , meaning (existential) , asynchrony (computer programming) , human–computer interaction , phone , artificial intelligence , communication , psychology , linguistics , asynchronous communication , computer network , philosophy , physics , archaeology , quantum mechanics , neuroscience , psychotherapist , history , aesthetics
Visual perception, speech perception and the understanding of perceived information are linked through complex mental processes. Gestures, as part of visual perception and synchronized with verbal information, are a key concept of human social interaction. Even when there is no physical contact (e.g., a phone conversation), humans still tend to express meaning through movement. Embodied conversational agents (ECAs), as well as humanoid robots, are visual recreations of humans and are thus expected to be able to perform similar behaviour in communication. The behaviour generation system proposed in this paper is able to specify expressive behaviour strongly resembling natural movement performed within social interaction. The system is TTS-driven and fused with the time-and-space efficient TTS-engine, called ‘PLATTOS’. Visual content and content presentation is formulated based on several linguistic features that are extrapolated from arbitrary input text sequences and prosodic features (e.g., pitch, intonation, stress, emphasis, etc.), as predicted by several verbal modules in the system. According to the evaluation results, when using the proposed system the synchronized co-verbal behaviour can be recreated with a very high-degree of naturalness, either by ECAs or humanoid robots alike
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom