User-Adaptive A Posteriori Restoration for Incorrectly Segmented Utterances in Spoken Dialogue Systems
Author(s) -
Kazunori Komatani,
N. Hotta,
Satoshi Sato,
Mikio Nakano
Publication year - 2017
Publication title -
dialogue and discourse
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.25
H-Index - 5
ISSN - 2152-9620
DOI - 10.5087/dad.2017.209
Subject(s) - computer science , thresholding , a priori and a posteriori , focus (optics) , maximum a posteriori estimation , speech recognition , adaptation (eye) , artificial intelligence , natural language processing , machine learning , maximum likelihood , mathematics , statistics , psychology , philosophy , physics , epistemology , neuroscience , optics , image (mathematics)
Ideally, the users of spoken dialogue systems should be able to speak at their own tempo. Thus, the systems needs to interpret utterances from various users correctly, even when the utterances contain pauses. In response to this issue, we propose an approach based on a posteriori restoration for incorrectly segmented utterances. A crucial part of this approach is to determine whether restoration is required. We use a classication-based approach, adapted to each user. We focus on each user’s dialogue tempo, which can be obtained during the dialogue, and determine the correlation between each user’s tempo and the appropriate thresholds for classication. A linear regression function used to convert the tempos into thresholds is also derived. Experimental results show that the proposed user adaptation approach applied to two restoration classication methods, thresholding and decision trees, improves classication accuracies by 3.0% and 7.4%, respectively, in cross validation.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom