
What’s the Matter? Knowledge Acquisition by Unsupervised Multi-Topic Labeling for Spoken Utterances
Author(s) -
Sebastian Weigelt
Publication year - 2020
Publication title -
international journal of humanized computing and communication
Language(s) - English
Resource type - Journals
ISSN - 2641-953X
DOI - 10.35708/hcc1868-126364
Subject(s) - computer science , utterance , natural language processing , artificial intelligence , spoken language , classifier (uml) , task (project management) , word (group theory) , linguistics , philosophy , management , economics
Systems such as Alexa, Cortana, and Siri appear rather smart.However, they only react to predefined wordings and do not actuallygrasp the user’s intent. To overcome this limitation, a system must understand the topics the user is talking about. Therefore, we apply unsupervised multi-topic labeling to spoken utterances. Although topic labeling is a well-studied task on textual documents, its potential for spokeninput is almost unexplored. Our approach for topic labeling is tailoredto spoken utterances; it copes with short and ungrammatical input.The approach is two-tiered. First, we disambiguate word senses. We utilize Wikipedia as pre-labeled corpus to train a naïve-bayes classifier.Second, we build topic graphs based on DBpedia relations. We use twostrategies to determine central terms in the graphs, i.e. the shared topics. One focuses on the dominant senses in the utterance and the othercovers as many distinct senses as possible. Our approach creates multipledistinct topics per utterance and ranks results.The evaluation shows that the approach is feasible; the word sense disambiguation achieves a recall of 0.799. Concerning topic labeling, in a userstudy subjects assessed that in 90.9% of the cases at least one proposedtopic label among the first four is a good fit. With regard to precision,the subjects judged that 77.2% of the top ranked labels are a good fit orgood but somewhat too broad (Fleiss’ kappa κ = 0.27).We illustrate areas of application of topic labeling in the field of programming in spoken language. With topic labeling applied to the spokeninput as well as ontologies that model the situational context we are ableto select the most appropriate ontologies with an F1-score of 0.907.