Modeling Speech Acts in Asynchronous Conversations: A Neural-CRF Approach
Author(s) -
Shafiq Joty,
Tasnim Mohiuddin
Publication year - 2018
Publication title -
computational linguistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.314
H-Index - 98
eISSN - 1530-9312
pISSN - 0891-2017
DOI - 10.1162/coli_a_00339
Subject(s) - computer science , asynchronous communication , recurrent neural network , conditional random field , sentence , artificial intelligence , speech recognition , conversation , natural language processing , task (project management) , language model , adversarial system , artificial neural network , communication , computer network , management , sociology , economics
Participants in an asynchronous conversation e.g., forum, e-mail interact with each other at different times, performing certain communicative acts, called speech acts e.g., question, request. In this article, we propose a hybrid approach to speech act recognition in asynchronous conversations. Our approach works in two main steps: a long short-term memory recurrent neural network LSTM-RNN first encodes each sentence separately into a task-specific distributed representation, and this is then used in a conditional random field CRF model to capture the conversational dependencies between sentences. The LSTM-RNN model uses pretrained word embeddings learned from a large conversational corpus and is trained to classify sentences into speech act types. The CRF model can consider arbitrary graph structures to model conversational dependencies in an asynchronous conversation. In addition, to mitigate the problem of limited annotated data in the asynchronous domains, we adapt the LSTM-RNN model to learn from synchronous conversations e.g., meetings, using domain adversarial training of neural networks. Empirical evaluation shows the effectiveness of our approach over existing ones: i LSTM-RNNs provide better task-specific representations, ii conversational word embeddings benefit the LSTM-RNNs more than the off-the-shelf ones, iii adversarial training gives better domain-invariant representations, and iv the global CRF model improves over local models.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom