Modeling Speech Acts in Asynchronous Conversations: A Neural-CRF Approach | Zendy

Shafiq Joty | Zendy; Tasnim Mohiuddin | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Modeling Speech Acts in Asynchronous Conversations: A Neural-CRF Approach

Author(s) -

Shafiq Joty,

Tasnim Mohiuddin

Publication year - 2018

Publication title -

computational linguistics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.314

H-Index - 98

eISSN - 1530-9312

pISSN - 0891-2017

DOI - 10.1162/coli_a_00339

Subject(s) - computer science , asynchronous communication , recurrent neural network , conditional random field , sentence , artificial intelligence , speech recognition , conversation , natural language processing , task (project management) , language model , adversarial system , artificial neural network , communication , computer network , management , sociology , economics

Participants in an asynchronous conversation e.g., forum, e-mail interact with each other at different times, performing certain communicative acts, called speech acts e.g., question, request. In this article, we propose a hybrid approach to speech act recognition in asynchronous conversations. Our approach works in two main steps: a long short-term memory recurrent neural network LSTM-RNN first encodes each sentence separately into a task-specific distributed representation, and this is then used in a conditional random field CRF model to capture the conversational dependencies between sentences. The LSTM-RNN model uses pretrained word embeddings learned from a large conversational corpus and is trained to classify sentences into speech act types. The CRF model can consider arbitrary graph structures to model conversational dependencies in an asynchronous conversation. In addition, to mitigate the problem of limited annotated data in the asynchronous domains, we adapt the LSTM-RNN model to learn from synchronous conversations e.g., meetings, using domain adversarial training of neural networks. Empirical evaluation shows the effectiveness of our approach over existing ones: i LSTM-RNNs provide better task-specific representations, ii conversational word embeddings benefit the LSTM-RNNs more than the off-the-shelf ones, iii adversarial training gives better domain-invariant representations, and iv the global CRF model improves over local models.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research