Dynamic Language Models for Streaming Text
Author(s) -
Dani Yogatama,
Chong Wang,
Bryan Routledge,
Noah A. Smith,
Eric P. Xing
Publication year - 2014
Publication title -
transactions of the association for computational linguistics
Language(s) - English
Resource type - Journals
ISSN - 2307-387X
DOI - 10.1162/tacl_a_00175
Subject(s) - computer science , language model , dynamics (music) , context (archaeology) , probabilistic logic , social media , scalability , streaming data , task (project management) , artificial intelligence , natural language processing , world wide web , data mining , paleontology , physics , management , database , acoustics , economics , biology
We present a probabilistic language model that captures temporal dynamics and conditions on arbitrary non-linguistic context features. These context features serve as important indicators of language changes that are otherwise difficult to capture using text data by itself. We learn our model in an efficient online fashion that is scalable for large, streaming data. With five streaming datasets from two different genres—economics news articles and social media—we evaluate our model on the task of sequential language modeling. Our model consistently outperforms competing models.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom