A Latent Variable Model Approach to PMI-based Word Embeddings
Author(s) -
Sanjeev Arora,
Yuanzhi Li,
Yingyu Liang,
Tengyu Ma,
Andrej Risteski
Publication year - 2016
Publication title -
Transactions of the Association for Computational Linguistics
Language(s) - English
Resource type - Journals
ISSN - 2307-387X
DOI - 10.1162/tacl_a_00106
Subject(s) - computer science , artificial intelligence , natural language processing , word2vec , generative model , latent variable , latent variable model , hyperparameter , embedding
Abstract - Semantic word embeddings represent the meaning of a word via a vector, and are created by diverse methods. Many use nonlinear operations on co-occurrence statistics, and have hand-tuned hyperparameters and reweighting methods. This paper proposes a new generative model, a dynamic version of the log-linear topic model of Mnih and Hinton (2007). The methodological novelty is to use the prior to compute closed form expressions for word statistics. This provides a theoretical justification for nonlinear models like PMI, word2vec, and GloVe, as well as some hyperparameter choices. It also helps explain why low-dimensional semantic embeddings contain linear algebraic structure that allows solution of word analogies, as shown by Mikolov et al. (2013a) and many subsequent papers. Experimental support is provided for the generative model assumptions, the most important of which is that latent word vectors are fairly uniformly dispersed in space.