Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning | Zendy

Shay B. Cohen | Zendy; Noah A. Smith | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning

Author(s) -

Shay B. Cohen,

Noah A. Smith

Publication year - 2011

Publication title -

computational linguistics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.314

H-Index - 98

eISSN - 1530-9312

pISSN - 0891-2017

DOI - 10.1162/coli_a_00092

Subject(s) - sample complexity , probabilistic logic , computer science , rule based machine translation , empirical risk minimization , machine learning , artificial intelligence , maximization , structural risk minimization , generative grammar , computational complexity theory , minification , mathematical optimization , mathematics , algorithm , artificial neural network , programming language

Probabilistic grammars are generative statistical models that are useful for compositional and sequential structures. They are used ubiquitously in computational linguistics. We present a framework, reminiscent of structural risk minimization, for empirical risk minimization of probabilistic grammars using the log-loss. We derive sample complexity bounds in this framework that apply both to the supervised setting and the unsupervised setting. By making assumptions about the underlying distribution that are appropriate for natural language scenarios, we are able to derive distribution-dependent sample complexity bounds for probabilistic grammars. We also give simple algorithms for carrying out empirical risk minimization using this framework in both the supervised and unsupervised settings. In the unsupervised case, we show that the problem of minimizing empirical risk is NP-hard. We therefore suggest an approximate algorithm, similar to expectation-maximization, to minimize the empirical risk.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research