Many Languages, One Parser | Zendy

Waleed Ammar | Zendy; George Mulcaire | Zendy; Miguel Ballesteros | Zendy; Chris Dyer | Zendy; Noah A. Smith | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Many Languages, One Parser

Author(s) -

Waleed Ammar,

George Mulcaire,

Miguel Ballesteros,

Chris Dyer,

Noah A. Smith

Publication year - 2016

Publication title -

transactions of the association for computational linguistics

Language(s) - English

Resource type - Journals

ISSN - 2307-387X

DOI - 10.1162/tacl_a_00109

Subject(s) - treebank , computer science , natural language processing , parsing , artificial intelligence , bottom up parsing , top down parsing

We train one multilingual model for dependency parsing and use it to parse sentences in several languages. The parsing model uses (i) multilingual word clusters and embeddings; (ii) token-level language information; and (iii) language-specific features (fine-grained POS tags). This input representation enables the parser not only to parse effectively in multiple languages, but also to generalize across languages based on linguistic universals and typological similarities, making it more effective to learn from limited annotations. Our parseru0027s performance compares favorably to strong baselines in a range of data scenarios, including when the target language has a large treebank, a small treebank, or no treebank for training.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research