EXPLOITING SUBTREES IN AUTO‐PARSED DATA TO IMPROVE DEPENDENCY PARSING | Zendy

Chen Wenliang | Zendy; Kazama Jun’ichi | Zendy; Uchimoto Kiyotaka | Zendy; Torisawa Kentaro | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

EXPLOITING SUBTREES IN AUTO‐PARSED DATA TO IMPROVE DEPENDENCY PARSING

Author(s) -

Chen Wenliang,

Kazama Jun’ichi,

Uchimoto Kiyotaka,

Torisawa Kentaro

Publication year - 2012

Publication title -

computational intelligence

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.353

H-Index - 52

eISSN - 1467-8640

pISSN - 0824-7935

DOI - 10.1111/j.1467-8640.2012.00451.x

Subject(s) - treebank , computer science , parsing , artificial intelligence , dependency grammar , dependency (uml) , natural language processing , bottom up parsing , top down parsing

Dependency parsing has attracted considerable interest from researchers and developers in natural language processing. However, to obtain a high‐accuracy dependency parser, supervised techniques require a large volume of hand‐annotated data, which are extremely expensive. This paper presents a simple and effective approach for improving dependency parsing with subtrees derived from unannotated data, which are easy to obtain. First, we use a baseline parser to parse large‐scale unannotated data. Then, we extract subtrees from dependency parse trees in the auto‐parsed data. Next, the extracted subtrees are classified into several sets according to their frequency. Finally, we design new features based on the subtree sets for parsing algorithms. To demonstrate the effectiveness of our proposed approach, we conduct experiments on the English Penn Treebank and Chinese Penn Treebank. The results show that our approach significantly outperforms baseline systems. It also achieves the best accuracy for the Chinese data and an accuracy competitive with the best known systems for the English data.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research