Better tags give better trees – or do they? | Zendy

Ines Rehbein | Zendy; Hagen Hirschmann | Zendy; Anke Lüdeling | Zendy; Marc Reznicek | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Better tags give better trees – or do they?

Author(s) -

Ines Rehbein,

Hagen Hirschmann,

Anke Lüdeling,

Marc Reznicek

Publication year - 2012

Publication title -

linguistic issues in language technology

Language(s) - English

Resource type - Journals

eISSN - 1945-3590

pISSN - 1945-3604

DOI - 10.33011/lilt.v7i.1279

Subject(s) - parsing , computer science , natural language processing , annotation , artificial intelligence , bottom up parsing , top down parsing , task (project management) , management , economics

Parsing learner data poses a great challenge for standard tools, since non-canonical and unusual structures may lead to wrong interpretations on the part of the taggers and parsers. It is well known that providing a statistical parser with perfect part-of-speech (POS) tags is of great benefit for parsing accuracy, and that parsing results can decrease considerably when the parser has to predict its own POS tags. Therefore one might expect that even small improvements in POS accuracy have a positive effect on parsing performance. In this paper we test this assumption and assess the impact of POS tag accuracy on constituency parsing for German learner language. We compare different strategies to manual correction of the learner text and specific POS tags, and we measure the time requirements for each strategy. We show that tagging a canonical equivalent of the non-canonical learner text substantially improves POS tag accuracy. Correcting selected POS tags can only lead to parsing results comparable to a setting where all POS tags are corrected, while reducing annotation time substantially. However, the manual corrections of the POS tags do not result in a statistically significant improvement for parsing, giving evidence for the high quality of the automatically predicted parts-of-speech for the corrected learner data.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research