Unsupervised Grammar Induction with Depth-bounded PCFG
Author(s) -
Lifeng Jin,
Finale DoshiVelez,
Timothy A. Miller,
William Schuler,
Lane Schwartz
Publication year - 2018
Publication title -
transactions of the association for computational linguistics
Language(s) - English
Resource type - Journals
ISSN - 2307-387X
DOI - 10.1162/tacl_a_00016
Subject(s) - computer science , grammar induction , grammar , parsing , recursion (computer science) , bounding overwatch , artificial intelligence , probabilistic logic , natural language processing , context (archaeology) , space (punctuation) , treebank , limit (mathematics) , rule based machine translation , algorithm , linguistics , mathematics , paleontology , mathematical analysis , philosophy , biology , operating system
There has been recent interest in applying cognitively or empirically motivated bounds on recursion depth to limit the search space of grammar induction models (Ponvert et al., 2011; Noji and Johnson, 2016; Shain et al., 2016). This work extends this depth-bounding approach to probabilistic context-free grammar induction (DB-PCFG), which has a smaller parameter space than hierarchical sequence models, and therefore more fully exploits the space reductions of depth-bounding. Results for this model on grammar acquisition from transcribed child-directed speech and newswire text exceed or are competitive with those of other models when evaluated on parse accuracy. Moreover, grammars acquired from this model demonstrate a consistent use of category labels, something which has not been demonstrated by other acquisition models.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom