z-logo
open-access-imgOpen Access
Sampling Tree Fragments from Forests
Author(s) -
Tagyoung Chung,
Licheng Fang,
Daniel Gildea,
Daniel Štefankovič
Publication year - 2013
Publication title -
computational linguistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.314
H-Index - 98
eISSN - 1530-9312
pISSN - 0891-2017
DOI - 10.1162/coli_a_00170
Subject(s) - computer science , tree (set theory) , rule based machine translation , sampling (signal processing) , fractal tree index , grammar , set (abstract data type) , tree structure , markov chain monte carlo , regular tree grammar , theoretical computer science , artificial intelligence , machine translation , algorithm , interval tree , mathematics , generative grammar , combinatorics , phrase structure rules , programming language , binary tree , mildly context sensitive grammar formalism , bayesian probability , linguistics , philosophy , filter (signal processing) , computer vision
We study the problem of sampling trees from forests, in the setting where probabilities for each tree may be a function of arbitrarily large tree fragments. This setting extends recent work for sampling to learn Tree Substitution Grammars to the case where the tree structure TSG derived tree is not fixed. We develop a Markov chain Monte Carlo algorithm which corrects for the bias introduced by unbalanced forests, and we present experiments using the algorithm to learn Synchronous Context-Free Grammar rules for machine translation. In this application, the forests being sampled represent the set of Hiero-style rules that are consistent with fixed input word-level alignments. We demonstrate equivalent machine translation performance to standard techniques but with much smaller grammars.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom