Tree Structured Dirichlet Processes for Hierarchical Morphological Segmentation
Author(s) -
Burcu Can,
Suresh Manandhar
Publication year - 2018
Publication title -
computational linguistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.314
H-Index - 98
eISSN - 1530-9312
pISSN - 0891-2017
DOI - 10.1162/coli_a_00318
Subject(s) - computer science , hierarchical dirichlet process , artificial intelligence , segmentation , tree (set theory) , hierarchical clustering , cluster analysis , tree structure , probabilistic logic , pattern recognition (psychology) , latent dirichlet allocation , dirichlet distribution , brown clustering , machine learning , topic model , fuzzy clustering , mathematics , binary tree , canopy clustering algorithm , algorithm , mathematical analysis , boundary value problem
This article presents a probabilistic hierarchical clustering model for morphological segmentation. In contrast to existing approaches to morphology learning, our method allows learning hierarchical organization of word morphology as a collection of tree structured paradigms. The model is fully unsupervised and based on the hierarchical Dirichlet process. Tree hierarchies are learned along with the corresponding morphological paradigms simultaneously. Our model is evaluated on Morpho Challenge and shows competitive performance when compared to state-of-the-art unsupervised morphological segmentation systems. Although we apply this model for morphological segmentation, the model itself can also be used for hierarchical clustering of other types of data.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom