z-logo
open-access-imgOpen Access
Learning Structural Kernels for Natural Language Processing
Author(s) -
Daniel Beck,
Trevor Cohn,
Christian Hardmeier,
Lucia Specia
Publication year - 2015
Publication title -
transactions of the association for computational linguistics
Language(s) - English
Resource type - Journals
ISSN - 2307-387X
DOI - 10.1162/tacl_a_00151
Subject(s) - computer science , hyperparameter optimization , hyperparameter , machine learning , kernel (algebra) , artificial intelligence , grid , model selection , selection (genetic algorithm) , context (archaeology) , tree (set theory) , support vector machine , paleontology , mathematical analysis , geometry , mathematics , combinatorics , biology
Structural kernels are a flexible learning paradigm that has been widely used in Natural Language Processing. However, the problem of model selection in kernel-based methods is usually overlooked. Previous approaches mostly rely on setting default values for kernel hyperparameters or using grid search, which is slow and coarse-grained. In contrast, Bayesian methods allow efficient model selection by maximizing the evidence on the training data through gradient-based methods. In this paper we show how to perform this in the context of structural kernels by using Gaussian Processes. Experimental results on tree kernels show that this procedure results in better prediction performance compared to hyperparameter optimization via grid search. The framework proposed in this paper can be adapted to other structures besides trees, e.g., strings and graphs, thereby extending the utility of kernel-based methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom