z-logo
open-access-imgOpen Access
TreeTalk: Composition and Compression of Trees for Image Descriptions
Author(s) -
Полина Кузнецова,
Vicente Ordóñez,
Tamara L. Berg,
Yejin Choi
Publication year - 2014
Publication title -
transactions of the association for computational linguistics
Language(s) - English
Resource type - Journals
ISSN - 2307-387X
DOI - 10.1162/tacl_a_00188
Subject(s) - computer science , generalization , tree (set theory) , image (mathematics) , composition (language) , key (lock) , sequence (biology) , artificial intelligence , tree structure , theoretical computer science , data structure , programming language , combinatorics , mathematics , linguistics , mathematical analysis , philosophy , computer security , biology , genetics
We present a new tree based approach to composing expressive image descriptions that makes use of naturally occuring web images with captions. We investigate two related tasks: image caption generalization and generation, where the former is an optional subtask of the latter. The high-level idea of our approach is to harvest expressive phrases (as tree fragments) from existing image descriptions, then to compose a new description by selectively combining the extracted (and optionally pruned) tree fragments. Key algorithmic components are tree composition and compression, both integrating tree structure with sequence structure. Our proposed system attains significantly better performance than previous approaches for both image caption generalization and generation. In addition, our work is the first to show the empirical benefit of automatically generalized captions for composing natural image descriptions.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom