z-logo
Premium
Generating descriptive multi‐document summaries of geo‐located entities using entity type models
Author(s) -
Aker Ahmet,
Gaizauskas Robert
Publication year - 2015
Publication title -
journal of the association for information science and technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.903
H-Index - 145
eISSN - 2330-1643
pISSN - 2330-1635
DOI - 10.1002/asi.23211
Subject(s) - computer science , information retrieval , document type definition , type (biology) , natural language processing , world wide web , document structure description , xml , biology , ecology
In this article, we investigate the application of entity type models in extractive multi‐document summarization using automatic caption generation for images of geo‐located entities (e.g., W estminster   A bbey ) as an application scenario. Entity type models contain sets of patterns aiming to capture the ways geo‐located entities are described in natural language. They are automatically derived from texts about geo‐located entities of the same type (e.g., churches, lakes). We integrate entity type models into a multi‐document summarizer and use them to address the 2 major tasks in extractive multi‐document summarization: sentence scoring and summary composition . We experiment with 3 different representation methods for entity type models: signature words , n‐gram language models, and dependency patterns . We evaluate the summarizer with integrated entity type models relative to (a) a summarizer using standard text‐related features commonly used in text summarization and (b) the W ikipedia location descriptions. Our results show that entity type models significantly improve the quality of output summaries over that of summaries generated using standard summarization features and W ikipedia summaries. The representation of entity type models using dependency patterns is superior to the representations using signature words and n‐gram language models.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here