Premium
Automatic system for identifying and categorizing temporal relations in natural language
Author(s) -
Llorens Hector,
Saquete Estela,
NavarroColorado Borja
Publication year - 2012
Publication title -
international journal of intelligent systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.291
H-Index - 87
eISSN - 1098-111X
pISSN - 0884-8173
DOI - 10.1002/int.21542
Subject(s) - categorization , computer science , artificial intelligence , relation (database) , identification (biology) , natural language processing , relationship extraction , field (mathematics) , information extraction , natural language , interpretation (philosophy) , machine learning , data mining , botany , mathematics , pure mathematics , biology , programming language
Abstract Nowadays, the automatic processing of digitalized documents is crucial to cope with the increasing amount of information available. This issue is addressed from the natural language processing (NLP) research field. One of the tasks required for many NLP applications is temporal information processing. It involves the automatic extraction and interpretation of temporal expressions, events, and their relations. Specifically, the identification and the categorization of temporal relations are the most complex subtasks yet to solve, judging from the results reported in the latest international evaluation exercise. Temporal relation identification has been addressed by very few approaches, and the current categorization approaches are still not a definitive solution. This paper presents a system that approaches temporal relation identification and categorization. The former is approached with a knowledge‐driven strategy and the later with data‐driven strategy based on different machine‐learning techniques. Our proposal has been empirically evaluated over the currently available English data sets annotated with temporal information (TimeBank and AQUAINT) in a 10‐fold cross‐validated experiment. The results obtained support that the presented approach achieves a high performance. It improves the baseline F1 by 46% and outperforms the state of the art. © 2012 Wiley Periodicals, Inc.