Open Access
Navigating Multilingual News Collections Using Automatically Extracted Information
Author(s) - Ralf Steinberger, Bruno Pouliquen, Camelia Ignat
Publication year - 2005
Publication title - CIT. Journal of Computing and Information Technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.169
H-Index - 27
eISSN - 1846-3908
pISSN - 1330-1136
DOI - 10.2498/cit.2005.04.01
Subject(s) - computer science, hyperlink, relevance (law), set (abstract data type), information retrieval, world wide web, web page, programming language, political science, law
We present a text analysis tool set that allows analysts in various fields to sift quickly through large collections of multilingual news items and to find the information relevant to them. For a given document collection, the tool set automatically clusters the texts into groups of similar articles, extracts names of places, people and organisations, lists the user-defined specialist terms found, links clusters and entities, and generates hyperlinks. Through its daily analysis of thousands of news articles, the tool also learns relationships between people and other entities. The fully functional prototype system allows users to explore and navigate multilingual document collections across languages and time.
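
The abstract does not say which algorithms the tool set uses, so the following is only an illustrative sketch of two of the listed steps: grouping similar articles and finding known names in them. It assumes bag-of-words cosine similarity with a greedy single-pass grouping, and a simple name list lookup. The function names, the stop-word list, the 0.3 threshold and the sample headlines are all hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list, kept tiny for illustration.
STOP_WORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "for", "with", "is"}


def vectorize(text):
    """Return lower-cased term counts for a text, ignoring stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(t for t in tokens if t not in STOP_WORDS)


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster(texts, threshold=0.3):
    """Greedy single-pass grouping (an assumption, not the paper's method).

    Each text joins the first cluster whose first member is at least
    `threshold` similar to it; otherwise it starts a new cluster.
    Returns a list of clusters, each a list of indices into `texts`.
    """
    vectors = [vectorize(t) for t in texts]
    clusters = []
    for i, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def find_entities(text, name_list):
    """Return the names from a known-name list that occur verbatim in the text."""
    return {name for name in name_list if name in text}


if __name__ == "__main__":
    # Made-up sample headlines, for illustration only.
    texts = [
        "Earthquake strikes northern Pakistan, rescue teams deployed",
        "Rescue teams search for survivors after Pakistan earthquake",
        "European Commission proposes new fishing quotas",
    ]
    names = {"Pakistan", "European Commission"}
    for members in cluster(texts):
        entities = set().union(*(find_entities(texts[i], names) for i in members))
        print(members, sorted(entities))
```

Linking clusters and entities, as the abstract describes, then amounts to recording which names occur in which cluster. A multilingual system would additionally need language-specific tokenisation and name variants, which this sketch leaves out.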