TDT-2002 Topic Tracking at Maryland: First Experiments with the Lemur Toolkit
Author(s) - Daqing He, Hyuk R. Park, Gabriel Murray, Michael Subotin, Douglas W. Oard
Publication year - 2003
Publication title - Digital Repository at the University of Maryland (University of Maryland, College Park)
Language(s) - English
Resource type - Reports
DOI - 10.21236/ada459303
Subject(s) - perl , lemur , computer science , lexicon , tracking (education) , arabic , orthography , natural language processing , term (time) , artificial intelligence , programming language , world wide web , linguistics , biology , reading (process) , psychology , pedagogy , primate , philosophy , physics , quantum mechanics , neuroscience
Abstract - The University of Maryland submitted six topic tracking runs for the 2002 Topic Detection and Tracking evaluation. Two runs were produced using the Lemur language modeling toolkit; the remaining four were produced using a separate system coded in Perl. The Lemur runs outperformed the Perl runs on the required condition because they handled term frequency information better. Two of the Perl runs used native Arabic orthography with two-best translation based on a statistical lexicon, obtaining results similar to those obtained with the Arabic-to-English translations provided with the collection.
