Translating–transliterating named entities for multilingual information access
Author(s) -
Chen HsinHsi,
Lin WenCheng,
Yang Changhua,
Lin WeiHao
Publication year - 2006
Publication title -
journal of the american society for information science and technology
Language(s) - English
Resource type - Journals
eISSN - 1532-2890
pISSN - 1532-2882
DOI - 10.1002/asi.20327
Subject(s) - transliteration , computer science , natural language processing , artificial intelligence , similarity (geometry) , named entity , information retrieval , translation (biology) , biochemistry , chemistry , messenger rna , image (mathematics) , gene
Named entities are major constituents of a document but are usually unknown words. This work proposes a systematic way of dealing with formulation, transformation, translation, and transliteration of multilingual‐named entities. The rules and similarity matrices for translation and transliteration are learned automatically from parallel‐named‐entity corpora. The results are applied in cross‐language access to collections of images with captions. Experimental results demonstrate that the similarity‐based transliteration of named entities is effective, and runs in which transliteration is considered outperform the runs in which it is neglected.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom