Empirical Methods for Exploiting Parallel Texts I. Dan Melamed (New York University) Cambridge, MA: MIT Press, 2001, xi+195 pp; hardbound, ISBN 0-262-13380-6, $32.95, £22.95
Author(s) -
Ted Pedersen
Publication year - 2002
Publication title -
computational linguistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.314
H-Index - 98
eISSN - 1530-9312
pISSN - 0891-2017
DOI - 10.1162/coli.2002.28.2.235
Subject(s) - computer science , philosophy
Parallel translations of written texts have long been useful tools for human students of language, and have begun to serve as an intriguing source of data for corpus-based approaches to natural language processing. A source text and its translation can be viewed as a coarse map between the two languages, and an industrious student or clever computer program may wish to refine that mapping so that it shows which sentences, phrases, and words are translations of one another. Humans are very adept at finding such relations in parallel text. This is true even when one or both of the languages is unfamiliar, as can be seen in a simple but convincing exercise in (Knight, 1997). While there was considerable early success in automatically identifying sentences in parallel text that are translations of each other (e.g., (Brown, Lai, and Mercer, 1991), (Gale and Church, 1993)), a variety of challenging problems has emerged since that time. Empirical Methods for Exploiting Parallel Texts is a revision of the author’s 1998 Ph.D. dissertation (University of Pennsylvania), and succeeds in capturing the range of problems inherent in parallel text. It presents a variety of techniques for finding translation equivalents and demonstrates that once these are available they can be used to align text segments, detect omissions in translations, identify non-compositional compounds, and discriminate among word senses.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom