z-logo
open-access-imgOpen Access
Where Google fails
Author(s) -
Maria Stambolieva
Publication year - 2015
Publication title -
english studies at nbu
Language(s) - English
Resource type - Journals
eISSN - 2367-8704
pISSN - 2367-5705
DOI - 10.33919/esnbu.15.2.8
Subject(s) - computer science , string (physics) , natural language processing , machine translation , identification (biology) , artificial intelligence , focus (optics) , categorization , set (abstract data type) , transformation (genetics) , linguistics , programming language , biochemistry , chemistry , botany , physics , quantum mechanics , gene , optics , biology , philosophy
The paper presents ongoing research in contrastive corpus linguistics with envisaged applications in machine translation (MT) and with focus on Google Translate (GT) performance in English-Bulgarian translation. Structural patterns, forms or expressions where automatic translation fails are identified and analysed in view of creating a GT-editing tool providing improved target language output. The paper presents the corpus and the corpus analysis method applied, including the identification of inacceptable string types, their structural analysis and categorization. For each failure type, pre- or post-GT editing transformations are proposed. A first outline is proposed of a GT-editing tool consisting of a pre-GT editor performing string identification, substitution or deletion operations, a post-GT editor with a set of more complex string transformation rules and an additional module transferring structural information.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here