z-logo
open-access-imgOpen Access
Text Selection Issue For Parallel Corpus
Author(s) -
Rustam Abdurasulovich Karimov
Publication year - 2020
Publication title -
the american journal of social science and education innovations
Language(s) - English
Resource type - Journals
ISSN - 2689-100X
DOI - 10.37547/tajssei/volume02issue09-48
Subject(s) - representativeness heuristic , computer science , selection (genetic algorithm) , corpus linguistics , natural language processing , compiler , parallel corpora , text corpus , artificial intelligence , linguistics , information retrieval , machine translation , programming language , psychology , social psychology , philosophy
It is known that the basis of any corpus is its units. Typically, texts of different genres are selected as the corpus unit to ensure the representativeness of the corpus. Therefore, when creating any language corpus, first of all, the principles of selection of texts that are part of it should be defined. Parallel corpus units consist of texts that have been translated one or more times from the original. Which topic and genre text to choose for the parallel corpus is determined by the purpose of the compiler?

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom