Text Selection Issue For Parallel Corpus
Author(s) -
Rustam Abdurasulovich Karimov
Publication year - 2020
Publication title -
the american journal of social science and education innovations
Language(s) - English
Resource type - Journals
ISSN - 2689-100X
DOI - 10.37547/tajssei/volume02issue09-48
Subject(s) - representativeness heuristic , computer science , selection (genetic algorithm) , corpus linguistics , natural language processing , compiler , parallel corpora , text corpus , artificial intelligence , linguistics , information retrieval , machine translation , programming language , psychology , social psychology , philosophy
It is known that the basis of any corpus is its units. Typically, texts of different genres are selected as the corpus unit to ensure the representativeness of the corpus. Therefore, when creating any language corpus, first of all, the principles of selection of texts that are part of it should be defined. Parallel corpus units consist of texts that have been translated one or more times from the original. Which topic and genre text to choose for the parallel corpus is determined by the purpose of the compiler?
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom