Open Access
Structural Models of English Terms of Automated Processing of Scientific and Technical Texts Corpora
Author(s) - Yu. I. Butenko, Н. С. Николаева, Elena Kartseva
Publication year - 2022
Publication title - vestnik rossijskogo universiteta družby narodov. seriâ: teoriâ âzyka, semiotika, semantika
Language(s) - English
Resource type - Journals
eISSN - 2411-1236
pISSN - 2313-2299
DOI - 10.22363/2313-2299-2022-13-1-80-95
Subject(s) - terminology, computer science, noun, component (thermodynamics), natural language processing, word formation, linguistics, relevance (law), adjective, artificial intelligence, corpus linguistics, scientific literature, subject (documents), paleontology, philosophy, physics, biology, political science, library science, law, thermodynamics
The article examines the structural models of English multi-component terms from the subject area Welding types as a basis for marking corpora of scientific and technical texts. It outlines the place of such corpora in corpus linguistics and the prospects for further research based on them. The relevance of the research stems from the need to create a corpus of scientific and technical texts in general and tools for the automatic marking of terms in particular. It is substantiated that the main problem in creating such a corpus is the automatic marking of terminological word combinations. The current state of the terminology system of the subject area Welding types is analysed, and the formal structure of its elements is considered. The results of the analysis of two-, three-, and four-component English terminological word combinations from the Welding types subject area are presented together with their structural models. All structural models are illustrated with examples, and the most productive ones are highlighted. The most productive model, a nucleus element combined with a noun or an adjective functioning as a prepositive attribute, is found in two-component word combinations; the analysis of more complex formations shows that the same model of a left attribute attached to the term nucleus is also present in them, demonstrating generic features. The necessity of enumerating all possible structural models of terminological combinations in the subject area Welding types is substantiated.
The novelty of the study lies in the formation of a database of structural models of terminological combinations as the basis of a superstructure database on the structure of terms, intended to improve the quality of automatic marking of corpora of scientific and technical texts and the processing of candidate terms in corpus studies.
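
The idea of marking terms by matching them against an enumerated set of structural models can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the tag names (`N` for noun, `A` for adjective), the handful of models in `STRUCTURAL_MODELS`, the function `mark_terms`, and the hand-tagged sample sentence are all assumptions made for the example, and the abstract does not list the actual models or examples used in the article.

```python
from typing import List, Tuple

# Hypothetical structural models, written as sequences of part-of-speech tags.
# The last tag is the nucleus noun; everything to its left is a prepositive
# attribute. N = noun, A = adjective. The real inventory is in the article.
STRUCTURAL_MODELS = {
    ("N", "N"),            # two-component: noun + nucleus
    ("A", "N"),            # two-component: adjective + nucleus
    ("A", "N", "N"),       # three-component
    ("N", "N", "N"),       # three-component
    ("A", "A", "N", "N"),  # four-component
}


def mark_terms(tagged: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Return (candidate term, model) pairs found in a POS-tagged sentence.

    At each position the longest window that matches a known structural
    model is taken, so a three-component term is preferred over the
    two-component term it contains.
    """
    max_len = max(len(model) for model in STRUCTURAL_MODELS)
    found = []
    i = 0
    while i < len(tagged):
        matched = False
        # Try the longest window first, down to two components.
        for n in range(min(max_len, len(tagged) - i), 1, -1):
            window = tagged[i:i + n]
            model = tuple(pos for _, pos in window)
            if model in STRUCTURAL_MODELS:
                term = " ".join(token for token, _ in window)
                found.append((term, "+".join(model)))
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return found


# Hand-tagged sample sentence (an assumption; a real pipeline would use a tagger).
SAMPLE = [
    ("manual", "A"), ("arc", "N"), ("welding", "N"), ("requires", "V"),
    ("a", "D"), ("consumable", "A"), ("electrode", "N"),
]

if __name__ == "__main__":
    for term, model in mark_terms(SAMPLE):
        print(f"{term}\t{model}")
```

Because matching depends entirely on the model set, any structural model missing from it leaves the corresponding terms unmarked, which is one way to read the abstract's point that all possible models for the subject area need to be enumerated.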
