z-logo
open-access-imgOpen Access
Solving practical tasks of computer linguistics using the created text processing framework
Author(s) -
Екатерина Валерьевна Полицына,
Sergey A. Politsyn,
Alexander S. Porechny
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1902/1/012129
Subject(s) - computer science , documentation , text processing , java , software engineering , software , quality (philosophy) , data science , natural language processing , programming language , philosophy , epistemology
The use of linguistic analysis based on the accumulated experience in computer linguistics allows simplifying processing of huge amounts of text information and opens up new opportunities for documents processing automating. The problem of finding suitable tools, adapting them to work with texts in the Russian language, and integrating with each other makes difficult to use them both for research and in industrial systems. We present an open source Java framework (TAWT) that provides convenient tools and data structures for the main stages of text analysis which meets modern requirements for performance, reliability, project assembly tools, etc. Examples of automating some technical documentation preparation tasks demonstrate the use of the framework, TAWT can be useful for developers of research tools or applied software for implementing new functions or improving the quality of text processing, as well as for developers of automated tools to reduce routine tasks working with documentation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here