Software tool for recognition of Ukrainian-language scientific articles | Zendy

Oksana Andriivna Tatarinova | Zendy; Vladislav Valerievich Ovsyanikov | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Software tool for recognition of Ukrainian-language scientific articles

Author(s) -

Oksana Andriivna Tatarinova,

Vladislav Valerievich Ovsyanikov

Publication year - 2021

Publication title -

vestnik nacionalʹnogo tehničeskogo universiteta "hpi". dinamika i pročnostʹ mašin

Language(s) - English

Resource type - Journals

ISSN - 2078-9130

DOI - 10.20998/2078-9130.2021.2.249715

Subject(s) - computer science , software , heuristics , artificial intelligence , optical character recognition , convolutional neural network , character (mathematics) , artificial neural network , pattern recognition (psychology) , noise (video) , image (mathematics) , natural language processing , programming language , mathematics , geometry , operating system

The problem of computer recognition, both separately printed characters and whole texts, which may contain mathematical formulas, and further saving the resulting document in the "Latex" format, is considered. The developed software implements the ability to recognize printable Latin, Cyrillic, Greek letters and special mathematical symbols. For this, a multilayer convolutional neural network built using the Keras machine learning library and additional validation heuristics are used. To improve the quality of neural network recognition, a sophisticated image processing mechanism has been developed that helps to remove noise from the image, eliminate errors associated with the inclination of characters, and correct character defects associated with the quality of the input image. Also implemented are mechanisms for collecting individual characters into words or mathematical formulas, reproducing the position of signs of indices and degrees, forming ordinary fractions and expressions under the root sign. The results of the recognized text are saved in a file with the simultaneous construction of the "latex" document structure. To demonstrate the capabilities of the developed software, a graphical user interface has been added, with which you can select and inspect the input image even before the start of recognition. During testing of the software, the recognition of images of different types was carried out: completely textual, mathematical formulas without text, mathematical formulas that are between blocks of text.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore