
Method of the optical recognition of technical documentation and the transformation of graphic information into machine-readable form for cognitive analysis
Author(s) -
A A Dzyubanenko,
A V Rabin
Publication year - 2021
Publication title -
Journal of Physics: Conference Series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/2094/3/032056
Subject(s) - documentation , computer science , transformation (genetics) , optical character recognition , information retrieval , artificial intelligence , technical documentation , segmentation , pattern recognition (psychology) , computer vision , image (mathematics) , programming language , biochemistry , chemistry , gene
The paper proposes an implementation of a method for optical recognition of technical documentation that transforms graphic information into a machine-readable form suitable for cognitive analysis. The method is based on image binarization and alignment, text segmentation, and text recognition. Use of the proposed method is expected to substantially reduce the cost of cataloging, completeness checking, and inventory of documentation, and to improve design quality through semantic analysis of the documentation against an automatically updated knowledge base. The article presents the development of the document recognition algorithm, the preparation of an image for optical recognition, an example of applying the Sauvola method to binarize an image, and an analysis of the research results. The proposed implementation enables text recognition on scanned or photographed documents.
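The abstract names the Sauvola method as the binarization step. As an illustration only, a minimal pure-Python sketch of Sauvola thresholding follows. It uses the commonly cited form of the threshold, T = m * (1 + k * (s / R - 1)), where m and s are the mean and standard deviation of the pixel values in a local window. The function name `sauvola_binarize`, the window size, and the values k = 0.2 and R = 128 are assumptions chosen for the example; the paper's actual parameters and implementation are not given in this record.

```python
import math


def sauvola_binarize(gray, window=15, k=0.2, r=128.0):
    """Binarize a grayscale image with a Sauvola-style local threshold.

    gray: list of rows of integer intensities in 0..255.
    Returns rows of 1 (background, pixel above threshold) and 0 (ink).
    window, k and r are illustrative defaults, not values from the paper.
    """
    h, w = len(gray), len(gray[0])

    # Integral images of values and squared values, padded with a zero
    # row and column, so any window sum costs four lookups.
    s1 = [[0] * (w + 1) for _ in range(h + 1)]
    s2 = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row1 = row2 = 0
        for x in range(w):
            v = gray[y][x]
            row1 += v
            row2 += v * v
            s1[y + 1][x + 1] = s1[y][x + 1] + row1
            s2[y + 1][x + 1] = s2[y][x + 1] + row2

    half = window // 2
    out = []
    for y in range(h):
        # Clip the window at the image borders.
        y0, y1 = max(0, y - half), min(h, y + half + 1)
        out_row = []
        for x in range(w):
            x0, x1 = max(0, x - half), min(w, x + half + 1)
            n = (y1 - y0) * (x1 - x0)
            total = s1[y1][x1] - s1[y0][x1] - s1[y1][x0] + s1[y0][x0]
            total_sq = s2[y1][x1] - s2[y0][x1] - s2[y1][x0] + s2[y0][x0]
            mean = total / n
            std = math.sqrt(max(total_sq / n - mean * mean, 0.0))
            # Local threshold: T = m * (1 + k * (s / R - 1)).
            threshold = mean * (1.0 + k * (std / r - 1.0))
            out_row.append(1 if gray[y][x] > threshold else 0)
        out.append(out_row)
    return out


if __name__ == "__main__":
    # Synthetic 9x9 "scan": light background with one dark vertical stroke.
    page = [[30 if x == 4 else 200 for x in range(9)] for _ in range(9)]
    for row in sauvola_binarize(page, window=5):
        print("".join("." if px else "#" for px in row))
```

Because the threshold follows the local mean and contrast, a stroke can be separated from the background even when illumination varies across the page, which is the usual reason for preferring a local method over a single global threshold on photographed documents. In practice the input would be a real grayscale scan, and the window size would be tuned to the stroke width of the text.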