Open Access
Extraction of Text From Image
Author(s) -
Akanksha Mate,
Megha Gurav,
Kajal Babar,
Gauri Raskar,
Prakash Kshirsagar
Publication year - 2021
Publication title -
international journal of advanced research in science, communication and technology
Language(s) - English
Resource type - Journals
ISSN - 2581-9429
DOI - 10.48175/ijarsct-1391
Subject(s) - computer science , component (thermodynamics) , image (mathematics) , variety (cybernetics) , information retrieval , code (set theory) , data extraction , artificial intelligence , programming language , physics , set (abstract data type) , medline , political science , law , thermodynamics
Picture Text is the content data implanted or written in picture of various structure. Picture text can be found in caught pictures, filtered records, magazines, papers, banners and so on These picture messages are profoundly accessible these days and they are vital in addressing, depicting and moving data which help people groups in correspondence, tackling issues, accessibility, formation of new sorts of occupations, cost viability, efficiency, globalization and social hole and so forth The data from these picture archives would give higher proficiency and straightforward entry on the off chance that it is changed over to message structure. The cycle by which Image Text changed over into plain content is Text Extraction. Text Extraction is helpful in data recovering, looking, altering, recording, filing or detailing of picture text. In any case, variety of these writings because of contrasts in size, direction style, and arrangement, text is installed in complex hued archive pictures, corrupted reports picture, inferior quality picture, as well as low picture differentiation and complex foundation make issue text extraction incredibly troublesome what's more, testing one. Various strategies like Connected Component Method, Mathematical Morphology Method, Edged Based Method and Texture Based Method have been utilized beforehand, however those all have their own constraints when estimated by various boundaries like exactness, review and f- score. In this paper, text extraction from picture reports, utilizing blend of the two amazing techniques Connected Component and Edge Based Method, to improve execution and exactness of text extraction is talked about and execution is finished by incorporated MATLAB code with MATLAB/Simulink device and the proposed framework is tried by Digital Image Binarization Competition (DIBCO) 2017 dataset. At long last, the separated and perceived is changed over to discourse for legitimate use for outwardly hindered individuals.