
Multiscale fully convolutional network‐based approach for multilingual character segmentation
Author(s) -
Yu Chao,
Liu Jin,
Li Yunhui
Publication year - 2021
Publication title -
iet computer vision
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.38
H-Index - 37
eISSN - 1751-9640
pISSN - 1751-9632
DOI - 10.1049/cvi2.12034
Subject(s) - computer science , segmentation , character (mathematics) , artificial intelligence , benchmark (surveying) , convolutional neural network , pattern recognition (psychology) , encoder , exploit , image segmentation , task (project management) , computer security , geodesy , management , economics , geography , operating system , geometry , mathematics
Character segmentation is a challenging task for optical character recognition systems. Traditional methods usually utilize rule‐based algorithms but most of them are not applicable in modern intelligent recognition applications that require high accuracy. It is especially the case for text containing Eastern Asian language characters with complex pictograph structures, such as Chinese. To alleviate this problem, this study proposes an encoder–decoder structure‐based multiscale fully convolutional network (MSFCN) model for optical character segmentation. Comparing with other methods, MSFCN can not only effectively extract semantic details from images but also exploit boundary information of intervals between characters, thereby distinguishing characters from a background in pixel level. Extensive experiments have been conducted on two benchmark data sets of ICDAR2013 and MLCS. Obtained results prove that MSFCN achieves state‐of‐the‐art segmentation performance and indicated its practical application value.