Baidu Meizu Deep Learning Competition: Arithmetic Operation Recognition Using End-to-End Learning OCR Technologies
Author(s) -
Yuxiang Jiang,
Haiwei Dong,
Abdulmotaleb El Saddik
Publication year - 2018
Publication title -
ieee access
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.587
H-Index - 127
ISSN - 2169-3536
DOI - 10.1109/access.2018.2876035
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
The end-to-end learning approaches were proposed for an arithmetic expression recognition task in the Baidu Meizu Deep Learning Competition by a deep convolutional neural network (DCNN) with parallel dense layers and component-connection-based detection pipeline with the convolutional recurrent neural network (CRNN) model. Two effective pipelines for DCNN and CRNN to identify long and complex expressions are presented and compared. In the first task, a DCNN connected to parallel dense layers for digital arithmetic operations was developed, which achieves 99.985% accuracy. In the second task, the CRNN with connectionist temporal classification was adopted, combined with the text region detection technique to recognize more complex pictures with both assignment operations and calculation formulas, which achieves 98.087% accuracy.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom