z-logo
open-access-imgOpen Access
An Automated Academic Book Scanner with Deep Learning Powered Math Expression Detection and Recognition
Author(s) -
S Thamaraiselvan,
Vivek Venugopal,
Susmitha Vekkot,
Pramod K. Pisharady
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3638780
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Mathematical expressions are hard to read, which makes it hard to digitize printed STEM material. Conventional OCR methods do not work well on contents which comprise of text along with mathematical expressions. This paper proposes an automated book scanning system comprising of Raspberry Pi, page gripper and flipper set up and a smartphone camera for capturing the content. Once the content is captured, a combination of text recognition, mathematical expression detection and recognition modules work in synergy to provide the final digitized content. The mathematical expression detection module consists of a customized RoIHeads version of Faster R-CNN. The proposed model performed with 95.71% precision, 91.77% recall, 93.74% F1-score, and 87.42% mean IOU on the publicly available IBEM dataset. Once the expressions are detected, they are recognised using the Vision Encoder-Decoder Model architecture and the subsequent latex version of the mathematical expression is generated. This model is trained on the Im2Latex dataset and provided a BLEU score of 86.44 % on the test set. We also demonstrated the utility of the proposed method in academic STEM book scanning with accurate recovery of mathematical expressions. Our system gives a cost effective solution which makes it possible to extract text and mathematical content from printed books in a method that is scalable and accurate. This opens the door to large-scale digital preservation and machine-readable academic archiving.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom