
Information extraction from blood test PDFs: systematic review and case study
Author(s) -
Alice T. G. Pintanel,
Gracaliz P. Dimuro,
Eduardo N. Borges,
Mateus O. Jung,
Pedro G. Machado,
Eric L. Correa,
Beatriz S. De M. Bernardo
Publication year - 2025
Publication title -
IEEE Access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3593427
Subject(s) - aerospace, bioengineering, communication, networking and broadcast technologies, components, circuits, devices and systems, computing and processing, engineered materials, dielectrics and plasmas, engineering profession, fields, waves and electromagnetics, general topics for engineers, geoscience, nuclear engineering, photonics and electrooptics, power, energy and industry applications, robotics and control systems, signal processing and analysis, transportation
Automated extraction of information from files is increasingly essential as the volume of data grows. However, extracting information from blood test results remains relatively unexplored. This paper therefore presents a system capable of automatically extracting blood test variables. To this end, the study combines a systematic review of the methodologies and tools used to extract information from blood test PDFs with a case study that describes every stage of the developed system in detail and serves to validate it. The proposed methodology is validated using the precision, recall, and F1 score evaluation metrics. To the best of our knowledge, no existing literature conducts a study analogous to the one presented and validated in this paper through the case study.
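The three evaluation metrics named in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code: the function name `extraction_scores`, the dictionary representation of extracted variables, and the exact-match criterion are all assumptions made for illustration only.

```python
def extraction_scores(predicted: dict, expected: dict) -> tuple[float, float, float]:
    """Return (precision, recall, F1) for extracted variables.

    Hypothetical setup: both arguments map a variable name to its
    extracted value, and a prediction counts as correct only when the
    name exists in `expected` with exactly the same value.
    """
    # True positives: predicted pairs that match the ground truth exactly.
    tp = sum(1 for k, v in predicted.items() if k in expected and expected[k] == v)
    # False positives: predicted pairs that are wrong or not in the ground truth.
    fp = len(predicted) - tp
    # False negatives: ground-truth pairs that were missed or extracted incorrectly.
    fn = len(expected) - tp

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return precision, recall, f1


# Illustrative values only, not data from the paper.
predicted = {"hemoglobin": "13.5", "glucose": "90", "platelets": "250"}
expected = {"hemoglobin": "13.5", "glucose": "92", "platelets": "250", "leukocytes": "7000"}
precision, recall, f1 = extraction_scores(predicted, expected)
```

Under this exact-match assumption, a value extracted incorrectly counts as both a false positive and a false negative; a real evaluation might instead tolerate formatting differences such as decimal separators or units.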