z-logo
open-access-imgOpen Access
Working with batches of PDF files
Author(s) -
Moritz Mähr
Publication year - 2020
Publication title -
the programming historian
Language(s) - English
Resource type - Journals
ISSN - 2397-2068
DOI - 10.46430/phen0088
Subject(s) - computer science , line (geometry) , information retrieval , natural language processing , artificial intelligence , world wide web , mathematics , geometry
Learn how to perform OCR and text extraction with free command line tools like Tesseract and Poppler and how to get an overview of large numbers of PDF documents using topic modeling.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here