Multimodality, interactivity, and crowdsourcing for document transcription | Zendy

Granell Emilio | Zendy; Romero Verónica | Zendy; MartínezHinarejos Carlos D. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Multimodality, interactivity, and crowdsourcing for document transcription

Author(s) -

Granell Emilio,

Romero Verónica,

MartínezHinarejos Carlos D.

Publication year - 2018

Publication title -

computational intelligence

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.353

H-Index - 52

eISSN - 1467-8640

pISSN - 0824-7935

DOI - 10.1111/coin.12169

Subject(s) - crowdsourcing , multimodality , interactivity , computer science , transcription (linguistics) , natural language processing , artificial intelligence , speech recognition , information retrieval , world wide web , linguistics , philosophy

Knowledge mining from documents usually use document engineering techniques that allow the user to access the information contained in documents of interest. In this framework, transcription may provide efficient access to the contents of handwritten documents. Manual transcription is a time‐consuming task that can be sped up by using different mechanisms. A first possibility is employing state‐of‐the‐art handwritten text recognition systems to obtain an initial draft transcription that can be manually amended. A second option is employing crowdsourcing to obtain a massive but not error‐free draft transcription. In this case, when collaborators employ mobile devices, speech dictation can be used as a transcription source, and speech and handwritten text recognition can be fused to provide a better draft transcription, which can be amended with even less effort. A final option is using interactive assistive frameworks, where the automatic system that provides the draft transcription and the transcriber cooperate to generate the final transcription. The novel contributions presented in this work include the study of the data fusion on a multimodal crowdsourcing framework and its integration with an interactive system. The use of the proposed solutions reduces the required transcription effort and optimizes the overall performance and usability, allowing for a better transcription process.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research