
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
Author(s) -
Zhaokai Wang,
Renda Bao,
Qi Wu,
Si Liu
Publication year - 2021
Publication title -
proceedings of the ... aaai conference on artificial intelligence
Language(s) - Uncategorized
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v35i4.16389
Subject(s) - computer science , closed captioning , transformer , natural language processing , artificial intelligence , repetition (rhetorical device) , optical character recognition , redundancy (engineering) , reading (process) , word recognition , speech recognition , word (group theory) , image (mathematics) , linguistics , philosophy , physics , quantum mechanics , voltage , operating system