
An Integrative Review of Image Captioning Research
Author(s) -
Chaoyang Wang,
Ziwei Zhou,
Liang Xu
Publication year - 2021
Publication title -
Journal of Physics: Conference Series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1748/4/042060
Subject(s) - closed captioning, computer science, image (mathematics), task (project management), artificial intelligence, benchmark (surveying), computer vision, field (mathematics), engineering, mathematics, geography, systems engineering, geodesy, pure mathematics
Image captioning is an emerging research frontier in computer vision. Its basic task is to generate a natural-language description of an input image. This paper surveys and analyzes existing research on image captioning. First, the task and its application scenarios are introduced. Second, template-based and encoder-decoder-based image captioning algorithms are analyzed, and the advantages and limitations of each approach are discussed. Then, benchmark datasets and evaluation methods for image captioning are introduced. Finally, future directions for image captioning are outlined.