
Automatic Image Captioning Methods
Author(s) -
Ruchitesh Malukani,
Gaurav Patel,
Netaji Subhash
Publication year - 2020
Publication title -
international journal of innovative technology and exploring engineering
Language(s) - English
Resource type - Journals
ISSN - 2278-3075
DOI - 10.35940/ijitee.g1013.0597s20
Subject(s) - closed captioning , computer science , convolutional neural network , natural language processing , task (project management) , natural language , artificial intelligence , image (mathematics) , frame (networking) , natural (archaeology) , interpretation (philosophy) , programming language , telecommunications , management , archaeology , economics , history
A language known to humans is a natural language. In computer science it is the most challenging task to make the computers understand the natural languages and generating caption automatically from the given image. While a lot of work has been done, the total solution to this problem has been demonstrated daunting so far. Image captioning is a crucial job involving linguistic image understanding and the ability to generate interpretation of sentences with proper and accurate structure. It requires expertise in Image processing and natural language processing. The publishers suggest in this practice a system using the multilayer Convolutional Neural Network (CNN) to generate language describing the images and Long Short Term Memory (LSTM) to concisely frame relevant phrases using the driven keywords. We aim in this article to provide a brief overview of current methods and algorithms of image captioning using deep learning. We also address datasets and measurement criteria widely used for the same.