Image Captioning using CNN and LSTM | Zendy

Anish Banda | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Image Captioning using CNN and LSTM

Author(s) -

Anish Banda

Publication year - 2021

Publication title -

international journal for research in applied science and engineering technology

Language(s) - English

Resource type - Journals

ISSN - 2321-9653

DOI - 10.22214/ijraset.2021.37846

Subject(s) - computer science , closed captioning , artificial intelligence , encoder , convolutional neural network , image (mathematics) , sentence , metric (unit) , speech recognition , recurrent neural network , artificial neural network , pattern recognition (psychology) , computer vision , operations management , economics , operating system

In the model we proposed, we examine the deep neural networks-based image caption generation technique. We give image as input to the model, the technique give output in three different forms i.e., sentence in three different languages describing the image, mp3 audio file and an image file is also generated. In this model, we use the techniques of both computer vision and natural language processing. We are aiming to develop a model using the techniques of Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) to build a model to generate a Caption. Target image is compared with the training images, we have a large dataset containing the training images, this is done by convolutional neural network. This model generates a decent description utilizing the trained data. To extract features from images we need encoder, we use CNN as encoder. To decode the description of image generated we use LSTM. To evaluate the accuracy of generated caption we use BLEU metric algorithm. It grades the quality of content generated. Performance is calculated by the standard calculation matrices. Keywords: CNN, RNN, LSTM, BLEU score, encoder, decoder, captions, image description.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore