Image Captioning Methods and Metrics

doi:10.1109/ESCI50559.2021.9396839

Proceedings Article

Omkar Sargar, +1 more

TLDR

This paper gives a concise overview of state-of-the-art image captioning and of a deep-learning method for caption generation, then compares the proposed system experimentally with several existing systems to show its effectiveness.

Abstract:

Image captioning is one of the emerging research topics in the field of AI. It combines Computer Vision (CV) and Natural Language Processing (NLP) to derive features from an image, use them to identify objects, actions, and their relationships, and generate a description of the image. It is an important concept in artificial intelligence, with applications in areas such as aids for the blind, self-driving cars, and many more. In this paper we give a concise overview of state-of-the-art image captioning and of methods for caption generation using deep learning concepts. We also describe an approach to image caption generation that uses a Convolutional Neural Network (CNN) and a Generative Adversarial Network (GAN) model in a deep learning framework. With this approach the system is intelligent enough to create sentences for images. It uses an encoder-decoder architecture in which a CNN generates the image vector and a Long Short-Term Memory (LSTM) network generates a coherent sentence using NLP concepts. Finally, we compare the proposed system experimentally with several existing systems and show its effectiveness.
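
The encoder-decoder flow described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `toy_encoder` stands in for the CNN, `toy_decoder_step` stands in for one LSTM step, and every name, the tiny vocabulary, and the brightness rule are hypothetical. Only the data flow (image to vector, then one word at a time until an end token) mirrors the abstract.

```python
from typing import Callable, Dict, List, Sequence

START, END = "<start>", "<end>"

Vector = List[float]
StepFn = Callable[[Vector, Sequence[str]], Dict[str, float]]


def toy_encoder(image: Sequence[Sequence[float]]) -> Vector:
    """Hypothetical stand-in for the CNN encoder: collapse an image
    (rows of pixel intensities) into a fixed-length feature vector."""
    pixels = [p for row in image for p in row]
    return [sum(pixels) / len(pixels), max(pixels)]


def toy_decoder_step(image_vector: Vector, prefix: Sequence[str]) -> Dict[str, float]:
    """Hypothetical stand-in for one LSTM step: score every vocabulary
    word as the next word, given the image vector and the words so far."""
    bright = image_vector[0] > 0.5
    target = ["a", "bright", "sky"] if bright else ["a", "dark", "sky"]
    position = len(prefix) - 1  # number of words emitted after START
    expected = target[position] if position < len(target) else END
    vocab = ["a", "bright", "dark", "sky", END]
    return {word: (1.0 if word == expected else 0.0) for word in vocab}


def greedy_decode(image_vector: Vector, step: StepFn, max_len: int = 20) -> List[str]:
    """Greedy decoding: repeatedly pick the highest-scoring next word
    until the END token appears or max_len words have been produced."""
    caption = [START]
    for _ in range(max_len):
        scores = step(image_vector, caption)
        word = max(scores, key=scores.get)
        if word == END:
            break
        caption.append(word)
    return caption[1:]  # drop the START token


def generate_caption(image: Sequence[Sequence[float]]) -> List[str]:
    """Full pipeline: encode the image, then decode a caption."""
    return greedy_decode(toy_encoder(image), toy_decoder_step)
```

In a real system the encoder would typically be a pretrained CNN and the decoder a trained LSTM, and beam search is a common alternative to the greedy loop shown here.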

Citations

Proceedings Article

A Comparative Study of Machine Learning Based Image Captioning Models

TL;DR: Zhang et al. perform a comparative analysis of three Machine Learning (ML) approaches: k-Nearest Neighbor (KNN), Convolutional Neural Network (CNN) with Long Short-Term Memory (LSTM), and attention-based LSTM.

Journal Article

Automatic image captioning system using a deep learning approach

Gerard Deepak, +5 more

- 27 May 2023 -

Soft Computing

Journal Article

Arabic Captioning for Images of Clothing Using Deep Learning

Rasha AL-Malki, +1 more

- 01 Apr 2023 -

Sensors

TL;DR: The authors propose a deep learning model for captioning images of clothing in Arabic; the model achieves a BLEU-1 score of 88.52.

Proceedings Article

Sequential Memory Modelling for Video Captioning

Puttaraja. Puttaraja, +4 more

TL;DR: In this paper, an end-in-frame encoder-decoder network based on a deep learning approach is used to generate video subtitles, and the model, dataset, and parameters used to evaluate it are described.

References

Proceedings Article

Bleu: a Method for Automatic Evaluation of Machine Translation

Kishore Papineni, +3 more

TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.

Proceedings Article

Deep visual-semantic alignments for generating image descriptions

Andrej Karpathy, +1 more

TL;DR: A model that generates natural language descriptions of images and their regions based on a novel combination of Convolutional Neural Networks over image regions, bidirectional Recurrent Neural networks over sentences, and a structured objective that aligns the two modalities through a multimodal embedding is presented.

Posted Content

Show and Tell: A Neural Image Caption Generator

Oriol Vinyals, +3 more

- 17 Nov 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper presents a generative model based on a deep recurrent architecture that combines recent advances in computer vision and machine translation and that can be used to generate natural sentences describing an image.

Journal Article

Deep Visual-Semantic Alignments for Generating Image Descriptions

Andrej Karpathy, +1 more

- 01 Apr 2017 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A model that generates natural language descriptions of images and their regions based on a novel combination of Convolutional Neural Networks over image regions, bidirectional Recurrent Neural networks over sentences, and a structured objective that aligns the two modalities through a multimodal embedding is presented.

Association for the Advancement of Artificial Intelligence

Matthew Crosby, +1 more

TL;DR: AIIDE is the premier conference on artificial intelligence in computer games and interactive entertainment that brings together technical leaders to examine how computer games can be improved using AI technologies, and to promote new approaches and commercial developments.

IEEE Transactions on Systems, Man, and C...

Image Captioning using Deep Neural Architectures

Parth Shah, +2 more

- 17 Jan 2018 -

arXiv: Computer Vision and Pattern Recog...
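
BLEU, the first reference listed above, is also the metric behind the BLEU-1 score reported by one of the citing papers. The sketch below shows how a sentence-level score can be computed from clipped n-gram precision and a brevity penalty. It assumes whitespace-tokenized input, uniform n-gram weights, and no smoothing; the function names are illustrative and do not come from any particular library. Scores fall in the range 0 to 1, while papers often report them multiplied by 100.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision: each candidate n-gram is credited at most
    as many times as it occurs in the reference where it is most frequent."""
    cand_counts = ngrams(candidate, n)
    if not cand_counts:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def brevity_penalty(candidate, references):
    """Penalize candidates that are shorter than the closest-length reference."""
    c = len(candidate)
    if c == 0:
        return 0.0
    # Closest reference length; ties go to the shorter reference.
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    return 1.0 if c > r else math.exp(1.0 - r / c)


def bleu(candidate, references, max_n=4):
    """Sentence-level BLEU with uniform weights and no smoothing.
    max_n=1 gives BLEU-1, max_n=4 gives the usual BLEU-4."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0  # geometric mean is zero if any precision is zero
    log_mean = sum(math.log(p) for p in precisions) / max_n
    return brevity_penalty(candidate, references) * math.exp(log_mean)
```

For example, `bleu("a dog runs".split(), ["a dog runs on grass".split()], max_n=1)` has perfect unigram precision but is scaled down by the brevity penalty because the candidate is shorter than the reference. Evaluation toolkits usually compute BLEU at corpus level and may apply smoothing, so their numbers can differ from this sentence-level sketch.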

Related Papers (5)

Image Captioning: A Comprehensive Survey

Image captioning based on deep reinforcement learning

Image Captioning with Generative Adversarial Network

Chinese Image Caption Generation via Visual Attention and Topic Modeling.

Image Captioning using Deep Neural Architectures