Results 41 to 50 of about 24,451 (274)

Human Attention in Image Captioning: Dataset and Analysis [PDF]

open access: yes, 2019
In this work, we present a novel dataset consisting of eye movements and verbal descriptions recorded synchronously over images. Using this data, we study the differences in human attention during free-viewing and image captioning tasks. We look into the
Borji, Ali   +3 more
core   +3 more sources

Multi‐task learning for captioning images with novel words

open access: yesIET Computer Vision, 2019
Recent captioning models are limited in their ability to describe concepts unseen in paired image–sentence pairs. This study presents a framework of multi‐task learning for describing novel words not present in existing image‐captioning datasets.
He Zheng   +4 more
doaj   +1 more source

Image-Captioning Model Compression

open access: yesApplied Sciences, 2022
Image captioning is a very important task, which is on the edge between natural language processing (NLP) and computer vision (CV). The current quality of the captioning models allows them to be used for practical tasks, but they require both large ...
Viktar Atliha, Dmitrij Šešok
doaj   +1 more source

COMIC: Towards A Compact Image Captioning Model with Attention

open access: yes, 2019
Recent works in image captioning have shown very promising raw performance. However, we realize that most of these encoder-decoder style networks with attention do not scale naturally to large vocabulary size, making them difficult to be deployed on ...
Chan, Chee Seng   +2 more
core   +1 more source

Deep Interactive Region Segmentation and Captioning

open access: yes, 2017
With recent innovations in dense image captioning, it is now possible to describe every object of the scene with a caption while objects are determined by bounding boxes.
Boroujerdi, Ali Sharifi   +2 more
core   +1 more source

Captioning Transformer with Stacked Attention Modules

open access: yesApplied Sciences, 2018
Image captioning is a challenging task. Meanwhile, it is important for the machine to understand the meaning of an image better. In recent years, the image captioning usually use the long-short-term-memory (LSTM) as the decoder to generate the sentence ...
Xinxin Zhu   +4 more
doaj   +1 more source

Image Caption Generator

open access: yesInternational Journal of Innovative Technology and Exploring Engineering, 2021
In the modern era, image captioning has become one of the most widely required tools. Moreover, there are inbuilt applications that generate and provide a caption for a certain image, all these things are done with the help of deep neural network models. The process of generating a description of an image is called image captioning.
Megha J Panicker   +3 more
openaire   +1 more source

Geo-Aware Image Caption Generation [PDF]

open access: yesProceedings of the 28th International Conference on Computational Linguistics, 2020
Standard image caption generation systems produce generic descriptions of images and do not utilize any contextual information or world knowledge. In particular, they are unable to generate captions that contain references to the geographic context of an image, for example, the location where a photograph is taken or relevant geographic objects around ...
Nikiforova, Sofia   +3 more
openaire   +3 more sources

Image Captioning using Deep Neural Architectures

open access: yes, 2017
Automatically creating the description of an image using any natural languages sentence like English is a very challenging task. It requires expertise of both image processing as well as natural language processing.
Bakarola, Vishvajit   +2 more
core   +1 more source

Image Captioning

open access: yes, 2018
arXiv admin note: text overlap with arXiv:1609.06647 by other ...
Mullachery, Vikram, Motwani, Vishal
openaire   +2 more sources

Home - About - Disclaimer - Privacy