Results 21 to 30 of about 24,451 (274)
Methodologies that utilize Deep Learning offer great potential for applications that automatically attempt to generate captions or descriptions about images and video frames.
Soheyla Amirian +3 more
doaj +1 more source
A thorough review of models, evaluation metrics, and datasets on image captioning
Image captioning means generate descriptive sentences from a query image automatically. It has recently received widespread attention from the computer vision and natural language processing communities as an emerging visual task.
Gaifang Luo +4 more
doaj +1 more source
Convolutional Image Captioning [PDF]
11 pages, 9 ...
Aneja, Jyoti +2 more
openaire +2 more sources
Image Captioning With Positional and Geometrical Semantics
The last 5 to 6 years have seen tremendous progress in automatic image captioning using deep learning. Initial research focused on the attribute-to-attribute comparison of image features and texts to describe the image as a sentence, the current research
Anwar Ul Haque +2 more
doaj +1 more source
Topic scene graphs for image captioning
When describing an image, people can rapidly extract the topic from the image and find the main object, generating sentences that match the main idea of the image. However, most of the scene graph generation methods do not emphasise the importance of the
Min Zhang +4 more
doaj +1 more source
Unsupervised Image Captioning [PDF]
Deep neural networks have achieved great successes on the image captioning task. However, most of the existing models depend heavily on paired image-sentence datasets, which are very expensive to acquire. In this paper, we make the first attempt to train an image captioning model in an unsupervised manner.
Feng, Yang +3 more
openaire +2 more sources
Improving Image Captioning with Better Use of Caption
ACL ...
Shi, Zhan +3 more
openaire +2 more sources
Image Description Generation Method by Panoptic Segmentation and Multi-Visual-Feature Fusion [PDF]
Due to their powerful sequence modeling capabilities, Transformer-based image captioning models have demonstrated remarkable performance. However, most of these models typically utilize region visual features to perform encoding and decoding, which ...
LIU Mingming, LU Jinfu, LIU Hao, ZHANG Haiyan
doaj +1 more source
Entity-grounded image captioning [PDF]
An urgent limitation in current Image Captioning models is their tendency to produce generic captions that avoid the interesting detail which makes each image unique. To address this limitation, we propose an approach that enforces a stronger alignment between image regions and specific segments of text.
Lindh, Annika +2 more
openaire +2 more sources
Areas of Attention for Image Captioning [PDF]
We propose "Areas of Attention", a novel attention-based model for automatic image captioning. Our approach models the dependencies between image regions, caption words, and the state of an RNN language model, using three pairwise interactions.
Lucas, Thomas +3 more
core +5 more sources

