Image captioning - Open Access .click

Results 21 to 30 of about 24,451 (274)

Automatic Image and Video Caption Generation With Deep Learning: A Concise Review and Algorithmic Overlap

IEEE Access, 2020
Methodologies that utilize Deep Learning offer great potential for applications that automatically attempt to generate captions or descriptions about images and video frames.
Soheyla Amirian +3 more
doaj +1 more source

A thorough review of models, evaluation metrics, and datasets on image captioning

IET Image Processing, 2022
Image captioning means generate descriptive sentences from a query image automatically. It has recently received widespread attention from the computer vision and natural language processing communities as an emerging visual task.
Gaifang Luo +4 more
doaj +1 more source

Convolutional Image Captioning [PDF]

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018
11 pages, 9 ...
Aneja, Jyoti, Deshpande, Aditya, Schwing, Alexander +2 more
openaire +2 more sources

Image Captioning With Positional and Geometrical Semantics

IEEE Access, 2021
The last 5 to 6 years have seen tremendous progress in automatic image captioning using deep learning. Initial research focused on the attribute-to-attribute comparison of image features and texts to describe the image as a sentence, the current research
Anwar Ul Haque, Sayeed Ghani, Muhammad Saeed +2 more
doaj +1 more source

Topic scene graphs for image captioning

IET Computer Vision, 2022
When describing an image, people can rapidly extract the topic from the image and find the main object, generating sentences that match the main idea of the image. However, most of the scene graph generation methods do not emphasise the importance of the
Min Zhang +4 more
doaj +1 more source

Unsupervised Image Captioning [PDF]

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Deep neural networks have achieved great successes on the image captioning task. However, most of the existing models depend heavily on paired image-sentence datasets, which are very expensive to acquire. In this paper, we make the first attempt to train an image captioning model in an unsupervised manner.
Feng, Yang, Ma, Lin, Liu, Wei, Luo, Jiebo +3 more
openaire +2 more sources

Improving Image Captioning with Better Use of Caption

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
ACL ...
Shi, Zhan, Zhou, Xu, Qiu, Xipeng, Zhu, Xiaodan +3 more
openaire +2 more sources

Image Description Generation Method by Panoptic Segmentation and Multi-Visual-Feature Fusion [PDF]

Jisuanji gongcheng
Due to their powerful sequence modeling capabilities, Transformer-based image captioning models have demonstrated remarkable performance. However, most of these models typically utilize region visual features to perform encoding and decoding, which ...
LIU Mingming, LU Jinfu, LIU Hao, ZHANG Haiyan
doaj +1 more source

Entity-grounded image captioning [PDF]

, 2018
An urgent limitation in current Image Captioning models is their tendency to produce generic captions that avoid the interesting detail which makes each image unique. To address this limitation, we propose an approach that enforces a stronger alignment between image regions and specific segments of text.
Lindh, Annika, Ross, Robert J., Kelleher, John D. +2 more
openaire +2 more sources

Areas of Attention for Image Captioning [PDF]

, 2016
We propose "Areas of Attention", a novel attention-based model for automatic image captioning. Our approach models the dependencies between image regions, caption words, and the state of an RNN language model, using three pairwise interactions.
Lucas, Thomas +3 more
core +5 more sources

deep learning
computer vision
transformer

natural language processing
lstm
visual attention

attention mechanism
image caption