Caption generation - Open Access .click

Results 11 to 20 of about 104,311 (276)

Geo-Aware Image Caption Generation [PDF]

Proceedings of the 28th International Conference on Computational Linguistics, 2020
Standard image caption generation systems produce generic descriptions of images and do not utilize any contextual information or world knowledge. In particular, they are unable to generate captions that contain references to the geographic context of an image, for example, the location where a photograph is taken or relevant geographic objects around ...
Nikiforova, Sofia +3 more
openaire +3 more sources

Generating Accurate Caption Units for Figure Captioning

Proceedings of the Web Conference 2021, 2021
Scientific-style figures are commonly used on the web to present numerical information. Captions that tell accurate figure information and sound natural would significantly improve figure accessibility. In this paper, we present promising results on machine figure captioning.
Xin Qian +7 more
openaire +1 more source

Image Captioning Using Motion-CNN with Object Detection

Sensors, 2021
Automatic image captioning has many important applications, such as the depiction of visual contents for visually impaired people or the indexing of images on the internet.
Kiyohiko Iwamura +4 more
doaj +1 more source

Video Content Caption Generation Based on ViT and Semantic Guidance [PDF]

Jisuanji gongcheng, 2023
This paper proposes a video captioning method based on Vision Transformer（ViT） and semantic guidance to alleviate the problems of poor readability and low accuracy of caption text generated by exsisting video content captioning models.First，the visual ...
ZHAO Hong, CHEN Zhiwen, GUO Lan, AN Dong
doaj +1 more source

Image Caption Generator

International Journal of Innovative Technology and Exploring Engineering, 2021
In the modern era, image captioning has become one of the most widely required tools. Moreover, there are inbuilt applications that generate and provide a caption for a certain image, all these things are done with the help of deep neural network models. The process of generating a description of an image is called image captioning.
Megha J Panicker +3 more
openaire +1 more source

Image Caption Generation via Unified Retrieval and Generation-Based Method

Applied Sciences, 2020
Image captioning is a multi-modal transduction task, translating the source image into the target language. Numerous dominant approaches primarily employed the generation-based or the retrieval-based method.
Shanshan Zhao +4 more
doaj +1 more source

A Rapid Review of Image Captioning

JITeCS (Journal of Information Technology and Computer Science), 2021
Image captioning is an automatic process for generating text based on the content observed in an image. We do review, create framework, and build application model. We review image captioning into 4 categories based on input model, process model, output
Adriyendi Adriyendi
doaj +1 more source

Vision Transformer and Language Model Based Radiology Report Generation

IEEE Access, 2023
Recent advancements in transformers exploited computer vision problems which results in state-of-the-art models. Transformer-based models in various sequence prediction tasks such as language translation, sentiment classification, and caption generation ...
Mashood Mohammad Mohsan +5 more
doaj +1 more source

Generating Diverse and Meaningful Captions [PDF]

, 2018
Image Captioning is a task that requires models to acquire a multimodal understanding of the world and to express this understanding in natural language text. While the state-of-the-art for this task has rapidly improved in terms of n-gram metrics, these models tend to output the same generic captions for similar images.
Annika Lindh +4 more
openaire +5 more sources

Image Caption Generation Using Contextual Information Fusion With Bi-LSTM-s

IEEE Access, 2023
The image caption generation algorithm necessitates the expression of image content using accurate natural language. Given the existing encoder-decoder algorithm structure, the decoder solely generates words one by one in a front-to-back order and is ...
Huawei Zhang, Chengbo Ma, Zhanjun Jiang, Jing Lian +3 more
doaj +1 more source

image captioning
deep learning
image caption generation

image caption
natural language processing
video captioning

computer vision