Geo-Aware Image Caption Generation [PDF]
Standard image caption generation systems produce generic descriptions of images and do not utilize any contextual information or world knowledge. In particular, they are unable to generate captions that contain references to the geographic context of an image, for example, the location where a photograph is taken or relevant geographic objects around ...
Nikiforova, Sofia +3 more
openaire +3 more sources
Generating Accurate Caption Units for Figure Captioning
Scientific-style figures are commonly used on the web to present numerical information. Captions that tell accurate figure information and sound natural would significantly improve figure accessibility. In this paper, we present promising results on machine figure captioning.
Xin Qian +7 more
openaire +1 more source
Image Captioning Using Motion-CNN with Object Detection
Automatic image captioning has many important applications, such as the depiction of visual contents for visually impaired people or the indexing of images on the internet.
Kiyohiko Iwamura +4 more
doaj +1 more source
Video Content Caption Generation Based on ViT and Semantic Guidance [PDF]
This paper proposes a video captioning method based on Vision Transformer (ViT) and semantic guidance to alleviate the problems of poor readability and low accuracy of caption text generated by existing video content captioning models. First, the visual ...
ZHAO Hong, CHEN Zhiwen, GUO Lan, AN Dong
doaj +1 more source
In the modern era, image captioning has become one of the most widely required tools. Moreover, there are built-in applications that generate and provide a caption for a given image; all of this is done with the help of deep neural network models. The process of generating a description of an image is called image captioning.
Megha J Panicker +3 more
openaire +1 more source
Image Caption Generation via Unified Retrieval and Generation-Based Method
Image captioning is a multi-modal transduction task, translating the source image into the target language. Numerous dominant approaches primarily employed the generation-based or the retrieval-based method.
Shanshan Zhao +4 more
doaj +1 more source
A Rapid Review of Image Captioning
Image captioning is an automatic process for generating text based on the content observed in an image. We review the field, create a framework, and build an application model. We classify image captioning into 4 categories based on input model, process model, output
Adriyendi Adriyendi
doaj +1 more source
Vision Transformer and Language Model Based Radiology Report Generation
Recent advancements in transformers have been applied to computer vision problems, resulting in state-of-the-art models. Transformer-based models in various sequence prediction tasks such as language translation, sentiment classification, and caption generation ...
Mashood Mohammad Mohsan +5 more
doaj +1 more source
Generating Diverse and Meaningful Captions [PDF]
Image Captioning is a task that requires models to acquire a multimodal understanding of the world and to express this understanding in natural language text. While the state-of-the-art for this task has rapidly improved in terms of n-gram metrics, these models tend to output the same generic captions for similar images.
Annika Lindh +4 more
openaire +5 more sources
Image Caption Generation Using Contextual Information Fusion With Bi-LSTM-s
The image caption generation algorithm necessitates the expression of image content using accurate natural language. Given the existing encoder-decoder algorithm structure, the decoder solely generates words one by one in a front-to-back order and is ...
Huawei Zhang +3 more
doaj +1 more source

