Results 21 to 30 of about 69,316 (271)
Automatic Caption Generation for News Images [PDF]
This paper is concerned with the task of automatically generating captions for images, which is important for many image-related applications. Examples include video and image retrieval as well as the development of tools that aid visually impaired individuals to access pictorial information.
Yansong, Feng, Mirella, Lapata
openaire +2 more sources
Image Caption Generation via Unified Retrieval and Generation-Based Method
Image captioning is a multi-modal transduction task, translating the source image into the target language. Numerous dominant approaches primarily employed the generation-based or the retrieval-based method.
Shanshan Zhao +4 more
doaj +1 more source
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
Recently, automatic image caption generation has been an important focus of the work on multimodal translation task. Existing approaches can be roughly categorized into two classes, top-down and bottom-up, the former transfers the image information ...
Ling Cheng +4 more
doaj +1 more source
Self-Learning for Few-Shot Remote Sensing Image Captioning
Large-scale caption-labeled remote sensing image samples are expensive to acquire, and the training samples available in practical application scenarios are generally limited.
Haonan Zhou +3 more
doaj +1 more source
3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model
In this paper, we build a multi-style generative model for stylish image captioning which uses multi-modality image features, ResNeXt features, and text features generated by DenseCap.
Chengxi Li, Brent Harrison
doaj +1 more source
Generating images from a text description is as challenging as it is interesting. The Adversarial network performs in a competitive fashion where the networks are the rivalry of each other. With the introduction of Generative Adversarial Network, lots of development is happening in the field of Computer Vision.
Mahima Pandya, Prof. Sonal Rami
openaire +1 more source
Classification and object recognition in image processing has significantly improved computer vision tasks. The method is often used for visual problems, especially in picture classification utilizing the Convolutional Neural Network (CNN).
Rifqi Mulyawan +2 more
doaj +1 more source
Learn and Tell: Learning Priors for Image Caption Generation
In this work, we propose a novel priors-based attention neural network (PANN) for image captioning, which aims at incorporating two kinds of priors, i.e., the probabilities being mentioned for local region proposals (PBM priors) and part-of-speech clues ...
Pei Liu, Dezhong Peng, Ming Zhang
doaj +1 more source
Vision Transformer and Language Model Based Radiology Report Generation
Recent advancements in transformers exploited computer vision problems which results in state-of-the-art models. Transformer-based models in various sequence prediction tasks such as language translation, sentiment classification, and caption generation ...
Mashood Mohammad Mohsan +5 more
doaj +1 more source
Sparse Adversarial Examples Attacking on Video Captioning Model [PDF]
Despite the fact that multi-modal deep learning such as image captioning model has been proved to be vulnerable to adversarial examples,the adversarial susceptibility in video caption generation is under-examined.There are two main reasons for this.On ...
QIU Jiangxing, TANG Xueming, WANG Tianmei, WANG Chen, CUI Yongquan, LUO Ting
doaj +1 more source

