Image caption generation - Open Access .click

Results 21 to 30 of about 69,316 (271)

Automatic Caption Generation for News Images [PDF]

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013
This paper is concerned with the task of automatically generating captions for images, which is important for many image-related applications. Examples include video and image retrieval as well as the development of tools that aid visually impaired individuals to access pictorial information.
Yansong, Feng, Mirella, Lapata
openaire +2 more sources

Image Caption Generation via Unified Retrieval and Generation-Based Method

Applied Sciences, 2020
Image captioning is a multi-modal transduction task, translating the source image into the target language. Numerous dominant approaches primarily employed the generation-based or the retrieval-based method.
Shanshan Zhao +4 more
doaj +1 more source

Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation

IEEE Access, 2020
Recently, automatic image caption generation has been an important focus of the work on multimodal translation task. Existing approaches can be roughly categorized into two classes, top-down and bottom-up, the former transfers the image information ...
Ling Cheng +4 more
doaj +1 more source

Self-Learning for Few-Shot Remote Sensing Image Captioning

Remote Sensing, 2022
Large-scale caption-labeled remote sensing image samples are expensive to acquire, and the training samples available in practical application scenarios are generally limited.
Haonan Zhou, Xiaoping Du, Lurui Xia, Sen Li +3 more
doaj +1 more source

3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model

Proceedings of the International Florida Artificial Intelligence Research Society Conference, 2021
In this paper, we build a multi-style generative model for stylish image captioning which uses multi-modality image features, ResNeXt features, and text features generated by DenseCap.
Chengxi Li, Brent Harrison
doaj +1 more source

Image Generation from Caption

International Journal on Soft Computing, Artificial Intelligence and Applications, 2018
Generating images from a text description is as challenging as it is interesting. The Adversarial network performs in a competitive fashion where the networks are the rivalry of each other. With the introduction of Generative Adversarial Network, lots of development is happening in the field of Computer Vision.
Mahima Pandya, Prof. Sonal Rami
openaire +1 more source

Pre-Trained CNN Architecture Analysis for Transformer-Based Indonesian Image Caption Generation Model

JOIV: International Journal on Informatics Visualization, 2023
Classification and object recognition in image processing has significantly improved computer vision tasks. The method is often used for visual problems, especially in picture classification utilizing the Convolutional Neural Network (CNN).
Rifqi Mulyawan +2 more
doaj +1 more source

Learn and Tell: Learning Priors for Image Caption Generation

Applied Sciences, 2020
In this work, we propose a novel priors-based attention neural network (PANN) for image captioning, which aims at incorporating two kinds of priors, i.e., the probabilities being mentioned for local region proposals (PBM priors) and part-of-speech clues ...
Pei Liu, Dezhong Peng, Ming Zhang
doaj +1 more source

Vision Transformer and Language Model Based Radiology Report Generation

IEEE Access, 2023
Recent advancements in transformers exploited computer vision problems which results in state-of-the-art models. Transformer-based models in various sequence prediction tasks such as language translation, sentiment classification, and caption generation ...
Mashood Mohammad Mohsan +5 more
doaj +1 more source

Sparse Adversarial Examples Attacking on Video Captioning Model [PDF]

Jisuanji kexue, 2023
Despite the fact that multi-modal deep learning such as image captioning model has been proved to be vulnerable to adversarial examples,the adversarial susceptibility in video caption generation is under-examined.There are two main reasons for this.On ...
QIU Jiangxing, TANG Xueming, WANG Tianmei, WANG Chen, CUI Yongquan, LUO Ting
doaj +1 more source

image captioning
deep learning
natural language processing

fos: computer and information sciences
computer vision
caption generation

computer vision and pattern recognition cs.cv
machine learning
image caption