Results 21 to 30 of about 104,311 (276)
Abstract: Image captioning aims to automatically generate a sentence description for an image. Our project model will take an image as input and generate an English sentence as output, describing the contents of the image. It has attractedmuch research attention in cognitive computing in the recent years.
Vani M, Priya S
openaire +1 more source
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
Recently, automatic image caption generation has been an important focus of the work on multimodal translation task. Existing approaches can be roughly categorized into two classes, top-down and bottom-up, the former transfers the image information ...
Ling Cheng +4 more
doaj +1 more source
Generating images from a text description is as challenging as it is interesting. The Adversarial network performs in a competitive fashion where the networks are the rivalry of each other. With the introduction of Generative Adversarial Network, lots of development is happening in the field of Computer Vision.
Mahima Pandya, Prof. Sonal Rami
openaire +1 more source
Automatic Intelligence Caption Generator
An Image Caption Generator is a sophisticated AI system that combines computer vision and natural language processing to automatically create descriptive textual captions for images. This technology utilizes deep learning, particularly Convolutional Neural Networks (CNNs), to analyze and extract meaningful visual features from the input image.
Trushna Kapadnis +4 more
openaire +1 more source
Medical image captioning via generative pretrained transformers
Abstract The proposed model for automatic clinical image caption generation combines the analysis of radiological scans with structured patient information from the textual records. It uses two language models, the Show-Attend-Tell and the GPT-3, to generate comprehensive and descriptive radiology records.
Alexander Selivanov +5 more
openaire +4 more sources
Learn and Tell: Learning Priors for Image Caption Generation
In this work, we propose a novel priors-based attention neural network (PANN) for image captioning, which aims at incorporating two kinds of priors, i.e., the probabilities being mentioned for local region proposals (PBM priors) and part-of-speech clues ...
Pei Liu, Dezhong Peng, Ming Zhang
doaj +1 more source
Self-Learning for Few-Shot Remote Sensing Image Captioning
Large-scale caption-labeled remote sensing image samples are expensive to acquire, and the training samples available in practical application scenarios are generally limited.
Haonan Zhou +3 more
doaj +1 more source
Automatic Generation of Grounded Visual Questions [PDF]
In this paper, we propose the first model to be able to generate visually grounded questions with diverse types for a single image. Visual question generation is an emerging topic which aims to ask questions in natural language based on visual input.
Qu, Lizhen +4 more
core +1 more source
Attentive Semantic Video Generation Using Captions [PDF]
This paper proposes a network architecture to perform variable length semantic video generation using captions. We adopt a new perspective towards video generation where we allow the captions to be combined with the long-term and short-term dependencies between video frames and thus generate a video in an incremental manner. Our experiments demonstrate
Marwah, Tanya +2 more
openaire +2 more sources
Automatic Caption Generation for News Images [PDF]
This paper is concerned with the task of automatically generating captions for images, which is important for many image-related applications. Examples include video and image retrieval as well as the development of tools that aid visually impaired individuals to access pictorial information.
Yansong, Feng, Mirella, Lapata
openaire +2 more sources

