Automatic Generation of Grounded Visual Questions [PDF]
In this paper, we propose the first model to be able to generate visually grounded questions with diverse types for a single image. Visual question generation is an emerging topic which aims to ask questions in natural language based on visual input.
Qu, Lizhen +4 more
core +1 more source
Multi-Band Image Caption Generation Method Based on Feature Fusion [PDF]
This study proposes a multi-band detection image caption generation method based on feature fusion to address the common problem of poor performance in describing nighttime scenes, occluded target scenes, and captured blurred images in existing image ...
HE Shan, LIN Suzhen, WANG Yanbo, LI Dawei
doaj +1 more source
Middle-Level Attribute-Based Language Retouching for Image Caption Generation
Image caption generation is an attractive research topic that focuses on generating natural language sentences to describe the visual content of a given image. It is an interdisciplinary subject combining computer vision (CV) and natural language processing (NLP).
Zhibin Guan +4 more
doaj +1 more source
Image–text coherence and its implications for multimodal AI
Human communication often combines imagery and text into integrated presentations, especially online. In this paper, we show how image–text coherence relations can be used to model the pragmatics of image–text presentations in AI systems.
Malihe Alikhani +2 more
doaj +1 more source
Review on Image Caption Generation
With the rapid development of deep learning and AI, along with computer vision and natural language processing, image captioning has become an interesting and complex task. Image caption generation is the process of generating a textual description of a given image, and it is a challenging task because it requires an understanding of the objects in the image. If the machine will ...
Aishwarya Mark +4 more
openaire +1 more source
Automatic caption generation with attention mechanisms aims at generating more descriptive captions containing coarser to finer semantic contents in the image.
Reshmi Sasibhooshan +2 more
doaj +1 more source
Deep learning is a relatively new field, and it has attracted a great deal of attention because it offers higher accuracy in recognizing objects than ever before. NLP is another field that has had an immense effect on our lives. NLP has made considerable progress, from creating a lucid summary of texts to investigation of ...
openaire +1 more source
Entity-aware Image Caption Generation [PDF]
In proceedings of EMNLP ...
Lu, Di +4 more
openaire +2 more sources
Long-text caption generation for surgical image with a concept retrieval augmented large multimodal model [PDF]
Jiquan Liu +6 more
doaj +1 more source
Learning a Recurrent Visual Representation for Image Caption Generation
In this paper we explore the bi-directional mapping between images and their sentence-based descriptions. We propose learning this mapping using a recurrent neural network.
Chen, Xinlei, Zitnick, C. Lawrence
core +1 more source

