Results 41 to 50 of about 69,316 (271)
Evaluating Automatically Generated Phoneme Captions for Images [PDF]
Accepted at ...
van der Hout, Justin (author) +3 more
openaire +3 more sources
Semantically Invariant Text-to-Image Generation
Image captioning has demonstrated models that are capable of generating plausible text given input images or videos. Further, recent work in image generation has shown significant improvements in image quality when text is used as a prior.
Dominguez, Miguel +6 more
core +1 more source
Automatic image caption generation using deep learning
Abstract Image captioning is an interesting and challenging task with applications in diverse domains such as image retrieval, organizing and locating images of users’ interest etc. It has huge potential for replacing manual caption generation for images and is especially suitable for large scale image data.
Akash Verma +3 more
openaire +1 more source
An Approach to Generate a Caption for an Image Collection Using Scene Graph Generation
Summarization is a challenging task that aims to generate a summary by grasping common information of a given set of information. Text summarization is a popular task of determining the topic or generating a textual summary of documents.
Itthisak Phueaksri +4 more
doaj +1 more source
What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator?
In neural image captioning systems, a recurrent neural network (RNN) is typically viewed as the primary `generation' component. This view suggests that the image features should be `injected' into the RNN.
Camilleri, Kenneth P. +2 more
core +1 more source
Self-critical Sequence Training for Image Captioning
Recently it has been shown that policy-gradient methods for reinforcement learning can be utilized to train deep end-to-end systems directly on non-differentiable metrics for the task at hand.
Goel, Vaibhava +4 more
core +1 more source
Automatic creation of image descriptions, i.e. captioning of images, is an important topic in artificial intelligence (AI) that bridges the gap between computer vision (CV) and natural language processing (NLP).
Mukesh Kalla +3 more
doaj +1 more source
Multi-Task Video Captioning with Video and Entailment Generation
Video captioning, the task of describing the content of a video, has seen some promising improvements in recent years with sequence-to-sequence models, but accurately learning the temporal and logical dynamics involved in the task still remains a ...
Bansal, Mohit, Pasunuru, Ramakanth
core +1 more source
Remote Monitoring in Myasthenia Gravis: Exploring Symptom Variability
ABSTRACT Background Myasthenia gravis (MG) is a rare, autoimmune disorder characterized by fluctuating muscle weakness and potential life‐threatening crises. While continuous specialized care is essential, access barriers often delay timely interventions. To address this, we developed MyaLink, a telemedical platform for MG patients.
Maike Stein +13 more
wiley +1 more source
Semantic bottleneck for computer vision tasks
This paper introduces a novel method for the representation of images that is semantic by nature, addressing the question of computation intelligibility in computer vision tasks.
Gabriëlle Ras +4 more
core +1 more source

