Results 31 to 40 of about 24,451 (274)
From Plane to Hierarchy: Deformable Transformer for Remote Sensing Image Captioning
With the growth of remote sensing images, understanding image content automatically has attracted many researchers' interests in deep learning for remote sensing image.
Runyan Du +6 more
doaj +1 more source
Captioning Ultrasound Images Automatically
We describe an automatic natural language processing (NLP)-based image captioning method to describe fetal ultrasound video content by modelling the vocabulary commonly used by sonographers and sonologists. The generated captions are similar to the words spoken by a sonographer when describing the scan experience in terms of visual content and ...
Alsharid, M +5 more
openaire +4 more sources
Generating images from a text description is as challenging as it is interesting. The Adversarial network performs in a competitive fashion where the networks are the rivalry of each other. With the introduction of Generative Adversarial Network, lots of development is happening in the field of Computer Vision.
Mahima Pandya, Prof. Sonal Rami
openaire +1 more source
A Rapid Review of Image Captioning
Image captioning is an automatic process for generating text based on the content observed in an image. We do review, create framework, and build application model. We review image captioning into 4 categories based on input model, process model, output
Adriyendi Adriyendi
doaj +1 more source
Deconfounded Image Captioning: A Causal Retrospect [PDF]
Dataset bias in vision-language tasks is becoming one of the main problems which hinders the progress of our community. Existing solutions lack a principled analysis about why modern image captioners easily collapse into dataset bias. In this paper, we present a novel perspective: Deconfounded Image Captioning (DIC), to find out the answer of this ...
Yang, Xu, Zhang, Hanwang, Cai, Jianfei
openaire +5 more sources
Automated Image Captioning Using Sparrow Search Algorithm With Improved Deep Learning Model
Image captioning is a deep learning technique that intends to create and generate textual descriptions or captions for images. It integrates computer vision and natural language processing (NLP) to comprehend the visual content of an image and generate ...
Munya A. Arasi +5 more
doaj +1 more source
Exploring Multi-Level Attention and Semantic Relationship for Remote Sensing Image Captioning
Remote sensing image captioning, which aims to understand high-level semantic information and interactions of different ground objects, is a new emerging research topic in recent years.
Zhenghang Yuan, Xuelong Li, Qi Wang
doaj +1 more source
Image Representations and New Domains in Neural Image Captioning
We examine the possibility that recent promising results in automatic caption generation are due primarily to language models. By varying image representation quality produced by a convolutional neural network, we find that a state-of-the-art neural ...
Hessel, Jack +2 more
core +1 more source
With the explosion of visual content on the Internet, creating captions for images has become a necessary task and an exciting topic for many researchers.
Ashwaq Alsayed +3 more
doaj +1 more source
Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Recent advances in image captioning task have led to increasing interests in video captioning task. However, most works on video captioning are focused on generating single input of aggregated features, which hardly deviates from image captioning process
Harada, Tatsuya +2 more
core +1 more source

