Image captioning - Open Access .click

From Plane to Hierarchy: Deformable Transformer for Remote Sensing Image Captioning

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2023
With the growth of remote sensing images, automatically understanding image content has attracted the interest of many researchers in deep learning for remote sensing images.
Runyan Du +6 more
doaj +1 more source

Captioning Ultrasound Images Automatically

2019
We describe an automatic natural language processing (NLP)-based image captioning method to describe fetal ultrasound video content by modelling the vocabulary commonly used by sonographers and sonologists. The generated captions are similar to the words spoken by a sonographer when describing the scan experience in terms of visual content and ...
Alsharid, M +5 more
openaire +4 more sources

Image Generation from Caption

International Journal on Soft Computing, Artificial Intelligence and Applications, 2018
Generating images from a text description is as challenging as it is interesting. An adversarial network operates competitively, with its component networks acting as rivals of each other. Since the introduction of the Generative Adversarial Network, the field of computer vision has seen a great deal of development.
Mahima Pandya, Prof. Sonal Rami
openaire +1 more source

A Rapid Review of Image Captioning

JITeCS (Journal of Information Technology and Computer Science), 2021
Image captioning is an automatic process for generating text based on the content observed in an image. We conduct a review, create a framework, and build an application model. We divide image captioning into 4 categories based on input model, process model, output
Adriyendi Adriyendi
doaj +1 more source

Deconfounded Image Captioning: A Causal Retrospect [PDF]

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022
Dataset bias in vision-language tasks is becoming one of the main problems which hinders the progress of our community. Existing solutions lack a principled analysis about why modern image captioners easily collapse into dataset bias. In this paper, we present a novel perspective: Deconfounded Image Captioning (DIC), to find out the answer of this ...
Yang, Xu, Zhang, Hanwang, Cai, Jianfei
openaire +5 more sources

Automated Image Captioning Using Sparrow Search Algorithm With Improved Deep Learning Model

IEEE Access, 2023
Image captioning is a deep learning technique that intends to create and generate textual descriptions or captions for images. It integrates computer vision and natural language processing (NLP) to comprehend the visual content of an image and generate ...
Munya A. Arasi +5 more
doaj +1 more source

Exploring Multi-Level Attention and Semantic Relationship for Remote Sensing Image Captioning

IEEE Access, 2020
Remote sensing image captioning, which aims to understand high-level semantic information and the interactions of different ground objects, has emerged as a research topic in recent years.
Zhenghang Yuan, Xuelong Li, Qi Wang
doaj +1 more source

Image Representations and New Domains in Neural Image Captioning

2015
We examine the possibility that recent promising results in automatic caption generation are due primarily to language models. By varying image representation quality produced by a convolutional neural network, we find that a state-of-the-art neural ...
Hessel, Jack, Savva, Nicolas, Wilber, Michael J. +2 more
core +1 more source

A Systematic Literature Review on Using the Encoder-Decoder Models for Image Captioning in English and Arabic Languages

Applied Sciences, 2023
With the explosion of visual content on the Internet, creating captions for images has become a necessary task and an exciting topic for many researchers.
Ashwaq Alsayed +3 more
doaj +1 more source

Beyond Caption To Narrative: Video Captioning With Multiple Sentences

2016
Recent advances in the image captioning task have led to increasing interest in the video captioning task. However, most works on video captioning focus on generating a single input of aggregated features, which hardly deviates from the image captioning process
Harada, Tatsuya, Ohnishi, Katsunori, Shin, Andrew +2 more
core +1 more source

deep learning
computer vision
transformer
natural language processing
lstm
visual attention
attention mechanism
image caption