
Image Captioning Through Image Transformer [PDF]

open access: yes, 2021
Automatic captioning of images is a task that combines the challenges of image analysis and text generation. One important aspect of captioning is the notion of attention: how to decide what to describe and in which order. Inspired by the successes in text analysis and translation, previous work has proposed the transformer architecture for ...
He, Sen   +5 more
openaire   +3 more sources

Image Captioning Using Motion-CNN with Object Detection

open access: yes, Sensors, 2021
Automatic image captioning has many important applications, such as the depiction of visual contents for visually impaired people or the indexing of images on the internet.
Kiyohiko Iwamura   +4 more
doaj   +1 more source

Guiding image captioning models toward more specific captions

open access: yes, 2023 IEEE/CVF International Conference on Computer Vision (ICCV), 2023
Image captioning is conventionally formulated as the task of generating captions for images that match the distribution of reference image-caption pairs. However, reference captions in standard captioning datasets are short and may not uniquely identify the images they describe. These problems are further exacerbated when models are trained directly on
Kornblith, Simon   +3 more
openaire   +2 more sources

Defoiling Foiled Image Captions [PDF]

open access: yes, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), 2018
In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2018)
Madhyastha, P., Wang, J.K., Specia, L.
openaire   +4 more sources

VSAM-Based Visual Keyword Generation for Image Caption

open access: yes, IEEE Access, 2021
Image captioning aims to understand and describe visual content, and is expected to be applied to automatic news reporting in the future. In recent years, there has been increasing interest in an encoder-decoder framework for image captioning: the encoder ...
Suya Zhang   +3 more
doaj   +1 more source

Image-Caption Model Based on Fusion Feature

open access: yes, Applied Sciences, 2022
The encoder–decoder framework is the main frame of image captioning. The convolutional neural network (CNN) is usually used to extract grid-level features of the image, and the graph convolutional neural network (GCN) is used to extract the image’s ...
Yaogang Geng   +3 more
doaj   +1 more source

Image Captioning

open access: yes, 2023
This is a technical report that was written after the completion of a deep learning image captioning project in 2018.
Ibrahim, Ahmed Salah Tawfik   +1 more
openaire   +1 more source

A core region captioning framework for automatic video understanding in story video contents

open access: yes, International Journal of Engineering Business Management, 2022
Due to the rapid increase in images and image data, research on the visual analysis of such unstructured data has recently become active. One of the representative image captioning models, the DenseCap model, extracts various regions in
Hyesun Suh   +3 more
doaj   +1 more source

Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation

open access: yes, IEEE Access, 2020
Recently, automatic image caption generation has been an important focus of work on the multimodal translation task. Existing approaches can be roughly categorized into two classes, top-down and bottom-up; the former transfers the image information ...
Ling Cheng   +4 more
doaj   +1 more source
