Knowledge-rich Image Gist Understanding Beyond Literal Meaning
We investigate the problem of understanding the message (gist) conveyed by images and their captions as found, for instance, on websites or news articles. To this end, we propose a methodology to capture the meaning of image-caption pairs on the basis of
Dietz, Laura +4 more
core +1 more source
Component based comparative analysis of each module in image captioning
Image captioning is the task of generating a new caption from training data consisting of images and captions. Since existing deep learning models are black boxes, it is crucial to analyze the influence of each module in order to understand the model. In this paper, we
Seoung-Ho Choi +2 more
doaj +1 more source
STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset
In recent years, automatic generation of image descriptions (captions), that is, image captioning, has attracted a great deal of attention. In this paper, we particularly consider generating Japanese captions for images.
Shigeto, Yutaro +2 more
core +1 more source
In the modern era, image captioning has become one of the most widely required tools. Moreover, there are built-in applications that generate a caption for a given image; all of this is done with the help of deep neural network models. The process of generating a description of an image is called image captioning.
Megha J Panicker +3 more
openaire +1 more source
Geo-Aware Image Caption Generation
Standard image caption generation systems produce generic descriptions of images and do not utilize any contextual information or world knowledge. In particular, they are unable to generate captions that contain references to the geographic context of an image, for example, the location where a photograph is taken or relevant geographic objects around ...
Nikiforova, Sofia +3 more
openaire +3 more sources
Middle-Level Attribute-Based Language Retouching for Image Caption Generation
Image caption generation is an attractive research area that focuses on generating natural language sentences to describe the visual content of a given image. It is an interdisciplinary subject combining computer vision (CV) and natural language processing (NLP)
Zhibin Guan +4 more
doaj +1 more source
arXiv admin note: text overlap with arXiv:1609.06647 by other ...
Mullachery, Vikram, Motwani, Vishal
openaire +2 more sources
Can Audio Captions Be Evaluated With Image Caption Metrics?
ICASSP ...
Zhou, Zelin +5 more
openaire +2 more sources
Single‐molecule DNA flow‐stretch assays for high‐throughput DNA–protein interaction studies
We describe an optimised single‐molecule DNA flow‐stretch assay that visualises DNA–protein interactions in real time. Linear DNA fragments are tethered to a surface and stretched by buffer flow for fluorescence imaging. Using λ and φX174 DNA, this protocol enhances reproducibility and accessibility, providing a versatile approach for studying diverse ...
Ayush Kumar Ganguli +8 more
wiley +1 more source
Incorporating object counts into remote sensing image captioning
Existing methods for remote sensing image captioning tend to describe a remote sensing image using generic language that lacks specific information about object counts.
Zihao Ni, Zhaoyun Zong, Peng Ren
doaj +1 more source

