Results 1 to 10 of about 1,042,157 (292)
Video Description using Bidirectional Recurrent Neural Networks [PDF]
Although traditionally used in the machine translation field, the encoder-decoder framework has been recently applied for the generation of video and image descriptions.
Bolaños, Marc +3 more
core +2 more sources
Content-Based Video Description for Automatic Video Genre Categorization [PDF]
International audienceIn this paper, we propose an audio-visual approach to video genre categorization. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At temporal structural level, we asses
A.F. Smeaton +5 more
core +4 more sources
Video captioning based on vision transformer and reinforcement learning [PDF]
Global encoding of visual features in video captioning is important for improving the description accuracy. In this paper, we propose a video captioning method that combines Vision Transformer (ViT) and reinforcement learning.
Hong Zhao +3 more
doaj +2 more sources
DeepRide: Dashcam Video Description Dataset for Autonomous Vehicle Location-Aware Trip Description
Video description is one of the most challenging task in the combined domain of computer vision and natural language processing. Captions for various open and constrained domain videos have been generated in the recent past but descriptions for driving ...
Ghazala Rafiq +4 more
doaj +1 more source
Video Description: Datasets & Evaluation Metrics
Rapid expansion and the novel phenomenon of deep learning have manifested a variety of proposals and concerns in the area of video description, particularly in the recent past.
Muhammad Rafiq +2 more
doaj +1 more source
The article explores art of cinematography as an objectified cultural reality, in spatial and temporal structures of video description. Genesis of art of photography has changed the habits of human perception and thinking process – from photographic ...
Nadiia Korabliova, Hanna Chmil
doaj +1 more source
Grounded Video Description [PDF]
Video description is one of the most challenging problems in vision and language understanding due to the large variability both on the video and language side. Models, hence, typically shortcut the difficulty in recognition and generate plausible sentences that are based on priors but are not necessarily grounded in the video.
Zhou, Luowei +4 more
openaire +2 more sources
Video Description Model Based on Temporal-Spatial and Channel Multi-Attention Mechanisms
Video description plays an important role in the field of intelligent imaging technology. Attention perception mechanisms are extensively applied in video description models based on deep learning.
Jie Xu +4 more
doaj +1 more source
A thorough analysis and comprehension of the entire cue set in visual data are indispensable for an ideal video description model. As outlined in recent algorithm proposals, video descriptions have primarily been generated by learning RGB and optical ...
Ghazala Rafiq +2 more
doaj +1 more source
Identity-Aware Multi-sentence Video Description [PDF]
Project link at https://sites.google.com/site/describingmovies/lsmdc-2019/
Park, Jae Sung +2 more
openaire +2 more sources

