Video description - Open Access .click

Results 1 to 10 of about 1,042,157 (292)

Video Description using Bidirectional Recurrent Neural Networks [PDF]

, 2016
Although traditionally used in the machine translation field, the encoder-decoder framework has been recently applied for the generation of video and image descriptions.
Bolaños, Marc +3 more
core +2 more sources

Content-Based Video Description for Automatic Video Genre Categorization [PDF]

, 2012
International audienceIn this paper, we propose an audio-visual approach to video genre categorization. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At temporal structural level, we asses
A.F. Smeaton +5 more
core +4 more sources

Video captioning based on vision transformer and reinforcement learning [PDF]

PeerJ Computer Science, 2022
Global encoding of visual features in video captioning is important for improving the description accuracy. In this paper, we propose a video captioning method that combines Vision Transformer (ViT) and reinforcement learning.
Hong Zhao, Zhiwen Chen, Lan Guo, Zeyu Han +3 more
doaj +2 more sources

DeepRide: Dashcam Video Description Dataset for Autonomous Vehicle Location-Aware Trip Description

IEEE Access, 2022
Video description is one of the most challenging task in the combined domain of computer vision and natural language processing. Captions for various open and constrained domain videos have been generated in the recent past but descriptions for driving ...
Ghazala Rafiq +4 more
doaj +1 more source

Video Description: Datasets & Evaluation Metrics

IEEE Access, 2021
Rapid expansion and the novel phenomenon of deep learning have manifested a variety of proposals and concerns in the area of video description, particularly in the recent past.
Muhammad Rafiq, Ghazala Rafiq, Gyu Sang Choi +2 more
doaj +1 more source

CINEMATOGRAPHY AS A DESCRIPTIVE PHILOSOPHY "ON THE CINEFILM" OF IMAGES OF HOMO: FROM THE FIXATION OF IMAGES (HOMO PHOTOGRAPHICUS) TO THE TREACHERY OF IMAGES (CONTEMPORARY HOMO VIDENS)

Вісник Харківського національного університету імені В.Н. Каразіна. Серия: Теорія культури та філософіі науки, 2021
The article explores art of cinematography as an objectified cultural reality, in spatial and temporal structures of video description. Genesis of art of photography has changed the habits of human perception and thinking process – from photographic ...
Nadiia Korabliova, Hanna Chmil
doaj +1 more source

Grounded Video Description [PDF]

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Video description is one of the most challenging problems in vision and language understanding due to the large variability both on the video and language side. Models, hence, typically shortcut the difficulty in recognition and generate plausible sentences that are based on priors but are not necessarily grounded in the video.
Zhou, Luowei +4 more
openaire +2 more sources

Video Description Model Based on Temporal-Spatial and Channel Multi-Attention Mechanisms

Applied Sciences, 2020
Video description plays an important role in the field of intelligent imaging technology. Attention perception mechanisms are extensively applied in video description models based on deep learning.
Jie Xu +4 more
doaj +1 more source

Spectral Representation Learning and Fusion for Autonomous Vehicles Trip Description Exploiting Recurrent Transformer

IEEE Access, 2023
A thorough analysis and comprehension of the entire cue set in visual data are indispensable for an ideal video description model. As outlined in recent algorithm proposals, video descriptions have primarily been generated by learning RGB and optical ...
Ghazala Rafiq, Muhammad Rafiq, Gyu Sang Choi +2 more
doaj +1 more source

Identity-Aware Multi-sentence Video Description [PDF]

, 2020
Project link at https://sites.google.com/site/describingmovies/lsmdc-2019/
Park, Jae Sung, Darrell, Trevor, Rohrbach, Anna +2 more
openaire +2 more sources

fos: computer and information sciences
computer vision and pattern recognition cs.cv
telecommunication

video captioning
natural language processing
convolutional neural network

multiple description coding