Results 21 to 30 of about 1,042,157 (292)
Towards a Video Game Description Language [PDF]
Dagstuhl Follow ...
Ebner, Marc +5 more
openaire +3 more sources
Attention-Based Convolutional LSTM for Describing Video
Video description technique has been widely used in the computer community for many applications. The typical approaches are mainly based on the encode-decode framework: the fixed-length video representation vectors are extracted by the encoder using the
Zhongyu Liu +4 more
doaj +1 more source
Exploring deep learning approaches for video captioning: A comprehensive review
While humans can easily describe visual data at varying levels of detail, the same task presents a significant challenge for machines. This challenge becomes even more complex when dealing with video data.
Adel Jalal Yousif, Mohammed H. Al-Jammas
doaj +1 more source
Multiple description video coding based on zero padding [PDF]
This paper proposes a simple multiple description video coding approach based on zero padding theory. It is completely based on pre- and post-processing, which require no modifications to the source codec.
Bull, DR +3 more
core +2 more sources
Learning to detect video events from zero or very few video examples [PDF]
In this work we deal with the problem of high-level event detection in video. Specifically, we study the challenging problems of i) learning to detect video events from solely a textual description of the event, without using any positive video examples,
Galanopoulos, Damianos +3 more
core +2 more sources
Human uses communication language either by written, spoken or typed to describe visual the world around them. So, the study of text description for any video goes increasing.
Vishakha Wankhede, Ramesh M Kagalkar
doaj +1 more source
Multiple description video coding for stereoscopic 3D [PDF]
In this paper, we propose an MDC schemes for stereoscopic 3D video. In the literature, MDC has previously been applied in 2D video but not so much in 3D video.
Abdul Karim, H +4 more
core +1 more source
Video description method based on multidimensional and multimodal information
In order to solve the problem of complex information representation in automatic video description tasks,a multi-dimensional and multi-modal visual feature extraction and fusion method was proposed.Firstly,multi-dimensional features such as static and ...
Enjie DING +3 more
doaj +2 more sources
Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism [PDF]
With the rapid development of mobile network and we-media platform,lots of video and text information are generated,which bring an urgent demand for video-text cross-modal entity resolution.In order to improve the performance of video-text cross-modal ...
ZENG Zhi-xian, CAO Jian-jun, WENG Nian-feng, JIANG Guo-quan, XU Bin
doaj +1 more source
Fine-grained Audible Video Description
We explore a new task for audio-visual-language modeling called fine-grained audible video description (FAVD). It aims to provide detailed textual descriptions for the given audible videos, including the appearance and spatial locations of each object, the actions of moving objects, and the sounds in videos.
Shen, Xuyang +11 more
openaire +2 more sources

