Results 21 to 30 of about 20,321 (258)
A novel Multi-Layer Attention Framework for visual description prediction using bidirectional LSTM
The massive influx of text, images, and videos to the internet has recently increased the challenge of computer vision-based tasks in big data. Integrating visual data with natural language to generate video explanations has been a challenge for decades.
Dinesh Naik, C. D. Jaidhar
doaj +1 more source
Methodologies that utilize Deep Learning offer great potential for applications that automatically attempt to generate captions or descriptions about images and video frames.
Soheyla Amirian +3 more
doaj +1 more source
Evaluation metrics for video captioning: A survey
Automatic evaluation metrics play an important role in assessing video captioning systems. Popular metrics used for assessing such approaches are based on word matching and may fail to evaluate the quality of automatically generated captions due to ...
Andrei de Souza Inácio +1 more
doaj +1 more source
Video Content Caption Generation Based on ViT and Semantic Guidance [PDF]
This paper proposes a video captioning method based on Vision Transformer(ViT) and semantic guidance to alleviate the problems of poor readability and low accuracy of caption text generated by exsisting video content captioning models.First,the visual ...
ZHAO Hong, CHEN Zhiwen, GUO Lan, AN Dong
doaj +1 more source
While describing visual data is a trivial task for humans, it is an intricate task for a computer. This is even more challenging if the visual data is a video. Comprehending a video and describing it is called Video Captioning.
Khushboo Khurana, Umesh Deshpande
doaj +1 more source
Video Captions Benefit Everyone [PDF]
Video captions, also known as same-language subtitles, benefit everyone who watches videos (children, adolescents, college students, and adults). More than 100 empirical studies document that captioning a video improves comprehension of, attention to, and memory for the video.
openaire +2 more sources
With the advancement of the technological field, day by day, people from around the world are having easier access to internet abled devices, and as a result, video data is growing rapidly. The increase of portable devices such as various action cameras,
Shakil Ahmed +10 more
doaj +1 more source
Thinking Hallucination for Video Captioning
18 ...
Ullah, Nasib, Mohanta, Partha Pratim
openaire +2 more sources
As an increasingly popular format of input, the affordances of audio-visual materials have been widely studied. Past research has provided evidence that audio-visual input combined with different captioning strategies could benefit learners in terms of ...
Long He
doaj +1 more source
Parallel Dense Video Caption Generation with Multi-Modal Features
The task of dense video captioning is to generate detailed natural-language descriptions for an original video, which requires deep analysis and mining of semantic captions to identify events in the video. Existing methods typically follow a localisation-
Xuefei Huang +3 more
doaj +1 more source

