Video captioning - Open Access .click

Results 111 to 120 of about 20,321 (258)

Delving Deeper into Convolutional Networks for Learning Video Representations

, 2016
We propose an approach to learn spatio-temporal features in videos from intermediate visual representations we call "percepts" using Gated-Recurrent-Unit Recurrent Networks (GRUs).Our method relies on percepts that are extracted from all level of a deep ...
Ballas, Nicolas, Courville, Aaron, Pal, Chris, Yao, Li +3 more
core

Grounded Video Caption Generation

We propose a new task, dataset and model for grounded video caption generation. This task unifies captioning and object grounding in video, where the objects in the caption are grounded in the video via temporally consistent bounding boxes. We introduce the following contributions.
Kazakos, Evangelos, Schmid, Cordelia, Sivic, Josef +2 more
openaire +2 more sources

Increased energetic cost of movement reduces reproductive output in zebrafish at different temperatures and water flow rates

Functional Ecology, EarlyView.
Read the free Plain Language Summary for this article on the Journal blog. Abstract Locomotion consumes a large proportion of individual energy budgets and may impose energetic constraints on other fitness‐related traits particularly under variable environmental conditions.
Miki Jahn, Frank Seebacher
wiley +1 more source

Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Accepted in EMNLP ...
Jeon, MinJu +4 more
openaire +2 more sources

From talking tools to metahumans: social interaction, semiotic skill, and the authority of AI chatbots Des outils parlants aux métahumains : interactions sociales, compétences sémiotiques et autorité des robots conversationnels

Journal of the Royal Anthropological Institute, EarlyView.
What does it take to turn a tool into a talking tool and that into an ultimate authority? Generative artificial intelligence (GenAI) in its diverse forms, such as large language models (LLMs), is celebrated as a useful tool. But LLM‐based conversational agents, or chatbots, the software applications through which ordinary users are likely to engage ...
Webb Keane
wiley +1 more source

Retrieval-Augmented Egocentric Video Captioning

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
CVPR 2024. Project page is available at: https://jazzcharles.github.io/Egoinstructor/
Xu, Jilan +6 more
openaire +2 more sources

‘The Good Couscous That Pleases Us!’: The Meanings of Enduring Imperialist Imagery in Postcolonial French Food Advertising, 1970–2000

Gender &History, EarlyView.
ABSTRACT This article examines a wave of Orientalism‐inspired food commercials that appeared on television in France between 1975 and 2000. Older commercials for couscous were more banal, emphasizing a given product's superiority or affordability. Around 1975, however, there was a concerted shift in the advertising; new spots contained exoticized ...
Kelly Ricciardi Colvin
wiley +1 more source

Approach of dense video captioning based on multimodal memory knowledge

Dianxin kexue
Dense video captioning aims to localize events in an untrimmed video and generate a corresponding captions for each meaningful event. Existing methods mainly utilize the source video input to generate captions, and these methods are unable to capture the
FANG Haojie, LI Yonggang, CAO Zongrui, YE Lihua +3 more
doaj

Local feature‐based video captioning with multiple classifier and CARU‐attention

IET Image Processing
Video captioning aims to identify multiple objects and their behaviours in a video event and generate captions for the current scene. This task aims to generate a detailed description of the current video in real‐time using natural language, which ...
Sio‐Kei Im, Ka‐Hou Chan
doaj +1 more source

Generative AI Use by Capital Market Information Intermediaries: Evidence from Seeking Alpha

Journal of Accounting Research, EarlyView.
ABSTRACT We study the use of generative AI for firm‐specific financial analysis on the Seeking Alpha platform. After the initial launch of ChatGPT in November 2022, the share of AI‐generated articles rose sharply to 13.5% of all articles, then declined in late 2023 after Seeking Alpha equated the use of AI to plagiarism and announced a prohibition on ...
Mark T. Bradshaw +3 more
wiley +1 more source

deep learning
computer vision
dense video captioning

lstm
natural language processing
video description

arabic video captioning