Delving Deeper into Convolutional Networks for Learning Video Representations
We propose an approach to learn spatio-temporal features in videos from intermediate visual representations we call "percepts" using Gated-Recurrent-Unit Recurrent Networks (GRUs). Our method relies on percepts that are extracted from all levels of a deep ...
Ballas, Nicolas +3 more
core
Grounded Video Caption Generation
We propose a new task, dataset and model for grounded video caption generation. This task unifies captioning and object grounding in video, where the objects in the caption are grounded in the video via temporally consistent bounding boxes. We introduce the following contributions.
Kazakos, Evangelos +2 more
openaire +2 more sources
Abstract Locomotion consumes a large proportion of individual energy budgets and may impose energetic constraints on other fitness‐related traits, particularly under variable environmental conditions.
Miki Jahn, Frank Seebacher
wiley +1 more source
Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
Accepted in EMNLP ...
Jeon, MinJu +4 more
openaire +2 more sources
What does it take to turn a tool into a talking tool and that into an ultimate authority? Generative artificial intelligence (GenAI) in its diverse forms, such as large language models (LLMs), is celebrated as a useful tool. But LLM‐based conversational agents, or chatbots, the software applications through which ordinary users are likely to engage ...
Webb Keane
wiley +1 more source
Retrieval-Augmented Egocentric Video Captioning
CVPR 2024. Project page is available at: https://jazzcharles.github.io/Egoinstructor/
Xu, Jilan +6 more
openaire +2 more sources
ABSTRACT This article examines a wave of Orientalism‐inspired food commercials that appeared on television in France between 1975 and 2000. Older commercials for couscous were more banal, emphasizing a given product's superiority or affordability. Around 1975, however, there was a concerted shift in the advertising; new spots contained exoticized ...
Kelly Ricciardi Colvin
wiley +1 more source
Approach of dense video captioning based on multimodal memory knowledge
Dense video captioning aims to localize events in an untrimmed video and generate a corresponding caption for each meaningful event. Existing methods mainly utilize the source video input to generate captions, and these methods are unable to capture the
FANG Haojie +3 more
doaj
Local feature‐based video captioning with multiple classifier and CARU‐attention
Video captioning aims to identify multiple objects and their behaviours in a video event and generate captions for the current scene. This task aims to generate a detailed description of the current video in real‐time using natural language, which ...
Sio‐Kei Im, Ka‐Hou Chan
doaj +1 more source
Generative AI Use by Capital Market Information Intermediaries: Evidence from Seeking Alpha
ABSTRACT We study the use of generative AI for firm‐specific financial analysis on the Seeking Alpha platform. After the initial launch of ChatGPT in November 2022, the share of AI‐generated articles rose sharply to 13.5% of all articles, then declined in late 2023 after Seeking Alpha equated the use of AI to plagiarism and announced a prohibition on ...
Mark T. Bradshaw +3 more
wiley +1 more source

