Dense video captioning - Open Access .click

Results 61 to 70 of about 5,597 (161)

Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Accepted in EMNLP ...
Jeon, MinJu +4 more
openaire +2 more sources

ADVISE: Symbolism and External Knowledge for Decoding Advertisements

, 2018
In order to convey the most content in their limited space, advertisements embed references to outside knowledge via symbolism. For example, a motorcycle stands for adventure (a positive property the ad wants associated with the product being sold), and ...
A Kembhavi +10 more
core +1 more source

Indirect Striatal Projection Neurons Drive a D2 Receptor‐Dependent Pathway to Dyskinesia and Dystonia

Movement Disorders, EarlyView.
D2 receptor ablation in indirect‐pathway striatal neurons reduces or abolishes dyskinetic and dystonic behaviors induced by L‐DOPA or D2 receptor agonists, respectively. Contralateral turning is reduced, while forward locomotion is increased. These effects are associated with modulation of neuronal activity in dorsal striatum and external globus ...
Laura Andreoli +5 more
wiley +1 more source

Audio Caption: Listen and Tell

, 2019
Increasing amount of research has shed light on machine perception of audio events, most of which concerns detection and classification tasks. However, human-like perception of audio scenes involves not only detecting and classifying audio sounds, but ...
Dinkel, Heinrich, Wu, Mengyue, Yu, Kai
core +1 more source

Low Temperature Site‐Specific Pulsed Laser Annealing of MoS2

Small, EarlyView.
The application of laser pulses, of extremely short duration, is investigated as a potential new method to modify the atomic structure of ultrathin 2D materials for use in the creation of future electrical devices. The process is efficient, offering a site‐specific functionality, where regions of an electronic device that only requires annealing is ...
Nazar Farid +13 more
wiley +1 more source

Compositional Gradient–Engineered Ti–WO3 Films for Simultaneous Enhancement of Coloration Efficiency and Mechanical Robustness

Small, EarlyView.
A gradient Ti‐doped WO3 electrochromic film is developed via dynamic co‐sputtering, enabling depth‐dependent band engineering and strain regulation. The gradient structure induces a built‐in electric field across the film thickness that enhances coupled ion–electron transport while improving mechanical robustness.
Fang Luo +4 more
wiley +1 more source

Optimizing the Effectiveness of Captioned Viewing for Incidental Second Language Vocabulary Learning: The Effects of Repeated Viewing and Reading Fluency

TESOL Quarterly, EarlyView.
Abstract This study examined the effects of repeated viewing and reading fluency on incidental second language vocabulary acquisition through captioned video exposure. A total of 149 Japanese EFL learners watched a short animation with or without captions, varying in the number of repetitions (once, twice, or three times).
Satsuki Kurokawa, Takumi Uchihara
wiley +1 more source

Visual Entailment Task for Visually-Grounded Language Learning [PDF]

, 2019
We introduce a new inference task - Visual Entailment (VE) - which differs from traditional Textual Entailment (TE) tasks whereby a premise is defined by an image, rather than a natural language sentence as in TE tasks.
Doran, Derek, Kadav, Asim, Lai, Farley, Xie, Ning +3 more
core +2 more sources

Increased energetic cost of movement reduces reproductive output in zebrafish at different temperatures and water flow rates

Functional Ecology, EarlyView.
Read the free Plain Language Summary for this article on the Journal blog. Abstract Locomotion consumes a large proportion of individual energy budgets and may impose energetic constraints on other fitness‐related traits particularly under variable environmental conditions.
Miki Jahn, Frank Seebacher
wiley +1 more source

MultiCOIN: Multi‐Modal COntrollable INbetweening

Computer Graphics Forum, EarlyView.
Abstract Video inbetweening creates smooth transitions between two frames making it an indispensable tool for video editing and longform video synthesis. Existing methods struggle with large or complex motion and offer limited control over intermediate frames, often misaligning with user intent.
M. Tanveer +6 more
wiley +1 more source

video captioning
feature extraction