Results 61 to 70 of about 5,597 (161)
Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
Accepted in EMNLP ...
Jeon, MinJu +4 more
openaire +2 more sources
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
In order to convey the most content in their limited space, advertisements embed references to outside knowledge via symbolism. For example, a motorcycle stands for adventure (a positive property the ad wants associated with the product being sold), and ...
A Kembhavi +10 more
core +1 more source
D2 receptor ablation in indirect‐pathway striatal neurons reduces or abolishes dyskinetic and dystonic behaviors induced by L‐DOPA or D2 receptor agonists, respectively. Contralateral turning is reduced, while forward locomotion is increased. These effects are associated with modulation of neuronal activity in dorsal striatum and external globus ...
Laura Andreoli +5 more
wiley +1 more source
Audio Caption: Listen and Tell
Increasing amount of research has shed light on machine perception of audio events, most of which concerns detection and classification tasks. However, human-like perception of audio scenes involves not only detecting and classifying audio sounds, but ...
Dinkel, Heinrich, Wu, Mengyue, Yu, Kai
core +1 more source
Low Temperature Site‐Specific Pulsed Laser Annealing of MoS2
The application of laser pulses, of extremely short duration, is investigated as a potential new method to modify the atomic structure of ultrathin 2D materials for use in the creation of future electrical devices. The process is efficient, offering a site‐specific functionality, where regions of an electronic device that only requires annealing is ...
Nazar Farid +13 more
wiley +1 more source
A gradient Ti‐doped WO3 electrochromic film is developed via dynamic co‐sputtering, enabling depth‐dependent band engineering and strain regulation. The gradient structure induces a built‐in electric field across the film thickness that enhances coupled ion–electron transport while improving mechanical robustness.
Fang Luo +4 more
wiley +1 more source
Abstract This study examined the effects of repeated viewing and reading fluency on incidental second language vocabulary acquisition through captioned video exposure. A total of 149 Japanese EFL learners watched a short animation with or without captions, varying in the number of repetitions (once, twice, or three times).
Satsuki Kurokawa, Takumi Uchihara
wiley +1 more source
Visual Entailment Task for Visually-Grounded Language Learning [PDF]
We introduce a new inference task - Visual Entailment (VE) - which differs from traditional Textual Entailment (TE) tasks whereby a premise is defined by an image, rather than a natural language sentence as in TE tasks.
Doran, Derek +3 more
core +2 more sources
Read the free Plain Language Summary for this article on the Journal blog. Abstract Locomotion consumes a large proportion of individual energy budgets and may impose energetic constraints on other fitness‐related traits particularly under variable environmental conditions.
Miki Jahn, Frank Seebacher
wiley +1 more source
MultiCOIN: Multi‐Modal COntrollable INbetweening
Abstract Video inbetweening creates smooth transitions between two frames making it an indispensable tool for video editing and longform video synthesis. Existing methods struggle with large or complex motion and offer limited control over intermediate frames, often misaligning with user intent.
M. Tanveer +6 more
wiley +1 more source

