AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications
The paper presents AVQBits, a versatile, bitstream-based video quality model. It can be applied in several contexts such as video service monitoring, evaluation of video encoding quality, of gaming video QoE, and even of omnidirectional video quality.
Rakesh Rao Ramachandra Rao +2 more
doaj +1 more source
Class-Aware Sounding Objects Localization via Audiovisual Correspondence
Audiovisual scenes are pervasive in our daily life. It is commonplace for humans to discriminatively localize different sounding objects but quite challenging for machines to achieve class-aware sounding objects localization without category annotations,
Di Hu +5 more
semanticscholar +1 more source
End-to-End Audiovisual Speech Recognition System With Multitask Learning
An automatic speech recognition (ASR) system is a key component in current speech-based systems. However, the surrounding acoustic noise can severely degrade the performance of an ASR system.
Fei Tao, C. Busso
semanticscholar +1 more source
Feminist Stereotypes and Women’s Roles in Spanish Radio Ads
This article takes a quantitative approach to Spanish radio advertising and the stereotypes and female roles that it broadcasts in a medium that has traditionally had high female audience rates in our country.
Anna Fajula +4 more
doaj +1 more source
Automated audiovisual behavior recognition in wild primates
Deep learning using audiovisual data from chimpanzee percussive behaviors enables action recognition in the wild.
Max Bain +11 more
semanticscholar +1 more source
At the core of augmented reality audio (ARA) technology lies the ARA mix, a process responsible for the assignment of a virtual environment to a real one. Legacy ARA mix models have focused on the natural reproduction of the real environment, whereas the
Nikolaos Moustakas +3 more
doaj +1 more source
Watching Historical Films Through AI: Reflections on Image Retrieval from Heritage Collections
Computer tools allow us to watch films differently and offer film scholars the opportunity to ask diverse research questions and imagine innovative methods.
Beatriz Tadeo Fuica +2 more
doaj +1 more source
End-to-End Audiovisual Speech Recognition
Several end-to-end deep learning approaches have been recently presented which extract either audio or visual features from the input images or audio signals and perform speech recognition.
Stavros Petridis +5 more
semanticscholar +1 more source
STAViS: Spatio-Temporal AudioVisual Saliency Network
We introduce STAViS, a spatio-temporal audiovisual saliency network that combines spatio-temporal visual and auditory information in order to efficiently address the problem of saliency estimation in videos.
A. Tsiami, Petros Koutras, P. Maragos
semanticscholar +1 more source
The aim of the study is to analyze the communication management strategies of the top 40 hotel companies, in terms of turnover, using their corporate accounts on social networks during the Easter holiday campaign in 2021 and 2022.
Antonio Baraybar-Fernández +2 more
doaj +1 more source

