
AVQBits—Adaptive Video Quality Model Based on Bitstream Information for Various Video Applications

open access: yes; IEEE Access, 2022
The paper presents AVQBits, a versatile, bitstream-based video quality model. It can be applied in several contexts such as video service monitoring, evaluation of video encoding quality, of gaming video QoE, and even of omnidirectional video quality.
Rakesh Rao Ramachandra Rao   +2 more
doaj   +1 more source

Class-Aware Sounding Objects Localization via Audiovisual Correspondence

open access: yes; IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021
Audiovisual scenes are pervasive in our daily life. It is commonplace for humans to discriminatively localize different sounding objects but quite challenging for machines to achieve class-aware sounding objects localization without category annotations,
Di Hu   +5 more
semanticscholar   +1 more source

End-to-End Audiovisual Speech Recognition System With Multitask Learning

open access: yes; IEEE Transactions on Multimedia, 2021
An automatic speech recognition (ASR) system is a key component in current speech-based systems. However, the surrounding acoustic noise can severely degrade the performance of an ASR system.
Fei Tao, C. Busso
semanticscholar   +1 more source

Feminist Stereotypes and Women’s Roles in Spanish Radio Ads

open access: yes; Media and Communication, 2021
This article takes a quantitative approach to Spanish radio advertising and the stereotypes and female roles that it broadcasts in a medium that has traditionally had high female audience rates in our country.
Anna Fajula   +4 more
doaj   +1 more source

Automated audiovisual behavior recognition in wild primates

open access: yes; Science Advances, 2021
Deep learning using audiovisual data from chimpanzee percussive behaviors enables action recognition in the wild.
Max Bain   +11 more
semanticscholar   +1 more source

Prediction and Controlling of Auditory Perception in Augmented Environments. A Loudness-Based Dynamic Mixing Technique

open access: yes; Applied Sciences, 2021
At the core of augmented reality audio (ARA) technology lies the ARA mix, a process responsible for the assignment of a virtual environment to a real one. Legacy ARA mix models have focused on the natural reproduction of the real environment, whereas the
Nikolaos Moustakas   +3 more
doaj   +1 more source

Watching Historical Films Through AI: Reflections on Image Retrieval from Heritage Collections

open access: yes; Cinergie, 2021
Computer tools allow us to watch films differently and offer film scholars the opportunity to ask diverse research questions and imagine innovative methods.
Beatriz Tadeo Fuica   +2 more
doaj   +1 more source

End-to-End Audiovisual Speech Recognition

open access: yes; IEEE International Conference on Acoustics, Speech, and Signal Processing, 2018
Several end-to-end deep learning approaches have been recently presented which extract either audio or visual features from the input images or audio signals and perform speech recognition.
Stavros Petridis   +5 more
semanticscholar   +1 more source

STAViS: Spatio-Temporal AudioVisual Saliency Network

open access: yes; Computer Vision and Pattern Recognition, 2020
We introduce STAViS, a spatio-temporal audiovisual saliency network that combines spatio-temporal visual and auditory information in order to efficiently address the problem of saliency estimation in videos.
A. Tsiami, Petros Koutras, P. Maragos
semanticscholar   +1 more source

A Comparative Study of Communication Management Strategies on Social Media in the Hotel Industry in Spain in Times of COVID-19

open access: yes; Administrative Sciences, 2023
The aim of the study is to analyze the communication management strategies of the top 40 hotel companies, in terms of turnover, using their corporate accounts on social networks during the Easter holiday campaign in 2021 and 2022.
Antonio Baraybar-Fernández   +2 more
doaj   +1 more source
