Deep Learning for Depression Recognition with Audiovisual Cues: A Review
With the acceleration of the pace of work and life, people have to face more and more pressure, which increases the possibility of suffering from depression.
Lang He +11 more
semanticscholar +1 more source
Audiovisual Masked Autoencoders
Can we leverage the audiovisual information already present in video to improve self-supervised representation learning? To answer this question, we study various pre-training architectures and objectives within the masked autoencoding framework ...
Mariana-Iuliana Georgescu +5 more
semanticscholar +1 more source
In the intricate and multifaceted landscape of the European construction process, where the development and governance of the European Union take shape through a myriad of policies, institutions, and stakeholders, this study delves into the role of ...
Aritz Gorostiza-Cerviño +3 more
doaj +1 more source
Modeling of an Automatic Vision Mixer With Human Characteristics for Multi-Camera Theater Recordings
A production process using high-resolution cameras can be used for multi-camera recordings of theater performances or other stage performances. One approach to automate the generation of suitable image cuts could be to focus on speaker changes so that ...
Eckhard Stoll +3 more
doaj +1 more source
Audiovisual integration in the human brain: a coordinate-based meta-analysis
People can seamlessly integrate a vast array of information from what we see and hear in the noisy and uncertain world. However, the neural underpinnings of audiovisual integration continue to be the topic of debate.
Chuanji Gao +5 more
semanticscholar +1 more source
Analysis of Appeal for Realistic AI-Generated Photos
AI-generated images have gained in popularity in recent years due to improvements and developments in the field of artificial intelligence. This has led to several new AI generators, which may produce realistic, funny, and impressive images using a ...
Steve Göring +3 more
doaj +1 more source
Emotion recognition using audiovisual features is a challenging task for human-machine interaction systems. Under ideal conditions (perfect illumination, clean speech signals, and non-occluded visual data) many systems are able to achieve reliable ...
Lucas Goncalves, C. Busso
semanticscholar +1 more source
Metaverse and Extended Realities in Immersive Journalism: A Systematic Literature Review
Immersive journalism is a new form of media communication that uses extended reality systems to produce its content. Despite the possibilities it offers, its use is still limited in the media due to the lack of systematised and scientific knowledge ...
Alberto Sanchez-Acedo +3 more
doaj +1 more source
Modular Framework and Instances of Pixel-Based Video Quality Models for UHD-1/4K
The popularity of video on-demand streaming services increased tremendously over the last years. Most services use HTTP-based adaptive video streaming methods.
Steve Göring +3 more
doaj +1 more source
The audiovisual translation (AVT) sector has undergone rapid changes in recent years. It would be uncontroversial to state that the various stakeholders: academics; freelancers; technology providers, and language service providers (LSP) are likely to ...
Kristijan Nikolic, L. Bywood
semanticscholar +1 more source

