End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent Neural Models
Speech activity detection (SAD) plays an important role in current speech processing systems, including automatic speech recognition (ASR). SAD is particularly difficult in environments with acoustic noise.
Busso, Carlos, Tao, Fei
core
Artimate: an articulatory animation framework for audiovisual speech synthesis [PDF]
We present a modular framework for articulatory animation synthesis using speech motion capture data obtained with electromagnetic articulography (EMA).
Ouni, Slim, Steiner, Ingmar
core +4 more sources
Indo-U.S. FTA: Prospects for Audiovisual Services [PDF]
Many WTO (World Trade Organization) member countries, including India, are defensive about opening up of the audiovisual sector in the Doha Round due to reasons of cultural sensitivity.
Arpita Mukherjee +2 more
core
What role does temporal synchrony play in mid-level audiovisual crossmodal correspondences? [PDF]
Spence C, Di Stefano N.
europepmc +1 more source
Inverted encoding of neural responses to audiovisual stimuli reveals super-additive multisensory enhancement. [PDF]
Buhmann Z +3 more
europepmc +1 more source
Echoes of the mind's eye: Reciprocal crossmodal interaction between auditory and visual processing. [PDF]
Tang X, Zhang T, Sun J, Lu S.
europepmc +1 more source
Sensorimotor Frequency Tagging Is Enhanced by Auditory and Audiovisual but Not Visual, Inputs During a Body-Walking Task. [PDF]
Matamala-Gomez M +4 more
europepmc +1 more source
Generative AI-driven synthetic media risks in digital health: implications for telemedicine and teledentistry. [PDF]
Jędrasiak K, Bijoch J.
europepmc +1 more source
A deep neural network model of audiovisual speech recognition reports the McGurk effect. [PDF]
Ma H +4 more
europepmc +1 more source

