Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System
Automatic visual speech recognition is an interesting problem in pattern recognition especially when audio data is noisy or not readily available.
Ekenel, Hazım Kemal+3 more
core +1 more source
A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition [PDF]
The ability to accurately recognize, localize and separate sound sources is fundamental to any audio-visual perception task. Historically, these abilities were tackled separately, with several methods developed independently for each task. However, given the interconnected nature of source localization, separation, and recognition, independent models ...
arxiv
On the role of spatial phase and phase correlation in vision, illusion and cognition
Numerous findings indicate that spatial phase bears an important cognitive information. Distortion of phase affects topology of edge structures and makes images unrecognizable.
Evgeny eGladilin+2 more
doaj +1 more source
An Orientation Selective Neural Network and its Application to Cosmic Muon Identification
We propose a novel method for identification of a linear pattern of pixels on a two-dimensional grid. Following principles employed by the visual cortex, we employ orientation selective neurons in a neural network which performs this task.
Andresen+12 more
core +1 more source
On spatial selectivity and prediction across conditions with fMRI [PDF]
Researchers in functional neuroimaging mostly use activation coordinates to formulate their hypotheses. Instead, we propose to use the full statistical images to define regions of interest (ROIs).
Schwartz, Yannick+2 more
core +4 more sources
Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition [PDF]
This paper focuses on designing a noise-robust end-to-end Audio-Visual Speech Recognition (AVSR) system. To this end, we propose Visual Context-driven Audio Feature Enhancement module (V-CAFE) to enhance the input noisy audio speech with a help of audio-visual correspondence.
arxiv
Event-driven continuous STDP learning with deep structure for visual pattern recognition [PDF]
Human beings can achieve reliable and fast visual pattern recognition with limited time and learning samples. Underlying this capability, ventral stream plays an important role in object representation and form recognition. Modeling the ventral steam may
Liu, Daqi, Yue, Shigang
core +1 more source
Identification of novel small molecule inhibitors of ETS transcription factors
ETS transcription factors play an essential role in tumourigenesis and are indispensable for sprouting angiogenesis, a hallmark of cancer, which fuels tumour expansion and dissemination. Thus, targeting ETS transcription factor function could represent an effective, multifaceted strategy to block tumour growth. The evolutionarily conserved E‐Twenty‐Six
Shaima Abdalla+9 more
wiley +1 more source
Can audio-visual integration strengthen robustness under multimodal attacks? [PDF]
In this paper, we propose to make a systematic study on machines multisensory perception under attacks. We use the audio-visual event recognition task against multimodal adversarial attacks as a proxy to investigate the robustness of audio-visual learning.
arxiv
In lymphoid organs, antigen recognition and B cell receptor signaling rely on integrins and the cytoskeleton. Integrins act as mechanoreceptors, couple B cell receptor activation to cytoskeletal remodeling, and support immune synapse formation as well as antigen extraction.
Abhishek Pethe, Tanja Nicole Hartmann
wiley +1 more source