
Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed

open access: yes, 2018
Speechreading or lipreading is the technique of understanding and getting phonetic features from a speaker's visual features such as movement of lips, face, teeth and tongue.
Beerends John G   +9 more
core   +1 more source

Joint Acoustic and Modulation Frequency

open access: yes, EURASIP Journal on Advances in Signal Processing, 2003
There is considerable evidence that our perception of sound uses important features which are related to underlying signal modulations. This topic has been studied extensively via perceptual experiments, yet there are few, if any, well-developed signal ...
Les Atlas, Shihab A. Shamma
doaj   +1 more source

Collapsed speech segment detection and suppression for WaveNet vocoder

open access: yes, 2018
In this paper, we propose a technique to alleviate the quality degradation caused by collapsed speech segments sometimes generated by the WaveNet vocoder.
Hayashi, Tomoki   +4 more
core   +1 more source

Low-Rate High-Quality Parametric Audio Coder based on Sinusoidal plus Noise Representations

open access: yes, Iraqi Journal of Physics, 2002
This paper presents a parametric audio compression scheme intended for scalable audio coding applications, and is particularly well suited for operation at low rates, in the vicinity of 5 to 32 Kbps.
Raed AL-MOUSSAWY
doaj  

The Occurrence Rate of the Fission Illusion Differs Depending on the Complexity of Visual Stimuli

open access: yes, i-Perception, 2011
A fission illusion (also named a double-flash illusion) is a famous phenomenon of audio-visual interaction, in which a single brief flash is perceived as two flashes when presented simultaneously with two brief beeps (Shams, Kamitani, & Shimojo, 2000 ...
Yasuhiro Takeshima, Jiro Gyoba
doaj   +1 more source

Scalable wavelet packet based perceptual audio coding scheme

open access: yes, 2005
Conventional perceptual coding algorithms do not normally exploit the temporal masking property of the human auditory system. These algorithms rely only on simultaneous masking models to calculate the masking threshold. This work proposes the use of a temporal masking model, combined with a simultaneous masking model, in wavelet packet-based audio ...
openaire   +2 more sources

Perceptual multimedia quality: Implications of an empirical study [PDF]

open access: yes, 2005
Copyright @ 2005 HCI International. If commercial multimedia development continues to ignore the user-perspective in preference of other factors, i.e. user fascination (i.e.
Ghinea, G, Gulliver, SR
core  

InSE-NET: A Perceptually Coded Audio Quality Model based on CNN [PDF]

open access: green, 2021
Guanxin Jiang   +3 more
openalex   +1 more source

Perceptual audio coding schemes based on adaptive signal processing tools [PDF]

open access: bronze, 2016
Fernando A. Marengo Rodríguez   +2 more
openalex   +1 more source
