Perceptual audio coding - Open Access .click

Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed

2018
Speechreading, or lipreading, is the technique of understanding speech and extracting phonetic features from a speaker's visual cues, such as the movement of the lips, face, teeth and tongue.
Beerends John G +9 more
core +1 more source

Joint Acoustic and Modulation Frequency

EURASIP Journal on Advances in Signal Processing, 2003
There is considerable evidence that our perception of sound uses important features which are related to underlying signal modulations. This topic has been studied extensively via perceptual experiments, yet there are few, if any, well-developed signal ...
Les Atlas, Shihab A. Shamma
doaj +1 more source

Collapsed speech segment detection and suppression for WaveNet vocoder

2018
In this paper, we propose a technique to alleviate the quality degradation caused by collapsed speech segments sometimes generated by the WaveNet vocoder.
Hayashi, Tomoki +4 more
core +1 more source

Low-Rate High-Quality Parametric Audio Coder based on Sinusoidal plus Noise Representations

Iraqi Journal of Physics, 2002
This paper presents a parametric audio compression scheme intended for scalable audio coding applications; it is particularly well suited for operation at low rates, in the vicinity of 5 to 32 kbps.
Raed AL-MOUSSAWY
doaj

The Occurrence Rate of the Fission Illusion Differs Depending on the Complexity of Visual Stimuli

i-Perception, 2011
A fission illusion (also named a double-flash illusion) is a well-known phenomenon of audio-visual interaction, in which a single brief flash is perceived as two flashes when presented simultaneously with two brief beeps (Shams, Kamitani, & Shimojo, 2000 ...
Yasuhiro Takeshima, Jiro Gyoba
doaj +1 more source

Scalable wavelet packet based perceptual audio coding scheme

2005
Conventional perceptual coding algorithms do not normally exploit the temporal masking property of the human auditory system. These algorithms rely only on simultaneous masking models to calculate the masking threshold. This work proposes the use of a temporal masking model, combined with a simultaneous masking model, in wavelet packet-based audio ...
openaire +2 more sources

Perceptual multimedia quality: Implications of an empirical study [PDF]

2005
Copyright © 2005 HCI International. If commercial multimedia development continues to ignore the user perspective in favor of other factors, i.e. user fascination (i.e.
Ghinea, G, Gulliver, SR
core

InSE-NET: A Perceptually Coded Audio Quality Model based on CNN [PDF]

2021
Guanxin Jiang +3 more
openalex +1 more source

Perceptual audio coding schemes based on adaptive signal processing tools [PDF]

2016
Fernando A. Marengo Rodríguez, Sergio A. Castells, Gonzalo D. Sad +2 more
openalex +1 more source

Audio Source Separation Using Perceptual Principles for Content-Based Coding and Information Management

2004
K. Melih
openalex +3 more sources
