Results 61 to 70 of about 62,294 (212)
Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed
Speechreading or lipreading is the technique of understanding and getting phonetic features from a speaker's visual features such as movement of lips, face, teeth and tongue.
Beerends John G +9 more
core +1 more source
Joint Acoustic and Modulation Frequency
There is a considerable evidence that our perception of sound uses important features which is related to underlying signal modulations. This topic has been studied extensively via perceptual experiments, yet there are few, if any, well-developed signal ...
Les Atlas, Shihab A. Shamma
doaj +1 more source
Collapsed speech segment detection and suppression for WaveNet vocoder
In this paper, we propose a technique to alleviate the quality degradation caused by collapsed speech segments sometimes generated by the WaveNet vocoder.
Hayashi, Tomoki +4 more
core +1 more source
Low-Rate High-Quality Parametric Audio Coder based on Sinusoidal plus Noise Representations
This paper presents a parametric audio compression scheme intended for scalable audio coding applications, and is particularly well suited for operation at low rates, in the vicinity of 5 to 32 Kbps.
Raed AL-MOUSSAWY
doaj
The Occurrence Rate of the Fission Illusion Differs Depending on the Complexity of Visual Stimuli
A fission illusion (also named a double—flash illusion) is a famous phenomenon of audio-visual interaction, in which a single brief flash is perceived as two flashes when presented simultaneously with two brief beeps (Shames, Kamitani, & Shimojo, 2000 ...
Yasuhiro Takeshima, Jiro Gyoba
doaj +1 more source
Scalable wavelet packet based perceptual audio coding scheme
Conventional perceptual coding algorithms do not normally exploit the temporal masking property of the human auditory system. These algorithms rely only on simultaneous masking models to calculate the masking threshold. This work proposes the use of a temporal masking model, combined with a simultaneous masking model, in wavelet packet-based audio ...
openaire +2 more sources
Perceptual multimedia quality: Implications of an empirical study [PDF]
Copyright @ 2005 HCI InternationalIf commercial multimedia development continues to ignore the user-perspective in preference of other factors, i.e. user fascination (i.e.
Ghinea, G, Gulliver, SR
core
InSE-NET: A Perceptually Coded Audio Quality Model based on CNN [PDF]
Guanxin Jiang +3 more
openalex +1 more source
Perceptual audio coding schemes based on adaptive signal processing tools [PDF]
Fernando A. Marengo Rodríguez +2 more
openalex +1 more source

