Mushra - Open Access .click


2017
Traditional parametric coding of speech facilitates low rate but provides poor reconstruction quality because of the inadequacy of the model used. We describe how a WaveNet generative speech model can be used to generate high quality speech from the bit ...
Kleijn, W. Bastiaan +6 more

Sonic Watermarking

EURASIP Journal on Advances in Signal Processing, 2004
Audio watermarking has been used mainly for digital sound. In this paper, we extend the range of its applications to live performances with a new composition method for real-time audio watermarking.
Ryuki Tachibana

Efficient Phantom Source Widening and Diffuseness in Ambisonics

2014
Object-based spatial audio considers virtual sound sources having a width/diffuseness parameter. This parameter aims at controlling the perceived width or diffuseness of the auditory object, or phantom source, created by the renderer.
Choi, Jung-Woo +3 more

Ultrasonic Sensor-Based Personalized Multichannel Audio Rendering for Multiview Broadcasting Services

International Journal of Distributed Sensor Networks, 2013
An ultrasonic sensor-based personalized multichannel audio rendering method is proposed for multiview broadcasting services. Multiview broadcasting, a representative next-generation broadcasting technique, renders video image sequences captured by ...
Yong Guk Kim +3 more

A Novel Syllable-Level Signal Encryption for Robust Secure Speech Communication System

IEEE Access
Speech communication is vital for conveying information and emotions, yet it faces significant security threats. This research presents a novel signal encryption system that operates at the syllable level, preserving the natural flow of speech while ...
Albertus Anugerah Pekerti +3 more

Considering Bluetooth's Subband Codec (SBC) for Wideband Speech and Audio on the Internet

2012
The Bluetooth Special Interest Group (SIG) has standardized the subband coding (SBC) audio codec to connect headphones via wireless Bluetooth links. SBC compresses audio at high fidelity while having an ultra-low algorithm delay. To make SBC suitable for ...
Hoene, Christian; Hyder, Mansoor

On Building Immersive Audio Applications Using Robust Adaptive Beamforming and Joint Audio-Video Source Localization

EURASIP Journal on Advances in Signal Processing, 2006
This paper deals with some of the different problems, strategies, and solutions of building true immersive audio systems oriented to future communication applications.
Beracoechea JA +3 more

Non-intrusive method for audio quality assessment of lossy-compressed music recordings using convolutional neural networks

International Journal of Electronics and Telecommunications
Most of the existing algorithms for the objective audio quality assessment are intrusive, as they require access both to an unimpaired reference recording and an evaluated signal. This feature excludes them from many practical applications. In this paper, ...
Aleksandra Kasperuk, Sławomir Krzysztof Zieliński +1 more

A Bitrate-Scalable Variational Recurrent Mel-Spectrogram Coder for Real-Time Resynthesis-Based Speech Coding

IEEE Access
This paper introduces a method for real-time speech coding that combines a binary-latent-vector variational recurrent neural network for mel-spectrogram coding with a non-autoregressive convolutional vocoder for waveform reconstruction. To enable bitrate ...
Benjamin Stahl, Simon Windtner, Alois Sontacchi +2 more

A unit selection text-to-speech-and-singing synthesis framework from neutral speech: proof of concept

EURASIP Journal on Audio, Speech, and Music Processing, 2019
Text-to-speech (TTS) synthesis systems have been widely used in general-purpose applications based on the generation of speech. Nonetheless, there are some domains, such as storytelling or voice output aid devices, which may also require singing.
Marc Freixes, Francesc Alías, Joan Claudi Socoró +2 more
