Results 31 to 40 of about 1,179 (120)

Wavenet based low rate speech coding

open access: yes, 2017
Traditional parametric coding of speech facilitates low rate but provides poor reconstruction quality because of the inadequacy of the model used. We describe how a WaveNet generative speech model can be used to generate high quality speech from the bit ...
Kleijn, W. Bastiaan   +6 more
core   +1 more source

Sonic Watermarking

open access: yesEURASIP Journal on Advances in Signal Processing, 2004
Audio watermarking has been used mainly for digital sound. In this paper, we extend the range of its applications to live performances with a new composition method for real-time audio watermarking.
Ryuki Tachibana
doaj   +1 more source

Efficient Phantom Source Widening and Diffuseness in Ambisonics [PDF]

open access: yes, 2014
Object-based spatial audio considers virtual sound sources having a width/diffuseness parameter. This parameter aims at controlling the perceived width or diffuseness of the auditory object, or phantom source, created by the renderer.
Choi, Jung-Woo   +3 more
core   +1 more source

Ultrasonic Sensor-Based Personalized Multichannel Audio Rendering for Multiview Broadcasting Services

open access: yesInternational Journal of Distributed Sensor Networks, 2013
An ultrasonic sensor-based personalized multichannel audio rendering method is proposed for multiview broadcasting services. Multiview broadcasting, a representative next-generation broadcasting technique, renders video image sequences captured by ...
Yong Guk Kim   +3 more
doaj   +1 more source

A Novel Syllable-Level Signal Encryption for Robust Secure Speech Communication System

open access: yesIEEE Access
Speech communication is vital for conveying information and emotions, yet it faces significant security threats. This research presents a novel signal encryption system that operates at the syllable level, preserving the natural flow of speech while ...
Albertus Anugerah Pekerti   +3 more
doaj   +1 more source

Considering Bluetooth's Subband Codec (SBC) for Wideband Speech and Audio on the Internet [PDF]

open access: yes, 2012
The Bluetooth Special Interest Group (SIG) has standardized the subband coding (SBC) audio codec to connect headphones via wireless Bluetooth links. SBC compresses audio at high fidelity while having an ultra-low algorithm delay. To make SBC suitable for
Hoene, Christian, Hyder, Mansoor
core  

On Building Immersive Audio Applications Using Robust Adaptive Beamforming and Joint Audio-Video Source Localization

open access: yesEURASIP Journal on Advances in Signal Processing, 2006
This paper deals with some of the different problems, strategies, and solutions of building true immersive audio systems oriented to future communication applications.
Beracoechea JA   +3 more
doaj   +1 more source

Non-intrusive method for audio quality assessment of lossy-compressed music recordings using convolutional neural networks [PDF]

open access: yesInternational Journal of Electronics and Telecommunications
Most of the existing algorithms for the objective audio quality assessment are intrusive, as they require access both to an unimpaired reference recording and an evaluated signal. This feature excludes them from many practical applications. In this paper,
Aleksandra Kasperuk   +1 more
doaj   +1 more source

A Bitrate-Scalable Variational Recurrent Mel-Spectrogram Coder for Real-Time Resynthesis-Based Speech Coding

open access: yesIEEE Access
This paper introduces a method for real-time speech coding that combines a binary-latent-vector variational recurrent neural network for mel-spectrogram coding with a non-autoregressive convolutional vocoder for waveform reconstruction. To enable bitrate
Benjamin Stahl   +2 more
doaj   +1 more source

A unit selection text-to-speech-and-singing synthesis framework from neutral speech: proof of concept

open access: yesEURASIP Journal on Audio, Speech, and Music Processing, 2019
Text-to-speech (TTS) synthesis systems have been widely used in general-purpose applications based on the generation of speech. Nonetheless, there are some domains, such as storytelling or voice output aid devices, which may also require singing.
Marc Freixes   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy