This is a preprint of a paper submitted to and accepted for INTERSPEECH ...
Lechler, Laura +2 more
exaly +4 more sources
Perceptual Similarities between Artificial Reverberation Algorithms and Real Reverberation
This paper presents a study evaluating the perceptual similarity between artificial reverberation algorithms and acoustic measurements. An online headphone-based listening test was conducted and data were collected from 20 expert assessors.
Huan Mi, Gavin Cyril Kearney
exaly +3 more sources
A Perceptual Evaluation of Music Real-Time Communication Applications
Music Real-time Communication applications (M-RTC) enable music making (musiking) for musicians simultaneously across geographic distance. When used for musiking, M-RTC such as Zoom and JackTrip, require satisfactorily received acoustical perception of ...
Dana Kemack Goot, Timothy Hsu
exaly +3 more sources
Perceptual Evaluation of Binaural MVDR-Based Algorithms to Preserve the Interaural Coherence of Diffuse Noise Fields [PDF]
Besides improving speech intelligibility in background noise, another important objective of noise reduction algorithms for binaural hearing devices is preserving the spatial impression for the listener.
Nico Gößling +2 more
doaj +2 more sources
Rethinking MUSHRA: Addressing Modern Challenges in Text-to-Speech Evaluation
Accepted in ...
Varadhan, Praveen Srinivasa +10 more
openaire +3 more sources
Comparison of spatial sound recording techniques with usage of ambisonics and object-based audio [PDF]
In this article spatial audio recording techniques are compared: scene-based audio and object-based audio. The study involved mixing recordings from a higher-order ambisonic microphone and support microphones, ambisonically encoded on a virtual sphere ...
Bartłomiej Mróz, Patryk Kosior
doaj +2 more sources
A Novel Syllable-Level Signal Encryption for Robust Secure Speech Communication System
Speech communication is vital for conveying information and emotions, yet it faces significant security threats. This research presents a novel signal encryption system that operates at the syllable level, preserving the natural flow of speech while ...
Albertus Anugerah Pekerti +3 more
doaj +2 more sources
Enhancement by postfiltering for speech and audio coding in ad hoc sensor networks [PDF]
Enhancement algorithms for wireless acoustic sensor networks (WASNs) are indispensable with the increasing availability and usage of connected devices with microphones.
Sneha Das, Tom Bäckström
doaj +1 more source
High Fidelity Neural Audio Compression [PDF]
We introduce a state-of-the-art real-time, high-fidelity, audio codec leveraging neural networks. It consists in a streaming encoder-decoder architecture with quantized latent space trained in an end-to-end fashion.
Alexandre D'efossez +3 more
semanticscholar +1 more source
Speech quality assessment with WARP‐Q: From similarity to subsequence dynamic time warp cost
Abstract Speech coding has been shown to achieve good speech quality using either waveform matching or parametric reconstruction. For very low bit rate streams, recently developed generative speech models can reconstruct high‐quality wideband speech from the bit streams of standard parametric encoders at less than 3 kb/s. Generative codecs produce high‐
Wissam A. Jassim +3 more
wiley +1 more source

