DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors [PDF]
Human subjective evaluation is the "gold standard" to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores.
Chandan K. A. Reddy +2 more
semanticscholar +1 more source
Multisensory Integration Sites Identified by Perception of Spatial Wavelet Filtered Visual Speech Gesture Information [PDF]
Perception of speech is improved when presentation of the audio signal is accompanied by concordant visual speech gesture information. This enhancement is most prevalent when the audio signal is degraded.
Callan, Akiko M. +5 more
core +2 more sources
Dataset of directional room impulse responses for realistic speech data
Obtaining real-world multi-channel speech recordings is expensive and time-consuming. Therefore, multi-channel recordings are often artificially generated by convolving existing monaural speech recordings with simulated Room Impulse Responses (RIRs) from ...
Stefan Fragner +3 more
doaj +1 more source
Estimation of Severity of Speech Disability through Speech Envelope
This paper discusses envelope detection of speech as a way to distinguish pathological cases among speech-disabled children. Speech signal samples from children between five and eight years of age are considered for the present study.
Gudi, Anandthirtha B. +2 more
core +1 more source
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation [PDF]
Single-channel, speaker-independent speech separation methods have recently seen great progress. However, the accuracy, latency, and computational cost of such methods remain insufficient.
Yi Luo, N. Mesgarani
semanticscholar +1 more source
Gaussian Process Modeling of Specular Multipath Components
The consideration of ultra-wideband (UWB) and mm-wave signals allows for a channel description decomposed into specular multipath components (SMCs) and dense/diffuse multipath. In this paper, the amplitude and phase of SMCs are studied.
Anh Hong Nguyen +4 more
doaj +1 more source
Effects of noise suppression and envelope dynamic range compression on the intelligibility of vocoded sentences for a tonal language [PDF]
Vocoder simulation studies have suggested that the carrier signal type employed affects the intelligibility of vocoded speech. The present work further assessed how carrier signal type interacts with additional signal processing, namely, single-channel ...
Chen F. +6 more
core +2 more sources
Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech [PDF]
Speech emotion recognition is an important and challenging task in the realm of human-computer interaction. Prior work proposed a variety of models and feature sets for training a system.
Michael Neumann, Ngoc Thang Vu
semanticscholar +1 more source
Speech enhancement methods based on binaural cue coding
Following the encoding and decoding mechanism of binaural cue coding (BCC), this paper treats the speech and the noise as the left-channel and right-channel signals of the BCC framework, respectively.
Xianyun Wang, Changchun Bao
doaj +1 more source
In this paper, a novel array structure exploiting coprime arrays is proposed, which comprises the translocation of one subarray and axis rotation with a compression of another subarray to produce a larger number of consecutive lags.
Tarek Hasan Al Mahmud +3 more
doaj +1 more source

