Results 41 to 50 of about 6,722,896 (329)

Dnsmos: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors [PDF]

open access: yesIEEE International Conference on Acoustics, Speech, and Signal Processing, 2020
Human subjective evaluation is the "gold standard" to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores.
Chandan K. A. Reddy   +2 more
semanticscholar   +1 more source

Multisensory Integration Sites Identified by Perception of Spatial Wavelet Filtered Visual Speech Gesture Information [PDF]

open access: yes, 2004
Perception of speech is improved when presentation of the audio signal is accompanied by concordant visual speech gesture information. This enhancement is most prevalent when the audio signal is degraded.
Callan, Akiko M.   +5 more
core   +2 more sources

Dataset of directional room impulse responses for realistic speech data

open access: yesData in Brief
Obtaining real-world multi-channel speech recordings is expensive and time-consuming. Therefore, multi-channel recordings are often artificially generated by convolving existing monaural speech recordings with simulated Room Impulse Responses (RIRs) from
Stefan Fragner   +3 more
doaj   +1 more source

Estimation of Severity of Speech Disability through Speech Envelope

open access: yes, 2011
In this paper, envelope detection of speech is discussed to distinguish the pathological cases of speech disabled children. The speech signal samples of children of age between five to eight years are considered for the present study.
Gudi, Anandthirtha B.   +2 more
core   +1 more source

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation [PDF]

open access: yesIEEE/ACM Transactions on Audio Speech and Language Processing, 2018
Single-channel, speaker-independent speech separation methods have recently seen great progress. However, the accuracy, latency, and computational cost of such methods remain insufficient.
Yi Luo, N. Mesgarani
semanticscholar   +1 more source

Gaussian Process Modeling of Specular Multipath Components

open access: yesApplied Sciences, 2020
The consideration of ultra-wideband (UWB) and mm-wave signals allows for a channel description decomposed into specular multipath components (SMCs) and dense/diffuse multipath. In this paper, the amplitude and phase of SMCs are studied.
Anh Hong Nguyen   +4 more
doaj   +1 more source

Effects of noise suppression and envelope dynamic range compression on the intelligibility of vocoded sentences for a tonal language [PDF]

open access: yes, 2017
Vocoder simulation studies have suggested that the carrier signal type employed affects the intelligibility of vocoded speech. The present work further assessed how carrier signal type interacts with additional signal processing, namely, single-channel ...
Chen F.   +6 more
core   +2 more sources

Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech [PDF]

open access: yesInterspeech, 2017
Speech emotion recognition is an important and challenging task in the realm of human-computer interaction. Prior work proposed a variety of models and feature sets for training a system.
Michael Neumann, Ngoc Thang Vu
semanticscholar   +1 more source

Speech enhancement methods based on binaural cue coding

open access: yesEURASIP Journal on Audio, Speech, and Music Processing, 2019
According to the encoding and decoding mechanism of binaural cue coding (BCC), in this paper, the speech and noise are considered as left channel signal and right channel signal of the BCC framework, respectively.
Xianyun Wang, Changchun Bao
doaj   +1 more source

Interpolating Coprime Arrays With Translocated and Axis Rotated Compressed Subarrays by Iterative Power Factorization for DOA Estimation

open access: yesIEEE Access, 2018
In this paper, a novel array structure exploiting coprime arrays is proposed, which comprises the translocation of one subarray and axis rotation with a compression of another subarray to produce a larger number of consecutive lags.
Tarek Hasan Al Mahmud   +3 more
doaj   +1 more source

Home - About - Disclaimer - Privacy