Resources and Benchmarks for Keyword Search in Spoken Audio From Low-Resource Indian Languages
This paper presents the resources and benchmarks developed for keyword search (KWS) in spoken audio from six low-resource Indian languages (from two families), namely Gujarati, Hindi, Marathi, Odia, Tamil, and Telugu.
Vijaya Lakshmi V. Nadimpalli +4 more
Seeing a speaker's face benefits speech comprehension, especially in challenging listening conditions. This perceptual benefit is thought to stem from the neural integration of visual and auditory speech at multiple stages of processing, whereby movement ...
Aisling E. O’Sullivan +4 more
Single-Anchor Positioning: Multipath Processing With Non-Coherent Directional Measurements
High-accuracy indoor radio positioning can be achieved by using (ultra) wideband (UWB) radio signals. Multiple fixed anchor nodes are needed to compute the position or alternatively, specular multipath components (SMCs) extracted from radio signals can ...
Michael Rath +3 more
Reading Fluency in Children and Adolescents Who Stutter
Speech fluency is a major challenge for young persons who stutter. Reading aloud, in particular, puts high demands on fluency, not only regarding online text decoding and articulation, but also in terms of prosodic performance.
Mona Franke +3 more
Individual differences in the discrimination of novel speech sounds: effects of sex, temporal processing, musical and cognitive abilities
This study examined whether rapid temporal auditory processing, verbal working memory capacity, non-verbal intelligence, executive functioning, musical ability and prior foreign language experience predicted how well native English speakers (N = 120 ...
Brooks, Patricia J. +4 more
Transcranial alternating current stimulation in the theta band but not in the delta band modulates the comprehension of naturalistic speech in noise
© 2020 Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Auditory cortical activity entrains to speech rhythms and has been proposed as a mechanism for online ...
Asamoah +31 more
Automatic Detection of Dyspnea in Real Human–Robot Interaction Scenarios
A respiratory distress estimation technique for telephony previously proposed by the authors is adapted and evaluated in real static and dynamic HRI scenarios.
Eduardo Alvarado +7 more
Speech vocoding for laboratory phonology
Using phonological speech vocoding, we propose a platform for exploring relations between phonology and speech processing, and in broader terms, for exploring relations between the abstract and physical structures of a speech signal.
Benus, Stefan +2 more
Voice communication between air traffic controllers (ATCos) and pilots is critical for ensuring safe and efficient air traffic control (ATC). The handling of these voice communications requires high levels of awareness from ATCos and can be tedious and ...
Juan Zuluaga-Gomez +10 more
Effects of noise suppression and envelope dynamic range compression on the intelligibility of vocoded sentences for a tonal language
Vocoder simulation studies have suggested that the carrier signal type employed affects the intelligibility of vocoded speech. The present work further assessed how carrier signal type interacts with additional signal processing, namely, single-channel ...
Chen F. +6 more

