Building competitive direct acoustics-to-word models for English conversational speech recognition [PDF]

open access: green, 2017
Direct acoustics-to-word (A2W) models in the end-to-end paradigm have received increasing attention compared to conventional sub-word based automatic speech recognition models using phones, characters, or context-dependent hidden Markov model states ...
Audhkhasi, Kartik   +4 more
core   +2 more sources

Digital remote assessment of speech acoustics in cognitively unimpaired adults: feasibility, reliability and associations with amyloid pathology. [PDF]

open access: gold, Alzheimers Res Ther
van den Berg RL   +17 more
europepmc   +3 more sources

Identification of Affective State Change in Adults With Aphasia Using Speech Acoustics. [PDF]

open access: green, J Speech Lang Hear Res, 2018
Gillespie S   +5 more
europepmc   +3 more sources

A Pronunciation Prior Assisted Vowel Reduction Detection Framework with Multi-Stream Attention Method

open access: yes, Applied Sciences, 2021
Vowel reduction is a common pronunciation phenomenon in stress-timed languages like English. Native speakers tend to weaken unstressed vowels into a schwa-like sound.
Zongming Liu   +3 more
doaj   +1 more source

Hierarchical Attention and Knowledge Matching Networks With Information Enhancement for End-to-End Task-Oriented Dialog Systems

open access: yes, IEEE Access, 2019
Nowadays, most end-to-end task-oriented dialog systems are based on sequence-to-sequence (Seq2seq), which is an encoder-decoder framework. These systems sometimes produce grammatically correct, but logically incorrect responses.
Junqing He   +4 more
doaj   +1 more source

GuidedMix: An on‐the‐fly data augmentation approach for robust speaker recognition system

open access: yes, Electronics Letters, 2022
Data augmentation is an essential technique for building a high‐robustness speaker recognition system. This letter proposes a novel on‐the‐fly data augmentation strategy called GuidedMix.
Runqiu Xiao   +4 more
doaj   +1 more source

Collecting language, speech acoustics, and facial expression to predict psychosis and other clinical outcomes: strategies from the AMP® SCZ initiative. [PDF]

open access: diamond, Schizophrenia (Heidelb)
Bilgrami ZR   +77 more
europepmc   +3 more sources
