Elastic CRFs for Open-Ontology Slot Filling
Slot filling is a crucial component in task-oriented dialog systems that is used to parse (user) utterances into semantic concepts called slots. An ontology is defined by the collection of slots and the values that each slot can take.
Yinpei Dai +5 more
doaj +1 more source
Predicting future reading problems based on pre-reading auditory measures: a longitudinal study of children with a familial risk of dyslexia [PDF]
Purpose: This longitudinal study examines measures of temporal auditory processing in pre-reading children with a family risk of dyslexia. Specifically, it attempts to ascertain whether pre-reading auditory processing, speech perception, and ...
Ghesquière, Pol +3 more
core +1 more source
Robust Speaker Recognition Using Speech Enhancement And Attention Model [PDF]
In this paper, a novel architecture for speaker recognition is proposed by cascading speech enhancement and speaker processing. Its aim is to improve speaker recognition performance when speech signals are corrupted by noise.
Hain, Thomas, Huang, Qiang, Shi, Yanpei
core +2 more sources
Effective Exploitation of Posterior Information for Attention-Based Speech Recognition
End-to-end attention-based modeling is increasingly popular for tackling sequence-to-sequence mapping tasks. Traditional attention mechanisms utilize prior input information to derive attention, which then conditions the output.
Jian Tang +4 more
doaj +1 more source
Packet Loss Concealment Based on Phase Correction and Deep Neural Network
In a packet switching network, the performance of packet loss concealment (PLC) is often affected by inaccurate estimation of phase spectrum of speech signal in the lost packet.
Qiang Ji, Changchun Bao, Zihao Cui
doaj +1 more source
Federated Learning for privacy-Friendly Health Apps: A Case Study on Ovulation Tracking
In an era of increasing reliance on digital health solutions, safeguarding user privacy has emerged as a paramount concern. Health applications often need to balance advanced AI functionalities with sufficient privacy measures to ensure user engagement ...
Nikolaos Pavlidis +12 more
doaj +1 more source
Profiles of Dysarthria: Clinical Assessment and Treatment
In recent decades, we have witnessed a wealth of theoretical work and proof-of-principle studies on dysarthria, including descriptions and classifications of dysarthric speech patterns, new and refined assessment methods, and innovative experimental ...
Wolfram Ziegler +2 more
doaj +1 more source
Dataset of directional room impulse responses for realistic speech data
Obtaining real-world multi-channel speech recordings is expensive and time-consuming. Therefore, multi-channel recordings are often artificially generated by convolving existing monaural speech recordings with simulated Room Impulse Responses (RIRs) from
Stefan Fragner +3 more
doaj +1 more source
Segment boundary detection directed attention for online end-to-end speech recognition
Attention-based encoder-decoder models have recently shown competitive performance for automatic speech recognition (ASR) compared to conventional ASR systems.
Junfeng Hou +3 more
doaj +1 more source
Speech Processing in Computer Vision Applications [PDF]
Deep learning has been recently proven to be a viable asset in determining features in the field of Speech Analysis. Deep learning methods like Convolutional Neural Networks facilitate the expansion of specific feature information in waveforms, allowing ...
Waterworth, Nicholas
core +2 more sources

