Speech processing - Open Access .click

Results 41 to 50 of about 18,884,699 (343)

Elastic CRFs for Open-Ontology Slot Filling

Applied Sciences, 2021
Slot filling is a crucial component in task-oriented dialog systems that is used to parse (user) utterances into semantic concepts called slots. An ontology is defined by the collection of slots and the values that each slot can take.
Yinpei Dai +5 more
doaj +1 more source

Predicting future reading problems based on pre-reading auditory measures: a longitudinal study of children with a familial risk of dyslexia [PDF]

, 2017
Purpose: This longitudinal study examines measures of temporal auditory processing in pre-reading children with a family risk of dyslexia. Specifically, it attempts to ascertain whether pre-reading auditory processing, speech perception, and ...
Ghesquière, Pol +3 more
core +1 more source

Robust Speaker Recognition Using Speech Enhancement And Attention Model [PDF]

, 2020
In this paper, a novel architecture for speaker recognition is proposed by cascading speech enhancement and speaker processing. Its aim is to improve speaker recognition performance when speech signals are corrupted by noise.
Hain, Thomas, Huang, Qiang, Shi, Yanpei
core +2 more sources

Effective Exploitation of Posterior Information for Attention-Based Speech Recognition

IEEE Access, 2020
End-to-end attention-based modeling is increasingly popular for tackling sequence-to-sequence mapping tasks. Traditional attention mechanisms utilize prior input information to derive attention, which then conditions the output.
Jian Tang +4 more
doaj +1 more source

Packet Loss Concealment Based on Phase Correction and Deep Neural Network

Applied Sciences, 2022
In a packet switching network, the performance of packet loss concealment (PLC) is often affected by inaccurate estimation of phase spectrum of speech signal in the lost packet.
Qiang Ji, Changchun Bao, Zihao Cui
doaj +1 more source

Federated Learning for privacy-Friendly Health Apps: A Case Study on Ovulation Tracking

Journal of Sensor and Actuator Networks
In an era of increasing reliance on digital health solutions, safeguarding user privacy has emerged as a paramount concern. Health applications often need to balance advanced AI functionalities with sufficient privacy measures to ensure user engagement ...
Nikolaos Pavlidis +12 more
doaj +1 more source

Profiles of Dysarthria: Clinical Assessment and Treatment

Brain Sciences, 2023
In recent decades, we have witnessed a wealth of theoretical work and proof-of-principle studies on dysarthria, including descriptions and classifications of dysarthric speech patterns, new and refined assessment methods, and innovative experimental ...
Wolfram Ziegler, Anja Staiger, Theresa Schölderle +2 more
doaj +1 more source

Dataset of directional room impulse responses for realistic speech data

Data in Brief
Obtaining real-world multi-channel speech recordings is expensive and time-consuming. Therefore, multi-channel recordings are often artificially generated by convolving existing monaural speech recordings with simulated Room Impulse Responses (RIRs) from
Stefan Fragner +3 more
doaj +1 more source

Segment boundary detection directed attention for online end-to-end speech recognition

EURASIP Journal on Audio, Speech, and Music Processing, 2020
Attention-based encoder-decoder models have recently shown competitive performance for automatic speech recognition (ASR) compared to conventional ASR systems.
Junfeng Hou, Wu Guo, Yan Song, Li-Rong Dai +3 more
doaj +1 more source

Speech Processing in Computer Vision Applications [PDF]

, 2020
Deep learning has been recently proven to be a viable asset in determining features in the field of Speech Analysis. Deep learning methods like Convolutional Neural Networks facilitate the expansion of specific feature information in waveforms, allowing ...
Waterworth, Nicholas
core +2 more sources

computer science
speech recognition
psychology

medicine
audiology
cognitive psychology

neuroscience
philosophy
linguistics