Results 51 to 60 of about 934,756 (199)
AUTOMATIC MUSIC TRANSCRIPTION USING ROW WEIGHTED DECOMPOSITIONS [PDF]
(c) 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or ...
IEEE, O'Hanlon, K, Plumbley, MD
core +1 more source
Applying deep matching networks to Chinese medical question answering: a study and a dataset
Background Medical and clinical question answering (QA) is highly concerned by researchers recently. Though there are remarkable advances in this field, the development in Chinese medical domain is relatively backward.
Junqing He, Mingming Fu, Manshu Tu
doaj +1 more source
ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES [PDF]
(c)2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or ...
Ewert, S, IEEE, Plumbley, MD, Sandler, M
core +1 more source
This study aims to systematically review original articles investigating the link between spectral acoustic measures in healthy talkers and perceived speech intelligibility, according to the PRISMA guidelines. Twenty-two studies were retained.
Timothy Pommée+5 more
semanticscholar +1 more source
As demonstrated in hybrid connectionist temporal classification (CTC)/Attention architecture, joint training with a CTC objective is very effective to solve the misalignment problem existing in the attention-based end-to-end automatic speech recognition (
Long Wu, Ta Li, Li Wang, Yonghong Yan
doaj +1 more source
Modeling Speech Sound Radiation With Different Degrees of Realism for Articulatory Synthesis
Articulatory synthesis is based on modeling various physical phenomena of speech production, including sound radiation from the mouth. With regard to sound radiation, the most common approach is to approximate it in terms of a simple spherical source of ...
Peter Birkholz+5 more
doaj +1 more source
Query-by-Example with Acoustic Word Embeddings Using wav2vec Pretraining [PDF]
Query-by-Example is a popular keyword detection method in the absence of speech resources.It can build a keyword query system with excellent performance when there are few labeled voice resources and a lack of pronunciation dictionaries.In recent years ...
LI Zhao-qi, LI Ta
doaj +1 more source
Chinese Dialogue Intention Classification Based on Multi-Model Ensemble
In dialogue systems, understanding the user utterances is crucial for providing appropriate responses. A traditional dialogue act classification (DA) task is to classify each user reply into “ACCEPT, REJECT, PROPOSE, and others”.
Manshu Tu, Bing Wang, Xuemin Zhao
doaj +1 more source
An Archaeoacoustics Analysis of Cistercian Architecture: The Case of the Beaulieu Abbey
The Cistercian order is of acoustic interest because previous research has hypothesized that Cistercian architectural structures were designed for longer reverberation times in order to reinforce Gregorian chants.
Sebastian Duran+2 more
doaj +1 more source
Modeling the perception of children's age from speech acoustics.
Adult listeners were presented with /hVd/ syllables spoken by boys and girls ranging from 5 to 18 years of age. Half of the listeners were informed of the sex of the speaker; the other half were not. Results indicate that veridical age in children can be
Santiago Barreda, P. Assmann
semanticscholar +1 more source