Results 51 to 60 of about 132,260 (194)
Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI
Vocal tract configurations play a vital role in generating distinguishable speech sounds, by modulating the airflow and creating different resonant cavities in speech production. They contain abundant information that can be utilized to better understand
Fels, Sidney +2 more
core +1 more source
Two-Stage Domain Adaptation for LLM-Based ASR by Decoupling Linguistic and Acoustic Factors
Large language models (LLMs) have been increasingly applied in Automatic Speech Recognition (ASR), achieving significant advancements. However, the performance of LLM-based ASR (LLM-ASR) models remains unsatisfactory when applied across domains due to ...
Lin Zheng +3 more
doaj +1 more source
How classroom acoustics influence students and teachers: A systematic literature review
Acoustics in schools have been studied during years, but nowadays there are more possibilities than ever before to introduce improvements. This study presents a systematic literature review determining what acoustic parameters are present in classrooms ...
Jordi Mogas Recalde +2 more
doaj +1 more source
POLYPHONIC PIANO TRANSCRIPTION USING NON-NEGATIVE MATRIX FACTORISATION WITH GROUP SPARSITY [PDF]
(c)2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or ...
IEEE, O'Hanlon, K, Plumbley, MD
core
The Assessment of Acoustical Characteristics for Recent Mosque Buildings in Erbil City of Iraq
The study of mosque acoustics, concerning acoustical features, sound quality for speech intelligibility, and additional practical acoustic criteria, is commonly overlooked.
Dawa A. A. Masih +3 more
doaj +1 more source
Broadband DOA estimation using Convolutional neural networks trained with noise signals
A convolution neural network (CNN) based classification method for broadband DOA estimation is proposed, where the phase component of the short-time Fourier transform coefficients of the received microphone signals are directly fed into the CNN and the ...
Chakrabarty, Soumitro +1 more
core +1 more source
Head-Related Transfer Functions (HRTFs) play a vital role in binaural spatial audio rendering. With the release of numerous HRTF datasets in recent years, abundant data has become available to support HRTF-related research based on deep learning. However,
Jiale Zhao, Dingding Yao, Junfeng Li
doaj +1 more source
Deep neural networks (DNNs) have been shown to be effective for single sound source localization in shallow water environments. However, multiple source localization is a more challenging task because of the interactions among multiple acoustic signals ...
Zhaoqiong Huang +4 more
doaj +1 more source
BEHAVIOR OF GREEDY SPARSE REPRESENTATION ALGORITHMS ON NESTED SUPPORTS [PDF]
© 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new ...
IEEE, Mailhe, B, Plumbley, MD, Sturm, B
core
Deep Dynamic Network Embedding for Link Prediction
Network embedding task aims at learning low-dimension latent representations of vertices while preserving the structure of a network simultaneously. Most existing network embedding methods mainly focus on static networks, which extract and condense the ...
Taisong Li +4 more
doaj +1 more source

