Speech acoustics - Open Access .click

Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI

2018
Vocal tract configurations play a vital role in generating distinguishable speech sounds, by modulating the airflow and creating different resonant cavities in speech production. They contain abundant information that can be utilized to better understand
Fels, Sidney, Saha, Pramit, Srungarapu, Praneeth +2 more
core +1 more source

Two-Stage Domain Adaptation for LLM-Based ASR by Decoupling Linguistic and Acoustic Factors

Applied Sciences
Large language models (LLMs) have been increasingly applied in Automatic Speech Recognition (ASR), achieving significant advancements. However, the performance of LLM-based ASR (LLM-ASR) models remains unsatisfactory when applied across domains due to ...
Lin Zheng, Xuyang Wang, Qingwei Zhao, Ta Li +3 more
doaj +1 more source

How classroom acoustics influence students and teachers: A systematic literature review

Journal of Technology and Science Education, 2021
Acoustics in schools has been studied for years, but nowadays there are more possibilities than ever before to introduce improvements. This study presents a systematic literature review determining what acoustic parameters are present in classrooms ...
Jordi Mogas Recalde, Ramon Palau, Marian Márquez +2 more
doaj +1 more source

Polyphonic Piano Transcription Using Non-Negative Matrix Factorisation with Group Sparsity

2014
O'Hanlon, K, Plumbley, MD
core

The Assessment of Acoustical Characteristics for Recent Mosque Buildings in Erbil City of Iraq

ARO-The Scientific Journal of Koya University, 2021
The study of mosque acoustics, concerning acoustical features, sound quality for speech intelligibility, and additional practical acoustic criteria, is commonly overlooked.
Dawa A. A. Masih +3 more
doaj +1 more source

Broadband DOA estimation using Convolutional neural networks trained with noise signals

2017
A convolutional neural network (CNN) based classification method for broadband DOA estimation is proposed, where the phase component of the short-time Fourier transform coefficients of the received microphone signals is directly fed into the CNN and the ...
Chakrabarty, Soumitro, Habets, Emanuël A. P. +1 more
core +1 more source

Cross-Dataset Head-Related Transfer Function Harmonization Based on Perceptually Relevant Loss Function

IEEE Open Journal of Signal Processing
Head-Related Transfer Functions (HRTFs) play a vital role in binaural spatial audio rendering. With the release of numerous HRTF datasets in recent years, abundant data has become available to support HRTF-related research based on deep learning. However,
Jiale Zhao, Dingding Yao, Junfeng Li
doaj +1 more source

Multiple Source Localization in a Shallow Water Waveguide Exploiting Subarray Beamforming and Deep Neural Networks

Sensors, 2019
Deep neural networks (DNNs) have been shown to be effective for single sound source localization in shallow water environments. However, multiple source localization is a more challenging task because of the interactions among multiple acoustic signals ...
Zhaoqiong Huang +4 more
doaj +1 more source

Behavior of Greedy Sparse Representation Algorithms on Nested Supports

2013
Mailhe, B, Plumbley, MD, Sturm, B
core

Deep Dynamic Network Embedding for Link Prediction

IEEE Access, 2018
The network embedding task aims to learn low-dimensional latent representations of vertices while preserving the structure of a network. Most existing network embedding methods mainly focus on static networks, which extract and condense the ...
Taisong Li +4 more
doaj +1 more source
