Results 151 to 160 of about 10,582 (198)
Some of the next articles are maybe not open access.
Using Psychoacoustic Models for Sound Analysis in Music
Proceedings of the 8th Annual Meeting of the Forum for Information Retrieval Evaluation, 2016Overall sound perception of a song is an important attribute of music. Several psychoacoustic models have been studied to extract perceptual sound qualities from audio signals. By means of listening tests, we investigate whether these sound models successfully reflect (inter-)subjective perception of sound resemblance in music.
Tim Ziemer, Yi Yu 0001, Suhua Tang
openaire +1 more source
Optimization of Masking Expansion Algorithm in Psychoacoustic Models
2011 2nd International Symposium on Intelligence Information Processing and Trusted Computing, 2011MPEG-4 AAC audio coding is the most widely used audio coding at present, but the MPEG-4 AAC audio coding standard has high complexity, long time delay and huge computation, what's more, it is not beneficial for real-time applications. Psychoacoustic model is the core part of the audio encoder, so huge computation also exists.
Hong-fu Liu, Cong Zhang, Rui-fan Liang
openaire +1 more source
EMD and psychoacoustic model based watermarking for audio
2010 IEEE International Conference on Multimedia and Expo, 2010The audio watermarking method proposed in this paper offers the copyright protection to an audio without the use of the original signal for watermark detection. The analysis filterbank decomposition, the psychoacoustic model and the empirical mode decomposition (EMD) are the three key techniques used in the novel audio watermarking method.
Liang Wang 0001 +2 more
openaire +1 more source
Psychoacoustic Models for Heart Sounds
2011The phonocardiography (PCG) — the art and science of recording and interpreting of heart sounds using latest digital technology has significantly helped us to understand and interpret the complex heart sounds (normal, abnormal sounds including murmurs) and in particular valvular diseases.
Kiran Kumari Patil +2 more
openaire +1 more source
Psychoacoustic Models for Audio Coding
2003In the prior chapter we learned about the limits to human hearing. We learned about the threshold in quiet or hearing threshold below which sounds are inaudible. The hearing threshold is very important to coder design because it represents frequency-dependent levels below which quantization noise levels will be inaudible.
Marina Bosi, Richard E. Goldberg
openaire +1 more source
Speech recognition enhancement by psychoacoustic modeled noise suppression
2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763), 2005This paper proposes a spectral subtraction based speech enhancement algorithm that improves computer based speech recognition. Speech recognition can not be improved by traditional spectral subtraction techniques because of the associated artifacts, such as musical noise.
Yiu-Pong Lai +3 more
openaire +1 more source
Direct MDCT Domain Psychoacoustic Modeling
2007 IEEE International Symposium on Signal Processing and Information Technology, 2007We extend the recently proposed spectral integration based psychoacoustic model for sinusoidal distortions to the MDCT domain. The estimated masking threshold additionally depends on the sub-band spectral flatness measure of the signal which accounts for the non- sinusoidal distortion introduced by masking.
K Suresh, T.V. Sreenivas
openaire +1 more source
An excitation level based psychoacoustic model for audio compression
Proceedings of the seventh ACM international conference on Multimedia (Part 1), 1999This paper describes an excitation level based psychoacoustic model to estimate the simultaneous masking threshold for audio coding. The system has the following stages: 1) a windowing function; 2) a time-to-frequency transformation; 3) an excitation level calculation block similar to that in Moore and Glasberg's loudness model; 4) a correction factor ...
Ye Wang 0007, Miikka Vilermo
openaire +1 more source
Incorporating a Psychoacoustical Model in Frequency Domain Speech Enhancement
IEEE Signal Processing Letters, 2004A frequency domain optimal linear estimator is proposed which incorporates the masking properties of the human auditory system to make the residual noise distortion inaudible. The use of wavelet-thresholded multitaper spectra is also proposed for frequency-domain speech enhancement methods as an alternative to the traditional fast Fourier transform ...
Yi Hu, Philipos C. Loizou
openaire +1 more source
A new psychoacoustical masking model for audio coding applications
IEEE International Conference on Acoustics Speech and Signal Processing, 2002The use of psychoacoustical masking models for audio coding applications has been wide spread over the past decades. In such applications, it is typically assumed that the original input signal serves as a masker for the distortions that are introduced by the lossy coding method that is used.
van de Par, Steven L.J.D.E. +3 more
openaire +1 more source

