Results 61 to 70 of about 4,447 (146)

Towards a style-specific basis for computational beat tracking [PDF]

open access: yes, 2006
Outlined in this paper are a number of sources of evidence, from psychological, ethnomusicological and engineering grounds, to suggest that current approaches to computational beat tracking are incomplete.
Collins, Nick
core   +2 more sources

The bag-of-frames approach: a not so sufficient model for urban soundscapes [PDF]

open access: yes, 2015
The "bag-of-frames" approach (BOF), which encodes audio signals as the long-term statistical distribution of short-term spectral features, is commonly regarded as an effective and sufficient way to represent environmental sound recordings (soundscapes ...
Aucouturier, Jean-Julien   +3 more
core   +2 more sources

Contrastive Loss Based Frame-Wise Feature Disentanglement for Polyphonic Sound Event Detection

open access: yesICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Comment: accepted by ...
Guan, Yadong   +6 more
openaire   +2 more sources

Weakly-Supervised Temporal Localization via Occurrence Count Learning [PDF]

open access: yes, 2019
We propose a novel model for temporal detection and localization which allows the training of deep neural networks using only counts of event occurrences as training labels. This powerful weakly-supervised framework alleviates the burden of the imprecise
Marshall, David   +2 more
core   +2 more sources

Polyphonic sound event localization and detection based on Multiple Attention Fusion ResNet

open access: yesMathematical Biosciences and Engineering
<abstract> <p>Sound event localization and detection have been applied in various fields. Due to the polyphony and noise interference, it becomes challenging to accurately predict the sound event and their occurrence locations. Aiming at this problem, we propose a Multiple Attention Fusion ResNet, which uses ResNet34 as the base network ...
Shouming Zhang   +5 more
openaire   +3 more sources

Input features for deep learning-based polyphonic sound event localization and detection

open access: yes, 2023
Sound event localization and detection (SELD) is an emerging research topic that combines the tasks of sound event detection (SED) and direction-of-arrival estimation (DOAE). The SELD task aims to jointly recognize the sound classes and estimate the directions of arrival (DOAs) and the temporal activities of detected sound events.
openaire   +2 more sources

Analysis and Acoustic Event Classification of Environmental Data Collected in a Citizen Science Project. [PDF]

open access: yesInt J Environ Res Public Health, 2023
Bonet-Solà D   +2 more
europepmc   +1 more source

Unlocking the musical brain: A proof-of-concept study on playing the piano in MRI scanner with naturalistic stimuli. [PDF]

open access: yesHeliyon, 2023
Olszewska AM   +8 more
europepmc   +1 more source

Recurrent neural networks for polyphonic sound event detection

open access: yes, 2015
The objective of this thesis is to investigate how a deep learning model called recurrent neural network (RNN) performs in the task of detecting overlapping sound events in real life environments. Examples of such sound events include dog barking, footsteps, and crowd applauding.
openaire   +1 more source

Home - About - Disclaimer - Privacy