Towards a style-specific basis for computational beat tracking [PDF]
Outlined in this paper are a number of sources of evidence, from psychological, ethnomusicological and engineering grounds, to suggest that current approaches to computational beat tracking are incomplete.
Collins, Nick
core +2 more sources
The bag-of-frames approach: a not so sufficient model for urban soundscapes [PDF]
The "bag-of-frames" approach (BOF), which encodes audio signals as the long-term statistical distribution of short-term spectral features, is commonly regarded as an effective and sufficient way to represent environmental sound recordings (soundscapes ...
Aucouturier, Jean-Julien +3 more
core +2 more sources
Contrastive Loss Based Frame-Wise Feature Disentanglement for Polyphonic Sound Event Detection
Comment: accepted by ...
Guan, Yadong +6 more
openaire +2 more sources
Weakly-Supervised Temporal Localization via Occurrence Count Learning [PDF]
We propose a novel model for temporal detection and localization which allows the training of deep neural networks using only counts of event occurrences as training labels. This powerful weakly-supervised framework alleviates the burden of the imprecise
Marshall, David +2 more
core +2 more sources
Polyphonic sound event localization and detection based on Multiple Attention Fusion ResNet
<abstract> <p>Sound event localization and detection have been applied in various fields. Due to the polyphony and noise interference, it becomes challenging to accurately predict the sound event and their occurrence locations. Aiming at this problem, we propose a Multiple Attention Fusion ResNet, which uses ResNet34 as the base network ...
Shouming Zhang +5 more
openaire +3 more sources
Input features for deep learning-based polyphonic sound event localization and detection
Sound event localization and detection (SELD) is an emerging research topic that combines the tasks of sound event detection (SED) and direction-of-arrival estimation (DOAE). The SELD task aims to jointly recognize the sound classes and estimate the directions of arrival (DOAs) and the temporal activities of detected sound events.
openaire +2 more sources
Analysis and Acoustic Event Classification of Environmental Data Collected in a Citizen Science Project. [PDF]
Bonet-Solà D +2 more
europepmc +1 more source
Harmonizing minds and machines: survey on transformative power of machine learning in music. [PDF]
Liang J.
europepmc +1 more source
Unlocking the musical brain: A proof-of-concept study on playing the piano in MRI scanner with naturalistic stimuli. [PDF]
Olszewska AM +8 more
europepmc +1 more source
Recurrent neural networks for polyphonic sound event detection
The objective of this thesis is to investigate how a deep learning model called recurrent neural network (RNN) performs in the task of detecting overlapping sound events in real life environments. Examples of such sound events include dog barking, footsteps, and crowd applauding.
openaire +1 more source

