Results 41 to 50 of about 2,183,275 (319)
Effects of feedback, mobility and index of difficulty on deictic spatial audio target acquisition in the horizontal plane [PDF]
We present the results of an empirical study investigating the effect of feedback, mobility and index of difficulty on a deictic spatial audio target acquisition task in the horizontal plane in front of a user.
Brewster, S.A., Marentakis, G.N.
core +3 more sources
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
In this paper we propose a novel virtual simulation-pilot engine for speeding up air traffic controller (ATCo) training by integrating different state-of-the-art artificial intelligence (AI)-based tools.
Juan Zuluaga-Gomez +4 more
doaj +1 more source
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition [PDF]
Audio pattern recognition is an important research topic in the machine learning area, and includes several tasks such as audio tagging, acoustic scene classification, music classification, speech emotion classification and sound event detection ...
Qiuqiang Kong +5 more
semanticscholar +1 more source
Bit rates in audio source coding [PDF]
The goal is to introduce and solve the audio coding optimization problem. Psychoacoustic results such as masking and excitation pattern models are combined with results from rate distortion theory to formulate the audio coding optimization problem.
Veldhuis, Raymond N.J.
core +3 more sources
A germinal manifesto for the audio paper.
Groth, Sanne Krogh, Samson, Kristine
openaire +2 more sources
CNN architectures for large-scale audio classification [PDF]
Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio. We use various CNN architectures to classify the soundtracks of a dataset of 70M training videos (5.24 million hours) with 30,871 video ...
Shawn Hershey +12 more
semanticscholar +1 more source
Audio Caption: Listen and Tell
Increasing amount of research has shed light on machine perception of audio events, most of which concerns detection and classification tasks. However, human-like perception of audio scenes involves not only detecting and classifying audio sounds, but ...
Dinkel, Heinrich, Wu, Mengyue, Yu, Kai
core +1 more source
Latency Performance for Real-Time Audio on BeagleBone Black [PDF]
In this paper we present a set of tests aimed at evaluating the responsiveness of a BeagleBone Black board in real-time interactive audio applications. The default Angstrom Linux distribution was tested without modifying the underlying kernel.
Linux Audio Conference +3 more
core
Large-scale weakly supervised audio classification using gated convolutional neural network [PDF]
In this paper, we present a gated convolutional neural network and a temporal attention-based localization method for audio classification, which won the 1st place in the large-scale weakly supervised sound event detection task of Detection and ...
Kong, Qiuqiang +3 more
core +2 more sources

