Multi-label Zero-Shot Audio Classification with Temporal Attention
Zero-shot learning models can classify samples from new classes by transferring knowledge from seen classes using auxiliary information. While most existing zero-shot learning methods have focused on single-label classification tasks, the present study introduces a method for multi-label zero-shot audio classification.
arxiv
DAVE: Diagnostic benchmark for Audio Visual Evaluation
Audio-visual understanding is a rapidly evolving field that seeks to integrate and interpret information from both auditory and visual modalities. Despite recent advances in multi-modal learning, existing benchmarks often suffer from strong visual bias -- where answers can be inferred from visual data alone -- and provide only aggregate scores that ...
arxiv
Towards long double-stranded chains and robust DNA-based data storage using the random code system.
Yang X+5 more
europepmc
A comprehensive voice dataset for Hindko digit recognition.
Ahmed T+4 more
europepmc
Machine learning model for reproducing subjective sensations and alleviating sound-induced stress in individuals with developmental disorders.
Ichikawa I+3 more
europepmc
Performance evaluation of deep learning techniques for lung cancer prediction.
Deepapriya BS+6 more
europepmc
Implementation of deep reinforcement learning models for emotion detection and personalization of learning in hybrid educational environments.
Govea J+3 more
europepmc
GFDM-OQAM Performance Analysis Using Linear Equalization for Audio Transmission
Anggun Fitrian Isnawati+3 more
openalex
Optimized Acoustic Phantom Design for Characterizing Body Sound Sensors.
Rennoll V+3 more
europepmc