AudioLM: A Language Modeling Approach to Audio Generation [PDF]
We introduce AudioLM, a framework for high-quality audio generation with long-term consistency. AudioLM maps the input audio to a sequence of discrete tokens and casts audio generation as a language modeling task in this representation space. We show how
Zalán Borsos +10 more
semanticscholar +1 more source
Audio phrases for audio event recognition [PDF]
The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a local feature of an audio signal is matched to a code word according to a learned codebook. The signal is then represented by frequencies of the matched code words on the whole signal.
Phan, Huy +4 more
openaire +1 more source
Extending Audio Masked Autoencoders toward Audio Restoration
WASPAA 2023. Copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any ...
Zhong, Zhi +7 more
openaire +2 more sources
Fall detection from audios with Audio Transformers
Fall detection for the elderly is a well-researched problem with several proposed solutions, including wearable and non-wearable techniques. While the existing techniques have excellent detection rates, their adoption by the target population is lacking due to the need for wearing devices and user privacy concerns.
Prabhjot Kaur, Qifan Wang, Weisong Shi
openaire +2 more sources
Classification of Overlapped Audio Events Based on AT, PLSA, and the Combination of Them [PDF]
Audio event classification, as an important part of Computational Auditory Scene Analysis, has attracted much attention. Currently, the classification technology is mature enough to classify isolated audio events accurately, but for overlapped audio ...
Cheng, C. F. +7 more
core +2 more sources
Estimation of acoustic echoes using expectation-maximization methods
Estimation problems like room geometry estimation and localization of acoustic reflectors are of great interest and importance in robot and drone audition.
Usama Saqib +2 more
doaj +1 more source
(c) 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or ...
Adler, A +5 more
core +6 more sources
BEATs: Audio Pre-Training with Acoustic Tokenizers [PDF]
The massive growth of self-supervised learning (SSL) has been witnessed in language, vision, speech, and audio domains over the past few years. While discrete label prediction is widely adopted for other modalities, the state-of-the-art audio SSL models ...
Sanyuan Chen +6 more
semanticscholar +1 more source
The analysis of recorded audio material using computational methods has received increased attention in ethnomusicological research. We present a curated dataset of traditional Georgian vocal music for computational musicology.
Sebastian Rosenzweig +4 more
doaj +1 more source
Towards Leitmotif Activity Detection in Opera Recordings
This paper approaches the automatic detection of musical patterns in audio recordings with a particular focus on leitmotifs, which are specific types of patterns associated with certain characters, places, items, or feelings occurring in an opera or ...
Michael Krause +2 more
doaj +1 more source

