AudioLM: A Language Modeling Approach to Audio Generation [PDF]

open access: yes, IEEE/ACM Transactions on Audio Speech and Language Processing, 2022
We introduce AudioLM, a framework for high-quality audio generation with long-term consistency. AudioLM maps the input audio to a sequence of discrete tokens and casts audio generation as a language modeling task in this representation space. We show how ...
Zalán Borsos   +10 more
semanticscholar   +1 more source

Audio phrases for audio event recognition [PDF]

open access: yes, 2015 23rd European Signal Processing Conference (EUSIPCO), 2015
The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a local feature of an audio signal is matched to a code word according to a learned codebook. The signal is then represented by frequencies of the matched code words on the whole signal.
Phan, Huy   +4 more
openaire   +1 more source

Extending Audio Masked Autoencoders toward Audio Restoration

open access: yes, 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
WASPAA 2023. Copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any ...
Zhong, Zhi   +7 more
openaire   +2 more sources

Fall detection from audios with Audio Transformers

open access: yes, Smart Health, 2022
Fall detection for the elderly is a well-researched problem with several proposed solutions, including wearable and non-wearable techniques. While the existing techniques have excellent detection rates, their adoption by the target population is lacking due to the need for wearing devices and user privacy concerns.
Prabhjot Kaur, Qifan Wang, Weisong Shi
openaire   +2 more sources

Classification of Overlapped Audio Events Based on AT, PLSA, and the Combination of Them [PDF]

open access: yes, 2015
Audio event classification, as an important part of Computational Auditory Scene Analysis, has attracted much attention. Currently, the classification technology is mature enough to classify isolated audio events accurately, but for overlapped audio ...
Cheng, C. F.   +7 more
core   +2 more sources

Estimation of acoustic echoes using expectation-maximization methods

open access: yes, EURASIP Journal on Audio, Speech, and Music Processing, 2020
Estimation problems like room geometry estimation and localization of acoustic reflectors are of great interest and importance in robot and drone audition.
Usama Saqib   +2 more
doaj   +1 more source

Audio Inpainting [PDF]

open access: yes, 2012
(c) 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or ...
Adler, A   +5 more
core   +6 more sources

BEATs: Audio Pre-Training with Acoustic Tokenizers [PDF]

open access: yes, International Conference on Machine Learning, 2022
The massive growth of self-supervised learning (SSL) has been witnessed in language, vision, speech, and audio domains over the past few years. While discrete label prediction is widely adopted for other modalities, the state-of-the-art audio SSL models ...
Sanyuan Chen   +6 more
semanticscholar   +1 more source

Erkomaishvili Dataset: A Curated Corpus of Traditional Georgian Vocal Music for Computational Musicology

open access: yes, Transactions of the International Society for Music Information Retrieval, 2020
The analysis of recorded audio material using computational methods has received increased attention in ethnomusicological research. We present a curated dataset of traditional Georgian vocal music for computational musicology.
Sebastian Rosenzweig   +4 more
doaj   +1 more source

Towards Leitmotif Activity Detection in Opera Recordings

open access: yes, Transactions of the International Society for Music Information Retrieval, 2021
This paper approaches the automatic detection of musical patterns in audio recordings with a particular focus on leitmotifs, which are specific types of patterns associated with certain characters, places, items, or feelings occurring in an opera or ...
Michael Krause   +2 more
doaj   +1 more source