Results 71 to 80 of about 84,378 (284)
During the last decade, audio streams became an essential and fast means of communication through personal and business applications including social media and telehealth applications.
Mohammad Nassef
doaj +1 more source
Learning to detect dysarthria from raw speech
Speech classifiers of paralinguistic traits traditionally learn from diverse hand-crafted low-level features, by selecting the relevant information for the task at hand. We explore an alternative to this selection, by learning jointly the classifier, and
Millet, Juliette, Zeghidour, Neil
core
MusicSwarm: Biologically Inspired Intelligence for Music Composition
Biologically inspired swarms of frozen foundation models self‐organize to compose complex music without fine‐tuning. By coordinating through stigmergic signals, decentralized agents dynamically evolve specialized roles and adapt to solve complex tasks.
Markus J. Buehler
wiley +1 more source
Overview of the proposed Gate‐Align‐SED, including two stages of training: (1) Mean‐Teacher SSL Training; and (2) Enhancer Model Training. In complex real‐world environments such as disaster monitoring, effective sound event detection (SED) is often hindered by the presence of noise and limited labeled data.
Jieli Chen +4 more
wiley +1 more source
A Multimedia Application: Spatial Perceptual Entropy of Multichannel Audio Signals
Usually multimedia data have to be compressed before transmitting, and higher compression rate, or equivalently lower bitrate, relieves the load of communication channels but impacts negatively the quality.
Shuixian Chen, Ruimin Hu, Naixue Xiong
doaj +2 more sources
Integration of Road Data Collected Using LSB Audio Steganography
Modern traffic-monitoring systems increasingly rely on supplemental analytical data to complement video recordings, yet such data are rarely integrated into video containers without altering the original footage.
Adam Stančić +3 more
doaj +1 more source
The effect of dynamic range compression on the psychoacoustic quality and loudness of commercial music [PDF]
It is common practice for music productions to be mastered with the aim of increasing the perceived loudness for the listener, allowing one record to stand out from another by delivering an immediate impact and intensity.
Campbell, William +2 more
core +1 more source
Group-theoretic structure of linear phase multirate filter banks
Unique lifting factorization results for group lifting structures are used to characterize the group-theoretic structure of two-channel linear phase FIR perfect reconstruction filter bank groups. For D-invariant, order-increasing group lifting structures,
Brislawn, Christopher M.
core +1 more source
ABSTRACT The rapid advancement of large language model (LLM) technology is profoundly transforming the practice of social science research. Scholarly discussions on Artificial Intelligence (AI)'s role in social science research can be organised into three levels: AI as a research tool, AI as a methodological infrastructure and AI as a quasi‐cognitive ...
Jie Xiong
wiley +1 more source
Proprietary software tools as learning aids [PDF]
Proprietary software tools, though not designed for educational use, have considerable educational potential. This paper describes, as case studies, the use of proprietary graphics- and audio-editing tools in two distance-taught courses produced by the ...
Jones, Allan
core

