Results 51 to 60 of about 4,974,855 (283)
Topological fingerprints for audio identification [PDF]
We present a topological audio fingerprinting approach for robustly identifying duplicate audio tracks. Our method applies persistent homology on local spectral decompositions of audio signals, using filtered cubical complexes computed from mel-spectrograms.
arxiv
Abstract Non‐physical abuse is a form of intimate partner violence (IPV), which negatively impacts physical and mental well‐being. The study objectives were to understand the process of support seeking amongst women who experience non‐physical IPV. Interviews were conducted with women who have experience of non‐physical IPV and support workers.
Karishma Doolabh+2 more
wiley +1 more source
AudioSR: Versatile Audio Super-resolution at Scale [PDF]
Audio super-resolution is a fundamental task that predicts high-frequency components for low-resolution audio, enhancing audio quality in digital applications. Previous methods have limitations such as the limited scope of audio types (e.g., music, speech) and specific bandwidth settings they can handle (e.g., 4kHz to 8kHz). In this paper, we introduce
arxiv
Deep Audio-Visual Speech Recognition [PDF]
The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an open-world ...
Triantafyllos Afouras+4 more
semanticscholar +1 more source
A Novel Dentary Bone Conduction Device Equipped with Laser Communication in DSP
In this study, we designed a dentary bone conduction system that transmits and receives audio by laser. The main objective of this research was to propose a complete hardware design method, including a laser audio transmitter and receiver and digital ...
Jau-Woei Perng+2 more
doaj +1 more source
Clotho: an Audio Captioning Dataset [PDF]
Audio captioning is the novel task of general audio content description using free text. It is an intermodal translation task (not speech-to-text), where a system accepts as an input an audio signal and outputs the textual description (i.e.
K. Drossos+2 more
semanticscholar +1 more source
Commissioning evaluation of a deviceless 4DCT scanner
Abstract Background The utilization of four‐dimensional computed tomography (4DCT) for radiation therapy has not seen major advances to the method of data binning since shortly after inception. Recently there is increased interest in the utilization of an alternative binning method rather than more established techniques.
Hunter Tillery+2 more
wiley +1 more source
pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis
Audio information plays a rather important role in the increasing digital content that is available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automations and surveillance ...
Theodoros Giannakopoulos
semanticscholar +1 more source
Objective Despite knowledge that health outcomes vary according to patient characteristics, identity, and geography, including underrepresented populations in arthritis research remains a challenge. We conducted interviews to explore how researchers in arthritis have used equity, diversity, and inclusion (EDI) principles to inform their research ...
Megan M. Thomas+8 more
wiley +1 more source
The domain of spatial audio comprises methods for capturing, processing, and reproducing audio content that contains spatial information. Data-based methods are those that operate directly on the spatial information carried by audio signals.
M. Cobos+3 more
semanticscholar +1 more source