Results 251 to 260 of about 4,974,855 (283)
Some of the next articles are maybe not open access.
Emotion recognition on the basis of audio signal using Naive Bayes classifier
International Conference on Advances in Computing, Communications and Informatics, 2016In this paper we have studied and implemented the classification of audio signal into four basic emotional state. For that we have considered different statistical features of pitch, energy, and ZCR (Zero Crossing Rate) MFCC (Mel frequency cepstral ...
Sagar K. Bhakre, Arti V. Bang
semanticscholar +1 more source
A novel method for malware detection using audio signal processing techniques
2016 Artificial Intelligence and Robotics (IRANOPEN), 2016Malware is any kind of program that is designed to perform malicious activity in computers and networks. To evade traditional signature-based malware detection techniques, malware developers employ obfuscation techniques.
Mehrdad Farrokhmanesh, A. Hamzeh
semanticscholar +1 more source
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
NeurIPS Datasets and Benchmarks, 2021Deep generative modeling has the potential to cause significant harm to society. Recognizing this threat, a magnitude of research into detecting so-called "Deepfakes" has emerged.
J. Frank, Lea Schönherr
semanticscholar +1 more source
An audio signal based model for condition monitoring of sheet metal stamping process
2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA), 2015Tool condition monitoring is an important factor in ensuring manufacturing efficiency and product quality. Audio signal based methods are a promising technique for condition monitoring.
I. Ubhayaratne+3 more
semanticscholar +1 more source
arXiv.org
We introduce the latest progress of Qwen-Audio, a large-scale audio-language model called Qwen2-Audio, which is capable of accepting various audio signal inputs and performing audio analysis or direct textual responses with regard to speech instructions.
Yunfei Chu+11 more
semanticscholar +1 more source
We introduce the latest progress of Qwen-Audio, a large-scale audio-language model called Qwen2-Audio, which is capable of accepting various audio signal inputs and performing audio analysis or direct textual responses with regard to speech instructions.
Yunfei Chu+11 more
semanticscholar +1 more source
madmom: A New Python Audio and Music Signal Processing Library
ACM Multimedia, 2016In this paper, we present madmom, an open-source audio processing and music information retrieval (MIR) library written in Python. madmom features a concise, NumPy-compatible, object oriented design with simple calling conventions and sensible default ...
Sebastian Böck+4 more
semanticscholar +1 more source
Detecting symptoms of diseases in poultry through audio signal processing
IEEE Global Conference on Signal and Information Processing, 2014We developed an audio signal processing algorithm that detects rales (gurgling noises that are a distinct symptom of common respiratory diseases in poultry).
Brandon T. Carroll+5 more
semanticscholar +1 more source
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
European Conference on Computer Vision, 2018The thud of a bouncing ball, the onset of speech as lips open -- when visual and audio events occur together, it suggests that there might be a common, underlying event that produced both signals.
Andrew Owens, Alexei A. Efros
semanticscholar +1 more source
Audio augmentation for speech recognition
Interspeech, 2015Data augmentation is a common strategy adopted to increase the quantity of training data, avoid overfitting and improve robustness of the models. In this paper, we investigate audio-level speech augmentation methods which directly process the raw signal.
Tom Ko+3 more
semanticscholar +1 more source
International Journal of Computational Mathematics, 2013
In this paper, a novel semi-fragile watermarking scheme for authenticating an audio signal based on dual-tree complex wavelet transform (DT-CWT) and discrete cosine transform (DCT) is proposed.
Mingquan Fan+3 more
semanticscholar +1 more source
In this paper, a novel semi-fragile watermarking scheme for authenticating an audio signal based on dual-tree complex wavelet transform (DT-CWT) and discrete cosine transform (DCT) is proposed.
Mingquan Fan+3 more
semanticscholar +1 more source