Results 1 to 10 of about 598 (156)

Cochleagram to Recognize Dysphonia: Auditory Perceptual Analysis for Health Informatics [PDF]

open access: yesIEEE Access
The spectral images provide the dynamic characteristics of the voice signal in the time and frequency domains. However, extracting the predominant spectral features from the voice samples is still challenging.
Rumana Islam   +2 more
exaly   +7 more sources

Attitude Recognition Using Multi-resolution Cochleagram Features [PDF]

open access: yesICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019
Attitudes play an important role in human communication.Models and algorithms for automatic recognition of attitudes therefore may have applications in areas where successful communication and interaction are crucial, such as healthcare, education and ...
Haider, Fasih, Luz, Saturnino
core   +6 more sources

Multi-Level Attention-Based Categorical Emotion Recognition Using Modulation-Filtered Cochleagram

open access: yesApplied Sciences (Switzerland), 2023
Speech emotion recognition is a critical component for achieving natural human–robot interaction. The modulation-filtered cochleagram is a feature based on auditory modulation perception, which contains multi-dimensional spectral–temporal modulation ...
Zhichao Peng, Wenhua He, Yongwei Li
exaly   +4 more sources

STRFs in primary auditory cortex emerge from masking-based statistics of natural sounds. [PDF]

open access: yesPLoS Computational Biology, 2019
We investigate how the neural processing in auditory cortex is shaped by the statistics of natural sounds. Hypothesising that auditory cortex (A1) represents the structural primitives out of which sounds are composed, we employ a statistical model to ...
Abdul-Saboor Sheikh   +6 more
doaj   +8 more sources

An Attack-Independent Audio Forgery Detection Technique Based on Cochleagram Images of Segments With Dynamic Threshold

open access: yesIEEE Access
Thanks to advanced audio editing software, speech recordings can be tampered with very quickly. If the speech recordings are used as forensic evidence, adding the audio recordings together, cutting them, and changing their content are legally ...
Beste Ustubioglu
exaly   +4 more sources

Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection [PDF]

open access: yesSensors, 2023
Singing-voice separation is a separation task that involves a singing voice and musical accompaniment. In this paper, we propose a novel, unsupervised methodology for extracting a singing voice from the background in a musical mixture.
Feng Li, Yujun Hu, Lingling Wang
doaj   +2 more sources

Deep Spectrogram Learning for Gunshot Classification: A Comparative Study of CNN Architectures and Time-Frequency Representations [PDF]

open access: yesJournal of Imaging
Gunshot sound classification plays a crucial role in public safety, forensic investigations, and intelligent surveillance systems. This study evaluates the performance of deep learning models in classifying firearm sounds by analyzing twelve time ...
Pafan Doungpaisan, Peerapol Khunarsa
doaj   +2 more sources

Kombinasi Fitur Multispektrum Hilbert dan Cochleagram untuk Identifikasi Emosi Wicara

open access: yesJurnal Nasional Teknik Elektro dan Teknologi Informasi, 2020
Dalam interaksi perilaku sosial, suara manusia menjadi salah satu saluran utama pembawa atribut ekspresi emosi kondisi mentalnya. Suara manusia merupakan hasil olah vokal yang tersusun dengan disertai urutan kata demi kata, hingga menghasilkan kalimat ...
Agustinus Bimo Gumelar   +5 more
doaj   +3 more sources

Biological neurons act as generalization filters in reservoir computing. [PDF]

open access: yesProc Natl Acad Sci U S A, 2023
Reservoir computing is a machine learning paradigm that transforms the transient dynamics of high-dimensional nonlinear systems for processing time-series data.
Sumi T   +7 more
europepmc   +2 more sources

Role of non-linear data processing on speech recognition task in the framework of reservoir computing. [PDF]

open access: yesSci Rep, 2020
The reservoir computing neural network architecture is widely used to test hardware systems for neuromorphic computing. One of the preferred tasks for bench-marking such devices is automatic speech recognition.
Abreu Araujo F   +10 more
europepmc   +4 more sources

Home - About - Disclaimer - Privacy