Results 31 to 40 of about 22,343 (153)
Enhancing timbre model using MFCC and its time derivatives for music similarity estimation
One of the popular methods for content-based music similarity estimation is to model timbre with MFCC as a single multivariate Gaussian with full covariance matrix, then use symmetric Kullback-Leibler divergence.
de leon, Franz, Martinez, Kirk
core +1 more source
Speech Processing in Computer Vision Applications [PDF]
Deep learning has been recently proven to be a viable asset in determining features in the field of Speech Analysis. Deep learning methods like Convolutional Neural Networks facilitate the expansion of specific feature information in waveforms, allowing ...
Waterworth, Nicholas
core +2 more sources
Statistically Significant Duration-Independent-based Noise-Robust Speaker Verification [PDF]
A speaker verification system models individual speakers using different speech features to improve their robustness. However, redundant features degrade the system's performance.
Asmita Nirmal +2 more
doaj +1 more source
In Acoustic Scene Classification (ASC) two major approaches have been followed . While one utilizes engineered features such as mel-frequency-cepstral-coefficients (MFCCs), the other uses learned features that are the outcome of an optimization algorithm.
bisot +15 more
core +1 more source
SPEAKER IDENTIFICATION SYSTEM USING AUDIO SIGNAL AND DEEP LEARNING METHOD [PDF]
Automatic Speaker Identification (ASI) does not result in high accuracy, so it is essential to develop a highly accurate Speaker Identification (SI) system. Artificial Intelligence has shown remarkable improvement in the development of such systems using
Neelam Nehra +2 more
doaj +1 more source
Rare earth based intermetallics, SmScGe and NdScGe, are shown to exhibit near zero net magnetization with substitutions of 6 to 9 atomic percent of Nd and 25 atomic percent of Gd, respectively.
A K Grover +11 more
core +1 more source
Fault detection method for belt conveyor idler
The existing fault detection methods for belt conveyor idler have the problems of low recognition precision, poor anti-interference capability and inability to operate stably over a long period of time.
WU Guoping
doaj +1 more source
Discrimination between patients with CVDs and healthy people by voiceprint using the MFCC and Pitch
Heart diseases cause many deaths around the world every year, and his death rate makes him the leader of the killer diseases. But early diagnosis can be helpful to decrease those several deaths and save lives.
Abdelhamid Bourouhou +3 more
doaj +1 more source
Ion‐Gating Reservoir Computing for Preprocessing‐Free Speech Recognition from Throat Vibrations
This work presents a throat‐mounted mechanoelectric sensor integrated with an ion‐gel/graphene reservoir device for on‐device speech recognition. The system converts raw biomechanical vibrations into rich nonlinear current dynamics, enabling efficient classification through a simple linear readout. The approach highlights a compact and tunable physical‐
Daiki Nishioka +5 more
wiley +1 more source
Speech depression recognition based on attentional residual network
Background: Depressive disorder is a common affective disorder, also known as depression, which is characterized by sadness, loss of interest, feelings of guilt or low self-worth and poor concentration.
Xiaoyong Lu +3 more
doaj +1 more source

