The paper proposes a framework for recording meetings to avoid the hassle of writing down meeting points. The key components of the framework are the “Model Trainer” and the “Meeting Recorder”.
Khan Isra +3 more
doaj +1 more source
Prediction method of operation state of mine belt conveyor
Combining sensor monitoring data with a neural network prediction model is the mainstream method for predicting the operation state of mine belt conveyors.
LI Jingzhao, SUN Jiechen, YE Tongzhou
doaj +1 more source
Multi-Accent Speaker Detection Using Normalize Feature MFCC Neural Network Method
Speaker recognition is a field of research that continues to this day. Various methods have been developed to detect the human voice with greater precision and accuracy. Accent recognition is a particularly challenging area of human speech recognition research.
Kristiawan Nugroho +3 more
doaj +1 more source
Automatic Detection of Laryngeal Pathology on Sustained Vowels Using Short-Term Cepstral Parameters: Analysis of Performance and Theoretical Justification [PDF]
The majority of speech signal analysis procedures for automatic detection of laryngeal pathologies rely mainly on parameters extracted from time-domain processing.
B. Boyanov +11 more
core +2 more sources
Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features
A conversion method based on the inversion of Mel frequency cepstral coefficient (MFCC) features was proposed to convert whispered speech into normal speech.
Qiang Zhu +3 more
doaj +1 more source
The Intelligent Recognition of Speech Emotions: Survey Study [PDF]
Speech emotion recognition (SER) is a challenging task in the field of artificial intelligence and machine learning. Over the years, researchers have proposed various approaches to recognize emotions from speech signals.
Ali Abdulwahhab Yehya al_saffar +1 more
doaj +1 more source
Synthetic speech detection and audio steganography in VoIP scenarios [PDF]
The distinction between synthetic and human voice uses the techniques of current biometric voice recognition systems, which prevent a person’s voice from being confused with someone else’s, whether with good or bad intentions.
Capolupo, Daniele, D'AMORE, Fabrizio
core +1 more source
This paper introduces two significant contributions: one is a new feature based on histograms of MFCC (Mel-Frequency Cepstral Coefficients) extracted from the audio files that can be used in emotion classification from speech signals, and the other – our
Muhammet Pakyurek +3 more
doaj +1 more source
Fusion of Learned Multi-Modal Representations and Dense Trajectories for Emotional Analysis in Videos [PDF]
When designing a video affective content analysis algorithm, one of the most important steps is the selection of discriminative features for the effective representation of video segments.
Acar, Esra +2 more
core +1 more source
Acoustic analysis of selected homographs for speech recognition systems [PDF]
This paper presents an acoustic analysis of selected homographs in the context of automatic speech recognition (ASR) systems. The study focuses on the Polish words “Dania” (eng. Denmark) and “dania” (eng. meals), which, despite identical spelling, differ
Dominik Lentas, Michał Łuczyński
doaj +1 more source

