Results 31 to 40 of about 6,722,896 (329)
Application of Tensor Train Decomposition in S2VT Model for Sign Language Recognition
Sign language recognition is a conversion of sign language into text or speech, bridging the communication between the hearing and society. Recently, sequence-to-sequence video to text (S2VT) models has been employed in the field of sign language ...
Biao Xu, Shiliang Huang, Zhongfu Ye
doaj +1 more source
SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing [PDF]
Paralinguistic speech processing is important in addressing many issues, such as sentiment and neurocognitive disorder analyses. Recently, Transformer has achieved remarkable success in the natural language processing field and has demonstrated its ...
Weidong Chen +4 more
semanticscholar +1 more source
In this paper, a novel array structure exploiting coprime arrays is proposed which can be very proficient to determine the number of consecutive lags in proportion with the number of array elements.
Tarek Hasan Al Mahmud +4 more
doaj +1 more source
Automated audio captioning: an overview of recent progress and new challenges
Automated audio captioning is a cross-modal translation task that aims to generate natural language descriptions for given audio clips. This task has received increasing attention with the release of freely available datasets in recent years. The problem
Xinhao Mei +3 more
doaj +1 more source
A validated finite element model for room acoustic treatments with edge absorbers
Porous acoustic absorbers have excellent properties in the low-frequency range when positioned in room edges, therefore they are a common method for reducing low-frequency reverberation.
Kraxberger Florian +5 more
doaj +1 more source
HMM-based speech synthesiser using the LF-model of the glottal source [PDF]
A major factor which causes a deterioration in speech quality in HMM-based speech synthesis is the use of a simple delta pulse signal to generate the excitation of voiced speech.
Cabral, J. +3 more
core +4 more sources
Robust and complex approach of pathological speech signal analysis [PDF]
This paper presents a study of the approaches in the state-of-the-art in the field of pathological speech signal analysis with a special focus on parametrization techniques.
J. Mekyska +10 more
semanticscholar +1 more source
The pursuit of invariance in speech signals [PDF]
The search for the acoustic properties useful to the listener in extracting the linguistic message from a speech signal is often construed as the task of matching invariant physical properties to invariant phonological percepts; the discovery of the former will explain the latter.
openaire +2 more sources
Using the beat histogram for speech rhythm description and language identification [PDF]
In this paper we present a novel approach for the description of speech rhythm and the extraction of rhythm-related features for automatic language identification (LID).
Lykartsis, Athanasios, Weinzierl, Stefan
core +1 more source
On Learning to Identify Genders from Raw Speech Signal Using CNNs
Automatic Gender Recognition (AGR) is the task of identifying the gender of a speaker given a speech signal. Standard approaches extract features like fundamental frequency and cepstral features from the speech signal and train a binary classi-fier ...
Selen Hande Kabil +2 more
semanticscholar +1 more source

