Results 31 to 40 of about 6,722,896 (329)

Application of Tensor Train Decomposition in S2VT Model for Sign Language Recognition

open access: yesIEEE Access, 2021
Sign language recognition is a conversion of sign language into text or speech, bridging the communication between the hearing and society. Recently, sequence-to-sequence video to text (S2VT) models has been employed in the field of sign language ...
Biao Xu, Shiliang Huang, Zhongfu Ye
doaj   +1 more source

SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing [PDF]

open access: yesIEEE/ACM Transactions on Audio Speech and Language Processing, 2023
Paralinguistic speech processing is important in addressing many issues, such as sentiment and neurocognitive disorder analyses. Recently, Transformer has achieved remarkable success in the natural language processing field and has demonstrated its ...
Weidong Chen   +4 more
semanticscholar   +1 more source

Off-Grid DOA Estimation Aiding Virtual Extension of Coprime Arrays Exploiting Fourth Order Difference Co-Array With Interpolation

open access: yesIEEE Access, 2018
In this paper, a novel array structure exploiting coprime arrays is proposed which can be very proficient to determine the number of consecutive lags in proportion with the number of array elements.
Tarek Hasan Al Mahmud   +4 more
doaj   +1 more source

Automated audio captioning: an overview of recent progress and new challenges

open access: yesEURASIP Journal on Audio, Speech, and Music Processing, 2022
Automated audio captioning is a cross-modal translation task that aims to generate natural language descriptions for given audio clips. This task has received increasing attention with the release of freely available datasets in recent years. The problem
Xinhao Mei   +3 more
doaj   +1 more source

A validated finite element model for room acoustic treatments with edge absorbers

open access: yesActa Acustica, 2023
Porous acoustic absorbers have excellent properties in the low-frequency range when positioned in room edges, therefore they are a common method for reducing low-frequency reverberation.
Kraxberger Florian   +5 more
doaj   +1 more source

HMM-based speech synthesiser using the LF-model of the glottal source [PDF]

open access: yes, 2011
A major factor which causes a deterioration in speech quality in HMM-based speech synthesis is the use of a simple delta pulse signal to generate the excitation of voiced speech.
Cabral, J.   +3 more
core   +4 more sources

Robust and complex approach of pathological speech signal analysis [PDF]

open access: yesNeurocomputing, 2015
This paper presents a study of the approaches in the state-of-the-art in the field of pathological speech signal analysis with a special focus on parametrization techniques.
J. Mekyska   +10 more
semanticscholar   +1 more source

The pursuit of invariance in speech signals [PDF]

open access: yesThe Journal of the Acoustical Society of America, 1983
The search for the acoustic properties useful to the listener in extracting the linguistic message from a speech signal is often construed as the task of matching invariant physical properties to invariant phonological percepts; the discovery of the former will explain the latter.
openaire   +2 more sources

Using the beat histogram for speech rhythm description and language identification [PDF]

open access: yes, 2015
In this paper we present a novel approach for the description of speech rhythm and the extraction of rhythm-related features for automatic language identification (LID).
Lykartsis, Athanasios, Weinzierl, Stefan
core   +1 more source

On Learning to Identify Genders from Raw Speech Signal Using CNNs

open access: yesInterspeech, 2018
Automatic Gender Recognition (AGR) is the task of identifying the gender of a speaker given a speech signal. Standard approaches extract features like fundamental frequency and cepstral features from the speech signal and train a binary classi-fier ...
Selen Hande Kabil   +2 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy