
Multi-channel spectrograms for speech processing applications using deep learning methods

open access: yes, Pattern Analysis and Applications, 2020
Time–frequency representations of speech signals provide dynamic information about how the frequency components change with time. To process this information, deep learning models with convolution layers can be used to obtain feature maps ...
T. Arias-Vergara   +9 more
semanticscholar   +1 more source

On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration [PDF]

open access: yes, Automatic Speech Recognition & Understanding, 2023
Large language models (LLMs) have achieved remarkable success in the field of natural language processing, enabling better human-computer interaction using natural language.
Jian Wu   +10 more
semanticscholar   +1 more source

State-of-the-art on monolingual lexicography for Greece (EL)

open access: yes, Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave, 2019
The authors report on a recent survey on monolingual dictionaries available on the Greek market. General dictionaries outnumber spelling and educational ones and enjoy a prestigious status.
Stella Markantonatou, Voula Giouli
doaj   +1 more source

Temporal order processing of syllables in the left parietal lobe [PDF]

open access: yes, 2009
Speech processing requires the temporal parsing of syllable order. Individuals suffering from posterior left hemisphere brain injury often exhibit temporal processing deficits as well as language deficits.
Baker, Julie M.   +4 more
core   +2 more sources

Can children with speech difficulties process an unfamiliar accent? [PDF]

open access: yes, 2001
This study explores the hypothesis that children identified as having phonological processing problems may have particular difficulty in processing a different accent. Children with speech difficulties (n = 18) were compared with matched controls on four
Nathan, L., Wells, B.
core   +1 more source

Sign Language Technologies and the Critical Role of SL Resources in View of Future Internet Accessibility Services

open access: yes, Technologies, 2019
In this paper, we touch upon the requirement for accessibility via Sign Language as regards dynamic composition and exchange of new content in the context of natural language-based human interaction, and also the accessibility of web services and ...
Eleni Efthimiou   +5 more
doaj   +1 more source

Application of Tensor Train Decomposition in S2VT Model for Sign Language Recognition

open access: yes, IEEE Access, 2021
Sign language recognition is the conversion of sign language into text or speech, bridging the communication between the hearing and society. Recently, sequence-to-sequence video to text (S2VT) models have been employed in the field of sign language ...
Biao Xu, Shiliang Huang, Zhongfu Ye
doaj   +1 more source

A Novel Jointly Optimized Cooperative DAE-DNN Approach Based on a New Multi-Target Step-Wise Learning for Speech Enhancement

open access: yes, IEEE Access, 2023
In this paper, we present a new supervised speech enhancement approach based on the cooperative structure of deep autoencoders (DAEs) as generative models and deep neural networks (DNNs).
Matin Pashaian   +2 more
doaj   +1 more source

NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain

open access: yes, EURASIP Journal on Audio, Speech, and Music Processing, 2021
Localization of multiple speakers using microphone arrays remains a challenging problem, especially in the presence of noise and reverberation. State-of-the-art localization algorithms generally exploit the sparsity of speech in some representation for ...
Sushmita Thakallapalli   +2 more
doaj   +1 more source

iCUS: Intelligent CU Size Selection for HEVC Inter Prediction

open access: yes, IEEE Access, 2020
The hierarchical quadtree partitioning of Coding Tree Units (CTUs) is one of the striking features of HEVC that contributes to its superior coding performance over its predecessors.
Buddhiprabha Erabadda   +3 more
doaj   +1 more source
