Multi-channel spectrograms for speech processing applications using deep learning methods
Time–frequency representations of the speech signals provide dynamic information about how the frequency component changes with time. In order to process this information, deep learning models with convolution layers can be used to obtain feature maps ...
T. Arias-Vergara +9 more
semanticscholar +1 more source
On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration
Large language models (LLMs) have achieved remarkable success in the field of natural language processing, enabling better human-computer interaction using natural language.
Jian Wu +10 more
semanticscholar +1 more source
State-of-the-art on monolingual lexicography for Greece (EL)
The authors report on a recent survey on monolingual dictionaries available on the Greek market. General dictionaries outnumber spelling and educational ones and enjoy a prestigious status.
Stella Markantonatou, Voula Giouli
doaj +1 more source
Temporal order processing of syllables in the left parietal lobe
Speech processing requires the temporal parsing of syllable order. Individuals suffering from posterior left hemisphere brain injury often exhibit temporal processing deficits as well as language deficits.
Baker, Julie M. +4 more
core +2 more sources
Can children with speech difficulties process an unfamiliar accent?
This study explores the hypothesis that children identified as having phonological processing problems may have particular difficulty in processing a different accent. Children with speech difficulties (n = 18) were compared with matched controls on four ...
Nathan, L., Wells, B.
core +1 more source
In this paper, we touch upon the requirement for accessibility via Sign Language as regards dynamic composition and exchange of new content in the context of natural language-based human interaction, and also the accessibility of web services and ...
Eleni Efthimiou +5 more
doaj +1 more source
Application of Tensor Train Decomposition in S2VT Model for Sign Language Recognition
Sign language recognition is a conversion of sign language into text or speech, bridging the communication between the hearing and society. Recently, sequence-to-sequence video to text (S2VT) models have been employed in the field of sign language ...
Biao Xu, Shiliang Huang, Zhongfu Ye
doaj +1 more source
In this paper, we present a new supervised speech enhancement approach based on the cooperative structure of deep autoencoders (DAEs) as generative models and deep neural networks (DNN).
Matin Pashaian +2 more
doaj +1 more source
Localization of multiple speakers using microphone arrays remains a challenging problem, especially in the presence of noise and reverberation. State-of-the-art localization algorithms generally exploit the sparsity of speech in some representation for ...
Sushmita Thakallapalli +2 more
doaj +1 more source
iCUS: Intelligent CU Size Selection for HEVC Inter Prediction
The hierarchical quadtree partitioning of Coding Tree Units (CTU) is one of the striking features in HEVC that contributes towards its superior coding performance over its predecessors.
Buddhiprabha Erabadda +3 more
doaj +1 more source

