
Comparison of glottal closure instants detection algorithms for emotional speech

open access: yes, 2020
In production of voiced speech, epochs or glottal closure instants (GCIs) refer to the instants of significant excitation of the vocal tract.
Sudarsana Reddy Kadiri   +5 more
core   +1 more source

wav2vec2-based Speech Rating System for Children with Speech Sound Disorder

open access: yes, 2022
The computational resources were provided by Aalto ScienceIT. This work was supported by NordForsk through the funding to Technology-enhanced foreign and second-language learning of Nordic languages, project number 103893. Speaking is a fundamental way of ...
Getman, Yaroslav   +8 more
core   +1 more source

Application of Tensor Train Decomposition in S2VT Model for Sign Language Recognition

open access: yes, IEEE Access, 2021
Sign language recognition is a conversion of sign language into text or speech, bridging the communication between the hearing and society. Recently, sequence-to-sequence video to text (S2VT) models have been employed in the field of sign language ...
Biao Xu, Shiliang Huang, Zhongfu Ye
doaj   +1 more source

A Novel Jointly Optimized Cooperative DAE-DNN Approach Based on a New Multi-Target Step-Wise Learning for Speech Enhancement

open access: yes, IEEE Access, 2023
In this paper, we present a new supervised speech enhancement approach based on the cooperative structure of deep autoencoders (DAEs) as generative models and deep neural networks (DNN).
Matin Pashaian   +2 more
doaj   +1 more source

The mechanism of speech processing in congenital amusia: Evidence from Mandarin speakers [PDF]

open access: yes, 2012
Congenital amusia is a neuro-developmental disorder of pitch perception that causes severe problems with music processing but only subtle difficulties in speech processing.
Thompson, WF   +36 more
core   +1 more source

Ubiquitous speech processing

open access: yes, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), 2002
In the ubiquitous (pervasive) computing era, it is expected that everybody will access information services anytime anywhere, and these services are expected to augment various human intelligent activities. Speech recognition technology can play an important role in this era by providing: (a) conversational systems for accessing information services ...
Sadaoki Furui   +5 more
openaire   +2 more sources

NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain

open access: yes, EURASIP Journal on Audio, Speech, and Music Processing, 2021
Localization of multiple speakers using microphone arrays remains a challenging problem, especially in the presence of noise and reverberation. State-of-the-art localization algorithms generally exploit the sparsity of speech in some representation for ...
Sushmita Thakallapalli   +2 more
doaj   +1 more source

iCUS: Intelligent CU Size Selection for HEVC Inter Prediction

open access: yes, IEEE Access, 2020
The hierarchical quadtree partitioning of Coding Tree Units (CTU) is one of the striking features in HEVC that contributes towards its superior coding performance over its predecessors.
Buddhiprabha Erabadda   +3 more
doaj   +1 more source

Elastic CRFs for Open-Ontology Slot Filling

open access: yes, Applied Sciences, 2021
Slot filling is a crucial component in task-oriented dialog systems that is used to parse (user) utterances into semantic concepts called slots. An ontology is defined by the collection of slots and the values that each slot can take.
Yinpei Dai   +5 more
doaj   +1 more source

Audiovisual benefits for speech processing speed among children with hearing loss

open access: yes, 2019
Children with hearing loss face a range of challenges when listening to and processing speech; in particular, they may process spoken language slowly in comparison to normal-hearing peers [1].
Holt, Rebecca   +5 more
core   +1 more source
