Results 31 to 40 of about 3,807,781 (317)
Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise [PDF]
In this paper we introduce a new cepstral coefficient extraction method based on an intelligibility measure for speech in noise, the Glimpse Proportion measure.
King, S. +4 more
core +3 more sources
Summary Introduction: In recent years, the benefits associated with the use of cochlear implants (CIs), especially with regard to speech perception, have proven to surpass those produced by the use of hearing aids, making CIs a highly ...
Fernanda Soares Granço +4 more
doaj +1 more source
Making information accessible for people with aphasia in healthcare
People with aphasia are often unable to access healthcare due to difficulties understanding and using spoken and written language, which impacts every step of their healthcare journey and outcomes.
Simon Grobler +2 more
doaj +1 more source
Using the beat histogram for speech rhythm description and language identification [PDF]
In this paper we present a novel approach for the description of speech rhythm and the extraction of rhythm-related features for automatic language identification (LID).
Lykartsis, Athanasios, Weinzierl, Stefan
core +1 more source
ESPnet: End-to-End Speech Processing Toolkit [PDF]
This paper introduces a new open source platform for end-to-end speech processing named ESPnet. ESPnet mainly focuses on end-to-end automatic speech recognition (ASR), and adopts widely-used dynamic neural network toolkits, Chainer and PyTorch, as a main
Shinji Watanabe +11 more
semanticscholar +1 more source
Speakers adjust their voice when talking in noise, which is known as Lombard speech. These acoustic adjustments facilitate speech comprehension in noise relative to plain speech (i.e., speech produced in quiet).
Bosker, H., Cooke, M.
core +1 more source
• Background:Formant structure is one of the most important acoustic characteristics of Sounds.We can differentiate Vowels according to their first three Formants.
A Bahrami +4 more
doaj +1 more source
Neural Processing of Speech Sounds in ASD and First-Degree Relatives
Efficient neural encoding of sound plays a critical role in speech and language, and when impaired, may have reverberating effects on communication skills.
Shivani P. Patel +7 more
semanticscholar +1 more source
In this paper I argue that pragmatic ability must become part of what we teach in the classroom if we are to realize the goals of communicative competence for our students.
Teaching Speech Acts
doaj
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech [PDF]
This paper introduces a new speech corpus called "LibriTTS" designed for text-to-speech use. It is derived from the original audio and text materials of the LibriSpeech corpus, which has been used for training and evaluating automatic speech recognition ...
H. Zen +7 more
semanticscholar +1 more source

