Results 1 to 10 of about 3,292,625 (393)

Speech Recognition with no speech or with noisy speech [PDF]

open access: yesarXiv, 2019
The performance of automatic speech recognition systems(ASR) degrades in the presence of noisy speech. This paper demonstrates that using electroencephalography (EEG) can help automatic speech recognition systems overcome performance loss in the presence of noise.
Gautam Krishna   +3 more
arxiv   +5 more sources

Frontier Research on Low-Resource Speech Recognition Technology [PDF]

open access: yesSensors, 2023
With the development of continuous speech recognition technology, users have put forward higher requirements in terms of speech recognition accuracy. Low-resource speech recognition, as a typical speech recognition technology under restricted conditions,
Wushour Slam, Yanan Li, Nurmamet Urouvas
doaj   +2 more sources

Implicit learning and individual differences in speech recognition: an exploratory study [PDF]

open access: yesFrontiers in Psychology, 2023
Individual differences in speech recognition in challenging listening environments are pronounced. Studies suggest that implicit learning is one variable that may contribute to this variability. Here, we explored the unique contributions of three indices
Ranin Khayr, Hanin Karawani, Karen Banai
doaj   +2 more sources

Application of dynamic time warping optimization algorithm in speech recognition of machine translation [PDF]

open access: yesHeliyon, 2023
Speech recognition is the foundation of human-computer interaction technology and an important aspect of speech signal processing, with broad application prospects. Therefore, it is very necessary to recognize speech.
Shaohua Jiang, Zheng Chen
doaj   +2 more sources

Speech Recognition with Augmented Synthesized Speech [PDF]

open access: yesarXiv, 2019
Recent success of the Tacotron speech synthesis architecture and its variants in producing natural sounding multi-speaker synthesized speech has raised the exciting possibility of replacing expensive, manually transcribed, domain-specific, human speech that is used to train speech recognizers.
Zelin Wu   +6 more
arxiv   +5 more sources

THE RECOGNITION OF SPEECH BY MACHINE [PDF]

open access: green, 1961
"May 1, 1961." "Based on a thesis submitted to the Department of Electrical Engineering, M. I. T. ... 1959, in partial fulfillment of the requirements for the degree of Doctor of Science." "May 1, 1961."
George W. Hughes
openalex   +4 more sources

Robust Speech Recognition via Large-Scale Weak Supervision [PDF]

open access: yesInternational Conference on Machine Learning, 2022
We study the capabilities of speech processing systems trained simply to predict large amounts of transcripts of audio on the internet. When scaled to 680,000 hours of multilingual and multitask supervision, the resulting models generalize well to ...
Alec Radford   +5 more
semanticscholar   +1 more source

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages [PDF]

open access: yesarXiv.org, 2023
We introduce the Universal Speech Model (USM), a single large model that performs automatic speech recognition (ASR) across 100+ languages. This is achieved by pre-training the encoder of the model on a large unlabeled multilingual dataset of 12 million (
Yu Zhang   +26 more
semanticscholar   +1 more source

End-to-End Speech Recognition: A Survey [PDF]

open access: yesIEEE/ACM Transactions on Audio Speech and Language Processing, 2023
In the last decade of automatic speech recognition (ASR) research, the introduction of deep learning has brought considerable reductions in word error rate of more than 50% relative, compared to modeling without deep learning.
Rohit Prabhavalkar   +4 more
semanticscholar   +1 more source

Prompting Large Language Models with Speech Recognition Abilities [PDF]

open access: yesIEEE International Conference on Acoustics, Speech, and Signal Processing, 2023
Large language models (LLMs) have proven themselves highly flexible, able to solve a wide range of generative tasks, such as abstractive summarization and open-ended question answering.
Yassir Fathullah   +11 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy