Results 261 to 270 of about 3,807,781 (317)
Some of the following articles may not be open access.
Moshi: a speech-text foundation model for real-time dialogue
arXiv.org
We introduce Moshi, a speech-text foundation model and full-duplex spoken dialogue framework. Current systems for spoken dialogue rely on pipelines of independent components, namely voice activity detection, speech recognition, textual dialogue and text ...
Alexandre Défossez +7 more
semanticscholar +1 more source
arXiv.org
Recent years have seen large language model (LLM) based text-to-speech (TTS) emerge into the mainstream, owing to its high naturalness and zero-shot capacity.
Zhihao Du +11 more
semanticscholar +1 more source
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
arXiv.org
We introduce Seed-TTS, a family of large-scale autoregressive text-to-speech (TTS) models capable of generating speech that is virtually indistinguishable from human speech. Seed-TTS serves as a foundation model for speech generation and excels in speech
Philip Anastassiou +45 more
semanticscholar +1 more source
Clinics in Plastic Surgery, 1990
Babies are born nonverbal, yet they spontaneously and seemingly effortlessly acquire the complex skills necessary for oral communication. Interestingly, many of these skills have "golden periods" of maximally efficient learning--failure to acquire the skill at that time may lead to speech delay.
R, Peterson, S, Velleman
openaire +2 more sources
2014
The goal of any speech service is the transmission and/or processing of speech signals. In this chapter we discuss the Quality of Experience (QoE) of speech communication systems, including networks, speech processing applications and terminals.
Côté, Nicolas, Berger, Jens
openaire +2 more sources
A tutorial on hidden Markov models and selected applications in speech recognition
Proceedings of the IEEE, 1989
L. Rabiner
semanticscholar +1 more source
The cortical organization of speech processing
Nature Reviews Neuroscience, 2007
G. Hickok, D. Poeppel
semanticscholar +1 more source
Parallel and distributed encoding of speech across human auditory cortex
Cell, 2021
Liberty Hamilton +2 more
exaly
2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), 2001
A. Rix +3 more
semanticscholar +1 more source

