Results 261 to 270 of about 3,807,781 (317)
Some of the next articles are maybe not open access.

Moshi: a speech-text foundation model for real-time dialogue

arXiv.org
We introduce Moshi, a speech-text foundation model and full-duplex spoken dialogue framework. Current systems for spoken dialogue rely on pipelines of independent components, namely voice activity detection, speech recognition, textual dialogue and text ...
Alexandre D'efossez   +7 more
semanticscholar   +1 more source

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens

arXiv.org
Recent years have witnessed a trend that large language model (LLM) based text-to-speech (TTS) emerges into the mainstream due to their high naturalness and zero-shot capacity.
Zhihao Du   +11 more
semanticscholar   +1 more source

Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

arXiv.org
We introduce Seed-TTS, a family of large-scale autoregressive text-to-speech (TTS) models capable of generating speech that is virtually indistinguishable from human speech. Seed-TTS serves as a foundation model for speech generation and excels in speech
Philip Anastassiou   +45 more
semanticscholar   +1 more source

Speech Development

Clinics in Plastic Surgery, 1990
Babies are born nonverbal, yet they spontaneously and seemingly effortlessly acquire the complex skills necessary for oral communication. Interestingly, many of these skills have "golden periods" of maximally efficient learning--failure to acquire the skill at that time may lead to speech delay.
R, Peterson, S, Velleman
openaire   +2 more sources

Speech Communication

2014
The goal of any speech service is the transmission and/or processing of speech signals. In this chapter we discuss the Quality of Experience (QoE) of speech communication systems, including networks, speech processing applications and terminals.
Côté, Nicolas, Berger, Jens
openaire   +2 more sources

Speech! Speech!

World Literature Today, 2002
David Rogers, Geoffrey Hill
openaire   +1 more source

The cortical organization of speech processing

Nature Reviews Neuroscience, 2007
G. Hickok, D. Poeppel
semanticscholar   +1 more source

Parallel and distributed encoding of speech across human auditory cortex

Cell, 2021
Liberty Hamilton   +2 more
exaly  

Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs

2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), 2001
A. Rix   +3 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy