Results 311 to 320 of about 7,840,073 (343)

REST: Retrieval-Based Speculative Decoding

North American Chapter of the Association for Computational Linguistics, 2023
We introduce Retrieval-Based Speculative Decoding (REST), a novel algorithm designed to speed up language model generation. The key insight driving the development of REST is the observation that the process of text generation often includes certain ...
Zhenyu He   +4 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy