Results 151 to 160 of about 249,885 (208)

An empirical study of LLaMA3 quantization: from LLMs to MLLMs. [PDF]

open access: yesVis Intell
Huang W   +9 more
europepmc   +1 more source

Reliable ECG Anomaly Detection on Edge Devices for Internet of Medical Things Applications. [PDF]

open access: yesSensors (Basel)
Hizem M   +4 more
europepmc   +1 more source

AWQ: Activation-aware Weight Quantization for On-Device LLM Compression and Acceleration

Conference on Machine Learning and Systems, 2023
Large language models (LLMs) have transformed numerous AI applications. On-device LLM is becoming increasingly important: running LLMs locally on edge devices can reduce cloud computing costs and protect users' privacy.
Ji Lin   +5 more
semanticscholar   +1 more source

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Conference on Machine Learning and Systems, 2023
The growing demand for Large Language Models (LLMs) in applications such as content generation, intelligent chatbots, and sentiment analysis poses considerable challenges for LLM service providers.
Yilong Zhao   +9 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy