Results 231 to 240 of about 188,127 (258)
Some of the next articles are maybe not open access.

A Comprehensive Survey on Knowledge Distillation

Trans. Mach. Learn. Res.
Deep Neural Networks (DNNs) have achieved notable performance in the fields of computer vision and natural language processing with various applications in both academia and industry.
Amir M. Mansourian   +10 more
semanticscholar   +1 more source

Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation

International Conference on Machine Learning
We introduce Score identity Distillation (SiD), an innovative data-free method that distills the generative capabilities of pretrained diffusion models into a single-step generator.
Mingyuan Zhou   +4 more
semanticscholar   +1 more source

A Survey on Knowledge Distillation of Large Language Models

arXiv.org
In the era of Large Language Models (LLMs), Knowledge Distillation (KD) emerges as a pivotal methodology for transferring advanced capabilities from leading proprietary LLMs, such as GPT-4, to their open-source counterparts like LLaMA and Mistral ...
Xiaohan Xu   +8 more
semanticscholar   +1 more source

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

arXiv.org
Knowledge distillation improves large language model (LLM) reasoning by compressing the knowledge of a teacher LLM to train smaller LLMs. On-policy distillation advances this approach by having the student sample its own trajectories while a teacher LLM ...
Siyan Zhao   +6 more
semanticscholar   +1 more source

Experimental demonstration of logical magic state distillation

Nature
Realizing universal fault-tolerant quantum computation is a key goal in quantum information science1, 2, 3–4. By encoding quantum information into logical qubits using quantum error correcting codes, physical errors can be detected and corrected ...
Pedro Sales Rodriguez   +72 more
semanticscholar   +1 more source

D4M: Dataset Distillation via Disentangled Diffusion Model

Computer Vision and Pattern Recognition
Dataset distillation offers a lightweight synthetic dataset for fast network training with promising test accuracy. To imitate the performance of the original dataset, most approaches employ bi-level optimization and the distillation space relies on the ...
Duo Su   +4 more
semanticscholar   +1 more source

Reciprocal Teacher-Student Learning via Forward and Feedback Knowledge Distillation

IEEE transactions on multimedia
Knowledge distillation (KD) is a prevalent model compression technique in deep learning, aiming to leverage knowledge from a large teacher model to enhance the training of a smaller student model.
Jianping Gou   +6 more
semanticscholar   +1 more source

Reinforcement Learning via Self-Distillation

arXiv.org
Large language models are increasingly post-trained with reinforcement learning in verifiable domains such as code and math. Yet, current methods for reinforcement learning with verifiable rewards (RLVR) learn only from a scalar outcome reward per ...
Jonas Hubotter   +10 more
semanticscholar   +1 more source

Spot-Adaptive Knowledge Distillation

IEEE Transactions on Image Processing, 2022
Jie Song, Jingwen Ye, Mingli Song
exaly  

Pore wetting in membrane distillation: A comprehensive review

Progress in Materials Science, 2021
Hooman Chamani   +2 more
exaly  

Home - About - Disclaimer - Privacy