Results 11 to 20 of about 141,062
Multiple-Stage Knowledge Distillation
Knowledge distillation (KD) is a method in which a teacher network guides the learning of a student network, thereby improving the student network's performance. (A minimal sketch of the standard KD loss follows this entry.)
Chuanyun Xu +6 more
doaj +1 more source
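The snippet above defines KD in its classic form. As a concrete illustration, here is a minimal sketch of the standard softened-logit objective from Hinton et al. (2015), not the multi-stage scheme of this particular paper; the temperature T and mixing weight alpha are illustrative defaults.

```python
# Minimal sketch of the classic KD loss (Hinton et al., 2015).
# T and alpha are illustrative hyperparameters, not values from the paper above.
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    """Blend hard-label cross-entropy with KL divergence to the
    teacher's temperature-softened distribution."""
    soft_teacher = F.softmax(teacher_logits / T, dim=1)
    log_soft_student = F.log_softmax(student_logits / T, dim=1)
    # T*T rescales gradients so the soft term stays comparable
    # in magnitude to the hard-label term.
    distill = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * distill + (1.0 - alpha) * hard

if __name__ == "__main__":
    s, t = torch.randn(8, 10), torch.randn(8, 10)
    y = torch.randint(0, 10, (8,))
    print(kd_loss(s, t, y).item())
```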
Spot-Adaptive Knowledge Distillation
12 pages, 8 ...
Jie Song +3 more
openaire +3 more sources
Similarity and Consistency by Self-distillation Method [PDF]
Due to the high data pre-processing costs and missed local-feature detection in self-distillation methods for model compression, a similarity and consistency by self-distillation (SCD) method is proposed to improve model classification accuracy. Firstly ...
WAN Xu, MAO Yingchi, WANG Zibo, LIU Yi, PING Ping
doaj +1 more source
Distilling Knowledge via Knowledge Review [PDF]
CVPR ...
Chen, Pengguang +3 more
openaire +2 more sources
Review of Recent Distillation Studies [PDF]
Knowledge distillation has gained a lot of interest in recent years because it allows for compressing a large deep neural network (teacher DNN) into a smaller DNN (student DNN), while maintaining its accuracy.
Gao Minghong
doaj +1 more source
Recurrent Knowledge Distillation [PDF]
Knowledge distillation compacts deep networks by letting a small student network learn from a large teacher network. The accuracy of knowledge distillation has recently benefited from adding residual layers. We propose to reduce the size of the student network even further by recasting multiple residual layers in the teacher network into a single recurrent ... (A sketch of the weight-sharing idea follows this entry.)
Pintea, S. +2 more
openaire +3 more sources
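As a rough illustration of the recasting idea described above, the sketch below replaces several distinct residual layers with one weight-shared block applied repeatedly, trading depth for parameter count. The class name RecurrentResidual and the step count are hypothetical; the paper's exact architecture is not reproduced here.

```python
# Hedged sketch: one weight-shared residual body unrolled n_steps
# times, standing in for n_steps distinct residual layers.
import torch
import torch.nn as nn

class RecurrentResidual(nn.Module):
    def __init__(self, channels, n_steps=3):
        super().__init__()
        self.n_steps = n_steps
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.BatchNorm2d(channels),
        )

    def forward(self, x):
        # The same parameters are reused at every step, so the
        # parameter count is that of a single residual layer.
        for _ in range(self.n_steps):
            x = torch.relu(x + self.body(x))
        return x
```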
A Virtual Knowledge Distillation via Conditional GAN
Knowledge distillation aims at transferring the knowledge from a pre-trained complex model, called the teacher, to a relatively smaller and faster one, called the student. Unlike previous works that transfer the teacher's softened distributions or feature ... (A sketch of distillation on generator-made samples follows this entry.)
Sihwan Kim
doaj +1 more source
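To make the "virtual" transfer concrete, here is a hedged sketch in which a conditional generator synthesizes class-conditioned inputs and the student is trained to match the teacher on them, with no real data in the distillation step. The generator architecture, shapes, and function names are illustrative assumptions, not this paper's implementation.

```python
# Hedged sketch: distillation on samples from a conditional generator.
# Architecture and dimensions (z_dim, img_dim) are illustrative only.
import torch
import torch.nn as nn
import torch.nn.functional as F

class CondGenerator(nn.Module):
    def __init__(self, z_dim=100, n_classes=10, img_dim=28 * 28):
        super().__init__()
        self.embed = nn.Embedding(n_classes, z_dim)
        self.net = nn.Sequential(
            nn.Linear(z_dim, 256), nn.ReLU(),
            nn.Linear(256, img_dim), nn.Tanh(),
        )

    def forward(self, z, labels):
        # Condition the noise on the class via elementwise gating.
        return self.net(z * self.embed(labels))

def virtual_distill_step(gen, teacher, student, batch=64, n_classes=10, T=4.0):
    z = torch.randn(batch, 100)
    y = torch.randint(0, n_classes, (batch,))
    x = gen(z, y)                      # "virtual" samples, no real data
    with torch.no_grad():
        t_logits = teacher(x)
    s_logits = student(x)
    return F.kl_div(F.log_softmax(s_logits / T, dim=1),
                    F.softmax(t_logits / T, dim=1),
                    reduction="batchmean") * (T * T)
```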
Annealing Knowledge Distillation [PDF]
Significant memory and computational requirements of large deep neural networks restrict their application on edge devices. Knowledge distillation (KD) is a prominent model compression technique for deep neural networks in which the knowledge of a trained large teacher model is transferred to a smaller student model. (A sketch of a temperature-annealing schedule follows this entry.)
Jafari, Aref +3 more
openaire +2 more sources
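As an illustration of the annealing idea, the sketch below decays the softening temperature linearly over training, so the teacher's targets start heavily smoothed and sharpen as the student matures. The linear schedule and the KL formulation are assumptions for illustration and may differ from the paper's exact two-phase procedure.

```python
# Hedged sketch of temperature annealing for KD; schedule shape
# (linear) and loss choice (KL) are illustrative assumptions.
import torch
import torch.nn.functional as F

def annealed_temperature(epoch, total_epochs, t_max=8.0, t_min=1.0):
    """Linearly decay the softening temperature from t_max to t_min."""
    frac = min(epoch / max(total_epochs - 1, 1), 1.0)
    return t_max + frac * (t_min - t_max)

def annealing_kd_loss(student_logits, teacher_logits, epoch, total_epochs):
    T = annealed_temperature(epoch, total_epochs)
    target = F.softmax(teacher_logits / T, dim=1)
    pred = F.log_softmax(student_logits / T, dim=1)
    return F.kl_div(pred, target, reduction="batchmean") * (T * T)
```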
Feature fusion-based collaborative learning for knowledge distillation
Deep neural networks have achieved great success in a variety of applications, such as self-driving cars and intelligent robotics. Meanwhile, knowledge distillation has received increasing attention as an effective model compression technique for ...
Yiting Li +4 more
doaj +1 more source
Hint-Dynamic Knowledge Distillation
Knowledge Distillation (KD) transfers knowledge from a high-capacity teacher model to promote a smaller student model. Existing efforts guide the distillation by matching prediction logits, feature embeddings, etc., while leaving how to efficiently utilize them in conjunction less explored. (A sketch combining logit and feature matching follows this entry.)
Liu, Yiyang +4 more
openaire +2 more sources
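Following the snippet's distinction between logit matching and feature-embedding ("hint") matching, here is a minimal FitNets-style sketch combining the two signals with a fixed weight. The dynamic weighting that gives Hint-Dynamic KD its name is not reproduced; the 1x1 projection aligning student and teacher channel widths is a common convention rather than this paper's API.

```python
# Hedged sketch: fixed-weight combination of a feature ("hint") term
# and a softened-logit term. beta and T are illustrative defaults.
import torch
import torch.nn as nn
import torch.nn.functional as F

class HintLoss(nn.Module):
    def __init__(self, s_channels, t_channels):
        super().__init__()
        # Project student features into the teacher's channel space.
        self.proj = nn.Conv2d(s_channels, t_channels, kernel_size=1)

    def forward(self, s_feat, t_feat, s_logits, t_logits, T=4.0, beta=0.5):
        feat_term = F.mse_loss(self.proj(s_feat), t_feat)
        logit_term = F.kl_div(F.log_softmax(s_logits / T, dim=1),
                              F.softmax(t_logits / T, dim=1),
                              reduction="batchmean") * (T * T)
        return beta * feat_term + (1.0 - beta) * logit_term
```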

