
model_distillation

This is a knowledge distillation project based on kuangliu's pytorch-cifar repository.

Building on the classic model collection in the pytorch-cifar repository, this project applies knowledge distillation to those models, analyzes their classification accuracy on CIFAR-10, and evaluates the memory savings that distillation provides.
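
One concrete way to evaluate the memory advantage is to compare the parameter counts of the teacher (DLA) and the student (MobileNetV2). The snippet below is a minimal sketch; it assumes the model definitions can be imported from the repository's models package, as in pytorch-cifar.

# Sketch: compare trainable parameter counts of teacher and student.
from models import DLA, MobileNetV2  # model collection inherited from pytorch-cifar (assumed importable)

def count_params(net):
    # Total number of trainable parameters.
    return sum(p.numel() for p in net.parameters() if p.requires_grad)

teacher, student = DLA(), MobileNetV2()
print(f"DLA (teacher):         {count_params(teacher) / 1e6:.2f}M parameters")
print(f"MobileNetV2 (student): {count_params(student) / 1e6:.2f}M parameters")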

Prerequisites

  • Python 3.6+
  • PyTorch 1.0+

Training

# Start training with: 
python main.py

# You can manually resume the training with: 
python main.py --resume --lr=0.01
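
Resuming reloads the saved network weights, the best accuracy so far, and the epoch counter. The helper below is a hedged sketch of that logic; it assumes a checkpoint dict with 'net', 'acc', and 'epoch' keys, the convention used by pytorch-cifar-style scripts.

import os
import torch
import torch.nn as nn

def maybe_resume(net: nn.Module, ckpt_path: str = './checkpoint/ckpt.pth'):
    # Reload weights, best accuracy, and last epoch from a saved checkpoint.
    # Assumes the checkpoint layout used by pytorch-cifar-style scripts.
    if not os.path.isfile(ckpt_path):
        return 0.0, 0  # nothing to resume from: start fresh
    checkpoint = torch.load(ckpt_path, map_location='cpu')
    net.load_state_dict(checkpoint['net'])
    best_acc = checkpoint['acc']           # best test accuracy so far
    start_epoch = checkpoint['epoch'] + 1  # continue from the next epoch
    return best_acc, start_epoch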

Accuracy

Model Acc.
VGG16 92.64%
ResNet18 93.02%
ResNet50 93.62%
ResNet101 93.75%
RegNetX_200MF 94.24%
RegNetY_400MF 94.29%
MobileNetV2 94.43%
ResNeXt29(32x4d) 94.73%
ResNeXt29(2x64d) 94.82%
SimpleDLA 94.89%
DenseNet121 95.04%
PreActResNet18 95.11%
DPN92 95.16%
DLA 95.47%

Knowledge distillation script usage

This project supports DLA → MobileNetV2 knowledge distillation training; the script is distill_dla_mobilenetv2.py.

Basic commands

python distill_dla_mobilenetv2.py

Optional parameter description

  • --lr Learning rate (default 0.05)
  • --epochs Number of training epochs (default 200)
  • --alpha Hard loss weight (default 0.7; see the loss sketch below)
  • --temp Distillation temperature (default 5.0)
  • --batch_size Batch size (default 128)
  • --resume Resume training from the latest checkpoint

For example:

python distill_dla_mobilenetv2.py --lr 0.01 --epochs 100 --alpha 0.5 --temp 4.0 --batch_size 64
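
The --alpha and --temp options define the distillation objective: a weighted sum of the hard cross-entropy loss on the ground-truth labels and a temperature-softened KL divergence against the teacher's outputs. The function below is a minimal sketch of this standard Hinton-style loss; the exact formulation in distill_dla_mobilenetv2.py may differ, but per the parameter description above, alpha weights the hard loss.

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, targets, alpha=0.7, temp=5.0):
    # Hard loss: ordinary cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, targets)
    # Soft loss: KL divergence between temperature-softened student and teacher
    # distributions; the T^2 factor keeps gradients comparable across temperatures.
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / temp, dim=1),
        F.softmax(teacher_logits / temp, dim=1),
        reduction='batchmean',
    ) * (temp ** 2)
    # alpha weights the hard loss, (1 - alpha) the distillation term.
    return alpha * hard_loss + (1.0 - alpha) * soft_loss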

Training process:

  1. The script automatically loads ./checkpoint/dla.pth as the teacher model weights.
  2. The best student model is saved to ./checkpoint/mobilenetv2_distilled.pth during training.
  3. The latest checkpoint is saved to ./checkpoint/mobilenetv2_latest.pth after each epoch; pass --resume to continue training from it (see the sketch below).
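
The checkpoint handling can be sketched as follows. The helper is hypothetical and the actual field names in distill_dla_mobilenetv2.py may differ, but it mirrors the best/latest layout described above.

import os
import torch
import torch.nn as nn

def save_checkpoints(student: nn.Module, acc: float, epoch: int, best_acc: float,
                     ckpt_dir: str = './checkpoint') -> float:
    # Save the latest checkpoint every epoch and keep the best student separately.
    os.makedirs(ckpt_dir, exist_ok=True)
    state = {'net': student.state_dict(), 'acc': acc, 'epoch': epoch}
    # Latest checkpoint, overwritten every epoch; this is what --resume reads.
    torch.save(state, os.path.join(ckpt_dir, 'mobilenetv2_latest.pth'))
    # Best checkpoint, only overwritten when test accuracy improves.
    if acc > best_acc:
        torch.save(state, os.path.join(ckpt_dir, 'mobilenetv2_distilled.pth'))
        best_acc = acc
    return best_acc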

View the results

When training finishes, the terminal prints the best accuracy. The model weights are saved under the checkpoint folder.
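
To inspect or reuse the distilled student afterwards, the weights can be reloaded along the following lines. This is a sketch that assumes the checkpoint stores the state dict under a 'net' key, matching the convention above.

import torch
from models import MobileNetV2  # student architecture from the repo's model collection (assumed importable)

net = MobileNetV2()
checkpoint = torch.load('./checkpoint/mobilenetv2_distilled.pth', map_location='cpu')
net.load_state_dict(checkpoint['net'])  # assumed key; adjust if the script differs
net.eval()
print(f"Best distilled accuracy: {checkpoint['acc']:.2f}%")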


To customize the data path or model architecture, modify the parameter settings section of the script accordingly.

