Distillation from WRN 22-4 (teacher) to WRN 16-2 (student) on CIFAR-10 dataset. Pre-trained teacher network (WRN 22-4) is included. Just run the code. Please change base learning rate to 0.1 for ...