arxiv Model compression via distillation and quantization