Recreated deep compression's pruning, quantization and huffman encoding pipeline
Based on the ICLR 2016 paper: Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding by Song Han et al. Link to the paper: https://arxiv.org/abs/1510.00149