TY - GEN
T1 - Hybrid Approach for Efficient Quantization of Weights in Convolutional Neural Networks
AU - Seo, Sanghyun
AU - Kim, Juntae
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/5/25
Y1 - 2018/5/25
N2 - Convolutional neural networks(CNN) have achieved outstanding results in the fields of image recognition which classifies objects in the input images. In the deep neural networks such as CNN, the number of layers and the number of neurons in each layer are large. In other words, the deep neural networks requires relatively large storage space and calculation process. However, in embedded devices for object recognition in autonomous vehicles, large storage space and high computational complexity are constraints. For this reasons, various methodologies have been proposed to apply CNN to small embedded hardware such as mobile devices, FPGA and ASIC efficiently. In this paper, we quantize the weights of AlexNet without a large drop in accuracy by using a hybrid quantizer using uniform quantizer and k-means clustering.
AB - Convolutional neural networks(CNN) have achieved outstanding results in the fields of image recognition which classifies objects in the input images. In the deep neural networks such as CNN, the number of layers and the number of neurons in each layer are large. In other words, the deep neural networks requires relatively large storage space and calculation process. However, in embedded devices for object recognition in autonomous vehicles, large storage space and high computational complexity are constraints. For this reasons, various methodologies have been proposed to apply CNN to small embedded hardware such as mobile devices, FPGA and ASIC efficiently. In this paper, we quantize the weights of AlexNet without a large drop in accuracy by using a hybrid quantizer using uniform quantizer and k-means clustering.
KW - Convolutional Neural Networks
KW - Hybrid Quantizer
KW - Neural Networks Compression
KW - Weights Quantization
UR - http://www.scopus.com/inward/record.url?scp=85048514672&partnerID=8YFLogxK
U2 - 10.1109/BigComp.2018.00114
DO - 10.1109/BigComp.2018.00114
M3 - Conference contribution
AN - SCOPUS:85048514672
T3 - Proceedings - 2018 IEEE International Conference on Big Data and Smart Computing, BigComp 2018
SP - 638
EP - 641
BT - Proceedings - 2018 IEEE International Conference on Big Data and Smart Computing, BigComp 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 IEEE International Conference on Big Data and Smart Computing, BigComp 2018
Y2 - 15 January 2018 through 18 January 2018
ER -