Title: PalQuant: Accelerating High-precision Networks on Low-precision Accelerators

Abstract: Recently, low-precision deep learning accelerators (DLAs) have become popular due to their advantages in chip area and energy consumption, yet low-precision quantized models on these DLAs suffer severe accuracy degradation. One way to achieve both high accuracy and efficient inference is to deploy high-precision neural networks on low-precision DLAs, which is rarely studied. In this paper, we propose the PArallel Low-precision Quantization (PalQuant) method, which approximates high-precision computations by learning parallel low-precision representations from scratch. In addition, we present a novel cyclic shuffle module to boost cross-group information communication between parallel low-precision groups. Extensive experiments demonstrate that PalQuant outperforms state-of-the-art quantization methods in both accuracy and inference speed; e.g., for ResNet-18 quantization, PalQuant obtains 0.52\% higher accuracy and a 1.78$\times$ speedup simultaneously over its 4-bit counterpart on a state-of-the-art 2-bit accelerator. Code is available at \url{this https URL}.
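
The abstract's core idea, approximating one high-precision layer with several parallel low-precision branches and exchanging information between branches via a cyclic shuffle, can be illustrated with a minimal PyTorch sketch. Everything here is an assumption for illustration, not the authors' released implementation: the names (fake_quantize, CyclicShuffle, PalQuantBlock), the symmetric fake quantizer, and the "pass half the channels to the next group" shuffle rule are all hypothetical.

```python
# Hypothetical sketch of parallel low-precision branches with a cyclic shuffle.
# Not the PalQuant code; a minimal illustration of the idea in the abstract.
import torch
import torch.nn as nn


def fake_quantize(x: torch.Tensor, num_bits: int) -> torch.Tensor:
    """Illustrative symmetric uniform fake quantization to `num_bits`."""
    qmax = 2 ** (num_bits - 1) - 1            # e.g. {-1, 0, 1} for 2 bits
    scale = x.detach().abs().max().clamp(min=1e-8) / qmax
    return torch.clamp(torch.round(x / scale), -qmax, qmax) * scale


class CyclicShuffle(nn.Module):
    """Each group receives half of its channels from the previous group in a
    cycle, giving cross-group information flow between low-precision branches."""

    def forward(self, feats):                 # feats: list of per-group maps
        half = feats[0].shape[1] // 2
        moved = [f[:, :half] for f in feats]  # slices handed to the next group
        kept = [f[:, half:] for f in feats]
        moved = moved[-1:] + moved[:-1]       # cyclic shift across groups
        return [torch.cat([m, k], dim=1) for m, k in zip(moved, kept)]


class PalQuantBlock(nn.Module):
    """One high-precision conv approximated by `groups` parallel low-bit convs."""

    def __init__(self, channels: int, groups: int = 2, bits: int = 2):
        super().__init__()
        self.bits = bits
        self.convs = nn.ModuleList(
            nn.Conv2d(channels, channels, 3, padding=1) for _ in range(groups)
        )
        self.shuffle = CyclicShuffle()

    def forward(self, feats):                 # list of per-group inputs
        outs = [
            torch.relu(fake_quantize(conv(f), self.bits))
            for conv, f in zip(self.convs, feats)
        ]
        return self.shuffle(outs)


# Usage: two 2-bit groups jointly standing in for one higher-precision layer;
# the group outputs would typically be aggregated (e.g., summed) at the end.
x = torch.randn(1, 16, 32, 32)
block = PalQuantBlock(channels=16, groups=2, bits=2)
group_feats = block([x, x])
y = torch.stack(group_feats, dim=0).sum(0)
```

The shuffle is what lets each low-bit group see features computed by its neighbor; without it the parallel branches would evolve independently, which matches the abstract's motivation for the cyclic shuffle module.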
Comments: Accepted by ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2208.01944 [cs.CV]
  (or arXiv:2208.01944v1 [cs.CV] for this version)

Submission history

From: Qinghao Hu
[v1] Wed, 3 Aug 2022 09:44:13 GMT (337kb,D)
