Projects per year
Abstract
Fully binarised convolutional neural networks (CNNs) deliver very high inference performance using singlebit weights and activations, together with XNOR type operators for the kernel convolutions. Current research shows that full binarisation results in a degradation of accuracy and different
approaches to tackle this issue are being investigated such as using more complex models as accuracy reduces. This paper proposes an alternative based on a multi-precision CNN framework
that combines a binarised and a floating point CNN in a pipeline configuration deployed on heterogeneous hardware. The binarised CNN is mapped onto an FPGA device and used to perform inference over the whole input set while the floating point network is mapped onto a CPU device and performs reinference only when the classification confidence level is low. A light-weight confidence mechanism enables a flexible trade-off between accuracy and throughput. To demonstrate the concept, we choose a Zynq 7020 device as the hardware target and show
that the multi-precision network is able to increase the BNN accuracy from 78.5% to 82.5% and the CPU inference speed from 29.68 to 90.82 images/sec.
approaches to tackle this issue are being investigated such as using more complex models as accuracy reduces. This paper proposes an alternative based on a multi-precision CNN framework
that combines a binarised and a floating point CNN in a pipeline configuration deployed on heterogeneous hardware. The binarised CNN is mapped onto an FPGA device and used to perform inference over the whole input set while the floating point network is mapped onto a CPU device and performs reinference only when the classification confidence level is low. A light-weight confidence mechanism enables a flexible trade-off between accuracy and throughput. To demonstrate the concept, we choose a Zynq 7020 device as the hardware target and show
that the multi-precision network is able to increase the BNN accuracy from 78.5% to 82.5% and the CPU inference speed from 29.68 to 90.82 images/sec.
Original language | English |
---|---|
Title of host publication | Design, Automation & Test in Europe Conference & Exhibition (DATE), 2018 |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 419-424 |
Number of pages | 6 |
ISBN (Electronic) | 9783981926309, 9783981926316 |
DOIs | |
Publication status | E-pub ahead of print - 23 Apr 2018 |
Event | DATE 2018 - Dresden, Germany Duration: 19 Mar 2018 → 23 Mar 2018 |
Publication series
Name | |
---|---|
ISSN (Print) | 1558-1101 |
Conference
Conference | DATE 2018 |
---|---|
Country/Territory | Germany |
City | Dresden |
Period | 19/03/18 → 23/03/18 |
Fingerprint
Dive into the research topics of 'Multi-precision convolutional neural networks on heterogeneous hardware'. Together they form a unique fingerprint.Projects
- 1 Finished
-
ENergy Efficient Adaptive Computing with multi-grain heterogeneous architectures (ENEAC)
Nunez-Yanez, J. L. (Principal Investigator)
5/01/16 → 4/01/20
Project: Research