Abstract
In this paper, we investigate the application of early-exit strategies to quantized neural networks with binarized weights, mapped to low-cost FPGA SoC devices. The increasing complexity of network models means that hardware reuse and heterogeneous execution are needed and this opens the opportunity to evaluate the prediction confidence level early on. We apply the early-exit strategy to a network model suitable for ImageNet classification that combines weights with floating-point and binary arithmetic precision. The experiments show an improvement in inferred speed of around 20% using an early-exit network, compared with using a single primary neural network, with a negligible accuracy drop of 1.56%.
Original language | English |
---|---|
Title of host publication | Proceedings - 2022 25th Euromicro Conference on Digital System Design, DSD 2022 |
Editors | Himar Fabelo, Samuel Ortega, Amund Skavhaug |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 197-204 |
Number of pages | 8 |
ISBN (Electronic) | 9781665474047 |
DOIs | |
Publication status | Published - 2022 |
Event | 25th Euromicro Conference on Digital System Design, DSD 2022 - Maspalomas, Spain Duration: 31 Aug 2022 → 2 Sept 2022 |
Publication series
Name | Proceedings - 2022 25th Euromicro Conference on Digital System Design, DSD 2022 |
---|---|
ISSN (Print) | 2639-3859 |
ISSN (Electronic) | 2771-2508 |
Conference
Conference | 25th Euromicro Conference on Digital System Design, DSD 2022 |
---|---|
Country/Territory | Spain |
City | Maspalomas |
Period | 31/08/22 → 2/09/22 |
Bibliographical note
Funding Information:ACKNOWLEDGMENTS This research was partially funded by the Royal Society Industry fellowship, INF\R2\192044 Machine Intelligence at the Network Edge (MINET), EPSRC HOPWARE EP\RV040863\1 and Leverhulme trust international fellowship High-performance video analytics with parallel heterogeneous neural networks IF-2021-003 .
Publisher Copyright:
© 2022 IEEE.
Keywords
- Early-exit
- FPGAs
- Hardware Acceleration
- Neural Network