Skip to main navigation Skip to search Skip to main content

Optimizing Reconfigurable Recurrent Neural Networks

Zhiqiang Que, Hiroki Nakahara, Eriko Nurvitadhi, Hongxiang Fan, Chenglong Zeng, Jiuxi Meng, Xinyu Niu, Wayne Luk

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

34 Citations (Scopus)

Abstract

This paper proposes a novel latency-hiding hardware architecture based on column-wise matrix-vector multiplication to eliminate data dependency, improving the throughput of systems of RNN models. In addition, a flexible checkerboard tiling strategy is introduced to allow large weight matrices, while supporting element-based parallelism and vector-based parallelism. These optimizations improve the exploitation of the available parallelism to increase run-time hardware utilization and boost inference throughput. Furthermore, a quantization scheme with fine-tuning is proposed to achieve high accuracy. Evaluation results show that the proposed architecture can enhance performance and energy efficiency with little accuracy loss. It achieves 1.05 to 3.35 times better performance and 1.22 to 3.92 times better hardware utilization than a state-of-theart FPGA-based LSTM design, which shows that our approach contributes to high performance FPGA-based LSTM systems.
Original languageEnglish
Title of host publication2020 IEEE 28th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages10-18
Number of pages9
ISBN (Electronic)9781728158037
ISBN (Print)9781728158044
DOIs
Publication statusPublished - 11 Jun 2020
Event28th Annual IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2020 - Fayetteville, United States
Duration: 3 May 20206 May 2020

Publication series

NameProceedings (Annual IEEE Symposium on Field-Programmable Custom Computing Machines)
PublisherIEEE
ISSN (Print)2576-2613
ISSN (Electronic)2576-2621

Conference

Conference28th Annual IEEE International Symposium on Field-Programmable Custom Computing Machines, FCCM 2020
Country/TerritoryUnited States
CityFayetteville
Period3/05/206/05/20

Bibliographical note

Publisher Copyright:
© 2020 IEEE.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Fingerprint

Dive into the research topics of 'Optimizing Reconfigurable Recurrent Neural Networks'. Together they form a unique fingerprint.

Cite this