Abstract
High Energy Physics studies the fundamental forces and elementary particles of the Universe. With the unprecedented scale of experiments comes the challenge of accurate, ultra-low latency decision-making. Transformer Neural Networks (TNNs) have been proven to accomplish cutting-edge accuracy in classification for hadronic jet tagging. Nevertheless, software-centered solutions targeting CPUs and GPUs lack the inference speed required for real-time particle triggers, most notably those at the CERN Large Hadron Collider. This paper proposes a novel TNN-based architecture, efficiently mapped to Field-Programmable Gate Arrays, that outperforms GPU inference capabilities involving state-of-the-art neural network models by approximately 1000 times while preserving comparable classification accuracy. The design offers high customizability and aims to bridge the gap between hardware and software development by using High-Level Synthesis. Moreover, we propose a novel model-independent post-training quantization search algorithm that works in general hardware environments according to user-defined constraints. Experimental evaluation yields a 64% reduction in overall bit-widths with a 2% accuracy loss.
| Original language | English |
|---|---|
| Title of host publication | 2022 International Conference on Field-Programmable Technology (ICFPT) |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781665453363 |
| ISBN (Print) | 9781665453370 |
| DOIs | |
| Publication status | Published - 15 Dec 2022 |
| Event | 21st International Conference on Field-Programmable Technology, FPT 2022 - Hong Kong, Hong Kong Duration: 5 Dec 2022 → 9 Dec 2022 |
Publication series
| Name | Proceedings (International Conference on Field-Programmable Technology) |
|---|---|
| Publisher | IEEE |
| ISSN (Print) | 2837-0430 |
| ISSN (Electronic) | 2837-0449 |
Conference
| Conference | 21st International Conference on Field-Programmable Technology, FPT 2022 |
|---|---|
| Country/Territory | Hong Kong |
| City | Hong Kong |
| Period | 5/12/22 → 9/12/22 |
Bibliographical note
Publisher Copyright:© 2022 IEEE.
Fingerprint
Dive into the research topics of 'Accelerating Transformer Neural Networks on FPGAs for High Energy Physics Experiments'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver