Skip to main navigation Skip to search Skip to main content

Accelerating Transformer Neural Networks on FPGAs for High Energy Physics Experiments

Filip Wojcicki, Zhiqiang Que*, Alexander D. Tapper, Wayne Luk

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

12 Citations (Scopus)

Abstract

High Energy Physics studies the fundamental forces and elementary particles of the Universe. With the unprecedented scale of experiments comes the challenge of accurate, ultra-low latency decision-making. Transformer Neural Networks (TNNs) have been proven to accomplish cutting-edge accuracy in classification for hadronic jet tagging. Nevertheless, software-centered solutions targeting CPUs and GPUs lack the inference speed required for real-time particle triggers, most notably those at the CERN Large Hadron Collider. This paper proposes a novel TNN-based architecture, efficiently mapped to Field-Programmable Gate Arrays, that outperforms GPU inference capabilities involving state-of-the-art neural network models by approximately 1000 times while preserving comparable classification accuracy. The design offers high customizability and aims to bridge the gap between hardware and software development by using High-Level Synthesis. Moreover, we propose a novel model-independent post-training quantization search algorithm that works in general hardware environments according to user-defined constraints. Experimental evaluation yields a 64% reduction in overall bit-widths with a 2% accuracy loss.
Original languageEnglish
Title of host publication2022 International Conference on Field-Programmable Technology (ICFPT)
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665453363
ISBN (Print)9781665453370
DOIs
Publication statusPublished - 15 Dec 2022
Event21st International Conference on Field-Programmable Technology, FPT 2022 - Hong Kong, Hong Kong
Duration: 5 Dec 20229 Dec 2022

Publication series

NameProceedings (International Conference on Field-Programmable Technology)
PublisherIEEE
ISSN (Print)2837-0430
ISSN (Electronic)2837-0449

Conference

Conference21st International Conference on Field-Programmable Technology, FPT 2022
Country/TerritoryHong Kong
CityHong Kong
Period5/12/229/12/22

Bibliographical note

Publisher Copyright:
© 2022 IEEE.

Fingerprint

Dive into the research topics of 'Accelerating Transformer Neural Networks on FPGAs for High Energy Physics Experiments'. Together they form a unique fingerprint.

Cite this