Abstract
Transformers are the state-of-the-art model architectures and widely used in application areas of machine learning. However the performance of such architectures is less well explored in the ultra-low latency domains where deployment on FPGAs or ASICs is required. Such domains include the trigger and data acquisition systems of the LHC experiments. We present a transformer-based algorithm for jet tagging built with the HGQ2 framework, which is able to produce a model with heterogeneous bitwidths for fast inference on FPGAs, as required in the trigger systems at the LHC experiments. The bitwidths are acquired during training by minimizing the total bit operations as an additional parameter. By allowing a bitwidth of zero, the model is pruned in-situ during training. Using this quantization-aware approach, our algorithm achieves state-of-the-art performance while also retaining permutation invariance which is a key property for particle physics applications. Due to the strength of transformers in representation learning, our work also serves as a stepping stone for the development of a larger foundation model for trigger applications.
| Original language | English |
|---|---|
| Title of host publication | The European Physical Society Conference on High Energy Physics (EPS-HEP2025) |
| Publisher | Proceedings of Science |
| Number of pages | 9 |
| DOIs | |
| Publication status | Published - 11 Mar 2026 |
| Event | 2025 European Physical Society Conference on High Energy Physics, EPS-HEP 2025 - Marseille, France Duration: 7 Jul 2025 → 11 Jul 2025 |
Publication series
| Name | The European Physical Society Conference on High Energy Physics |
|---|---|
| Publisher | PoS |
| ISSN (Electronic) | 1824-8039 |
Conference
| Conference | 2025 European Physical Society Conference on High Energy Physics, EPS-HEP 2025 |
|---|---|
| Country/Territory | France |
| City | Marseille |
| Period | 7/07/25 → 11/07/25 |
Bibliographical note
Publisher Copyright:© Copyright owned by the author(s) under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0) All rights for text and data mining, AI training, and similar technologies for commercial purposes, are reserved.
Fingerprint
Dive into the research topics of 'Low-latency Jet Tagging for HL-LHC Using Transformer Architectures'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver