Abstract
In order to take advantage of the burgeoning diversity in processors at the frontier of supercomputing, the HPC community is migrating and improving codes to utilise heterogeneous nodes, where accelerators, principally GPUs, are highly prevalent in top-tier supercomputer designs. Programs therefore need to embrace at least some of the complexities of heterogeneous architectures. Parallel programming models have evolved to express heterogeneous paradigms whilst providing mechanisms for writing portable, performant programs. History shows that technologies first introduced at the frontier percolate down to local workhorse systems. However, we expect there will always be a mix of systems, some heterogeneous, but some remaining as homogeneous CPU systems. Thus it is important to ensure codes adapted for heterogeneous systems continue to run efficiently on CPUs. In this study, we explore how well widely used heterogeneous programming models perform on CPU-only platforms, and survey the performance portability they offer on the latest CPU architectures.
Original language | English |
---|---|
Title of host publication | 2022 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC) |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 1-13 |
Number of pages | 13 |
ISBN (Electronic) | 978-1-6654-6021-7 |
ISBN (Print) | 978-1-6654-6022-4 |
DOIs | |
Publication status | E-pub ahead of print - 30 Jan 2023 |
Event | 2022 International Workshop on Performance Portability and Productivity (P3HPC) - Duration: 13 Nov 2022 → 13 Nov 2022 |
Publication series
Name | |
---|---|
Publisher | IEEE |
ISSN (Print) | 2831-3895 |
ISSN (Electronic) | 2831-3909 |
Conference
Conference | 2022 International Workshop on Performance Portability and Productivity (P3HPC) |
---|---|
Period | 13/11/22 → 13/11/22 |
Bibliographical note
Funding Information:For the purpose of open access, the author(s) has applied a Creative Commons Attribution (CC BY) license to any Accepted Manuscript version arising. This work was in part funded by EPSRC through the Strategic Partnership in Computational Science for Advanced Simulation and Modelling of Engineering Systems (ASiMoV) project, EP/S005072/1.
Funding Information:
This work used the Isambard 2 UK National Tier-2 HPC Service (http://gw4.ac.uk/isambard/) operated by GW4 and the UK Met Office, and funded by EPSRC (EP/T022078/1). The University of Bristol is an Intel oneAPI Center of Excellence, which helped support this work. Thanks to AWS for supporting access to Graviton 3.
Publisher Copyright:
© 2022 IEEE.