TY - GEN
T1 - Accelerating Hydrocodes with OpenACC, OpenCL and CUDA
AU - Herdman, J.A.
AU - Gaudin, W.P.
AU - McIntosh-Smith, Simon N
AU - Boulton, Michael
AU - Beckingsale, D.A.
AU - Mallinson, A.C.
AU - Jarvis, S.A.
PY - 2012/6/29
Y1 - 2012/6/29
N2 - Hardware accelerators such as GPGPUs are becoming increasingly common in HPC platforms and their use is widely recognised as being one of the most promising approaches for reaching exascale levels of performance.Large HPC centres, such as AWE, have made huge investments in maintaining their existing scientific software codebases, the vast majority of which were not designed to effectively utilise accelerator devices.Consequently, HPC centres will have to decide how to develop their existing applications to take best advantage of future HPC system architectures.Given limited development and financial resources, it is unlikely that all potential approaches will be evaluated for each application. We are interested in how this decision making can be improved, and this work seeks to directly evaluate three candidate technologies---OpenACC, OpenCL and CUDA---in terms of performance, programmer productivity, and portability using a recently developed Lagrangian-Eulerian explicit hydrodynamics mini-application. We find that OpenACC is an extremely viable programming model for accelerator devices, improving programmer productivity and achieving better performance than OpenCL and CUDA.
AB - Hardware accelerators such as GPGPUs are becoming increasingly common in HPC platforms and their use is widely recognised as being one of the most promising approaches for reaching exascale levels of performance.Large HPC centres, such as AWE, have made huge investments in maintaining their existing scientific software codebases, the vast majority of which were not designed to effectively utilise accelerator devices.Consequently, HPC centres will have to decide how to develop their existing applications to take best advantage of future HPC system architectures.Given limited development and financial resources, it is unlikely that all potential approaches will be evaluated for each application. We are interested in how this decision making can be improved, and this work seeks to directly evaluate three candidate technologies---OpenACC, OpenCL and CUDA---in terms of performance, programmer productivity, and portability using a recently developed Lagrangian-Eulerian explicit hydrodynamics mini-application. We find that OpenACC is an extremely viable programming model for accelerator devices, improving programmer productivity and achieving better performance than OpenCL and CUDA.
UR - http://www.scopus.com/inward/record.url?eid=2-s2.0-84856329242&partnerID=8YFLogxK
U2 - 10.1109/SC.Companion.2012.66
DO - 10.1109/SC.Companion.2012.66
M3 - Conference Contribution (Conference Proceeding)
AN - SCOPUS:84876578465
SN - 9781467362184
SP - 465
EP - 471
BT - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis
PB - IEEE Computer Society
ER -