Many-core acceleration of a discrete ordinates transport mini-app at extreme scale

Tom J Deakin*, Simon N McIntosh-Smith, Wayne Gaudin

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

6 Citations (Scopus)
267 Downloads (Pure)

Abstract

Time-dependent deterministic discrete ordinates transport codes are an important class of application which provide significant challenges for large, many-core systems. One such challenge is the large memory capacity needed by the solve step, which requires us to have a scalable solution in order to have enough node-level memory to store all the data. In our previous work, we demonstrated the first implementation which showed a significant performance benefit for single node solves using GPUs. In this paper we extend our work to large problems and demonstrate the scalability of our solution on two Petascale GPU-based supercomputers: Titan at Oak Ridge and Piz Daint at CSCS. Our results show that our improved node-level parallelism scheme scales just as well across large systems as previous approaches when using the tried and tested KBA domain decomposition technique. We validate our results against an improved performance model which predicts the runtime of the main ‘sweep’ routine when running on different hardware, including CPUs or GPUs.

Original languageEnglish
Title of host publicationHigh Performance Computing, Networking and Storage
Subtitle of host publicationProceedings of the 30th International Conference, ISC High Performance 2016, Frankfurt, Germany, June 19-23, 2016
Editors Julian M Kunkel, Pavan Balaji, Jack Dongarra
PublisherSpringer Verlag
Pages429-448
Number of pages20
ISBN (Electronic)9783319413211
ISBN (Print)9783319413204
DOIs
Publication statusPublished - 15 Jun 2016
Event31st International Conference on High Performance Computing, ISC High Performance 2016 - Frankfurt, Germany
Duration: 19 Jun 201623 Jun 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9697
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference31st International Conference on High Performance Computing, ISC High Performance 2016
CountryGermany
CityFrankfurt
Period19/06/1623/06/16

Fingerprint Dive into the research topics of 'Many-core acceleration of a discrete ordinates transport mini-app at extreme scale'. Together they form a unique fingerprint.

Cite this