We present an OpenCL implementation of our improved algorithm which demonstrates a speedup of up to 2.5x the transport sweep performance on a many-core GPGPU device compared to a state-of-the-art multi-core node; the first time this scale of speedup has been achieved for algorithms of this class.
|Title of host publication||2015 IEEE International Conference on Cluster Computing|
|Subtitle of host publication||Workshop on Representative Applications|
|Publication status||Published - 8 Sep 2015|
|Event||International Workshop on Representative Applications (WRAp) - IEEE Cluster, Chicago, United States|
Duration: 8 Sep 2015 → 8 Sep 2015
|Workshop||International Workshop on Representative Applications (WRAp)|
|Period||8/09/15 → 8/09/15|
FingerprintDive into the research topics of 'Expressing Parallelism on Many-Core for Deterministic Discrete Ordinates Transport'. Together they form a unique fingerprint.
Susan L Pywell (Manager), Simon A Burbidge (Other), Polly E Eccleston (Other) & Simon H Atack (Other)