In Partial Fulfillment of the Requirements for the Degree of Master of Science
will defend his thesis
A Strategy for Mapping Threads to GPUs in a Directive-Based Programming Model
The proliferation of accelerators in modern clusters makes efficient coprocessor programming a key requirement if application codes are to achieve high levels of performance with acceptable energy consumption on such platforms. This has led to considerable effort to provide suitable programming models for these accelerators, especially within the OpenMP community. While OpenMP 4.5 offers a rich set of directives, clauses and runtime calls to fully utilize accelerators, an efficient implementation o f OpenMP 4.5 for GPUs remains a non-trivial task, given their multiple levels of thread parallelism.
In this thesis, we describe a new implementation of the corresponding features of OpenMP 4.5 for GPUs based on a one-to-one mapping of its loop hierarchy parallelism to the GPU thread hier- archy. We assess the impact of this mapping, in particular the use of GPU warps to handle innermost loop execution, on the performance of GPU execution via a set of benchmarks that include a version of the NAS parallel benchmarks specifically developed for this research; we also used the Matrix-Matrix multiplication, Jacobi, Gauss and Laplacian kernels.
Date: Thursday, April 20, 2017
Time: 10:00 AM
Place: PGH 516
Advisor: Dr. Edgar Gabriel
Faculty, students, and the general public are invited.