Speculative Execution On Multi-GPU Systems
Gregory Diamos and Sudhakar Yalamanchili. “Speculative Execution On Multi-GPU Systems.” IEEE International Parallel & Distributed Processing Symposium. April 2010.
Abstract
In previous work, we presented the Harmony execution model for computing on heterogeneous systems with several CPUs and accelerators. In this paper, we extend Harmony to target systems with multiple accelerators using control speculation to expose parallelism. We refer to this technique as Kernel Level Speculation (KLS). We argue that dynamic parallelization techniques such as KLS are sufficient to scale applications across several accelerators based on the intuition that there will be fewer distinct accelerators than cores within each accelerator. In this paper, we use a complete prototype of the Harmony runtime that we developed to explore the design decisions and trade-offs in the implementation of KLS. We show that KLS improves parallelism to a sufficient degree while retaining a sequential programming model. We accomplish this by demonstrating good scaling of KLS on a highly heterogeneous system with three distinct accelerator types and ten processors.
Citation
AUTHOR="Gregory Diamos and Sudhakar Yalamanchili",
TITLE="Speculative Execution on {Multi-GPU} Systems",
BOOKTITLE="24th IEEE International Parallel \& Distributed Processing Symposium",
ADDRESS="Atlanta, Georgia, USA",
DAYS=19,
MONTH=4,
YEAR=2010,
KEYWORDS="Harmony; Heterogeneous; Runtimes; Compilers; CUDA; GPU; GPGPU;"
}