Architecture-Independent Modeling of Intra-Node Data Movement

Architecture-Independent Modeling of Intra-Node Data Movement

Eric Anger, Sudhakar Yalamanchili, Scott Pakin and Patrick McCormick. “Architecture-Independent Modeling of Intra-Node Data Movement.” The LLVM Compiler Infrastructure in HPC Workshop (in conjunction with Supercomputing 2014). November 2014.

Abstract

A primary concern of future high performance systems is the way data movement is managed; the sheer scale of data to be processed directly affects the achievable performance these systems can attain. However, the increasingly complex but inherently symbiotic relationships between upcoming scientific applications and high-performance architectures necessitate increasingly informative and flexible tools to ensure performance goals are met.

In this work we develop a memory-hierarchy model that quantifies a given application’s cache behavior. What makes this work unique is that we instrument code at compile time, gather architecture-independent data at run time using a generic memory-hierarchy model, and delay selecting a particular cache hierarchy (levels, sizes, and associativities) to a post-processing step, where cache performance can be derived rapidly without having to re-run a slow cache simulator. We show that this approach is capable of predicting cache misses to within 13% of what is predicted by a traditional, high-fdelity, but slow cache simulator

Download

paper [PDF]

Citation

@inproceedings{anger-llvm2014,
author={Eric Anger and Sudhakar Yalamanchili and Scott Pakin and Patrick McCormick},
booktitle={The LLVM Compiler Infrastructure in HPC Workshop (in conjunction with Supercomputing 2014)},
title={Architecture-Independent Modeling of Intra-Node Data Movement},
year={2014},
month={November},
}