Near Data Processing: Impact and Optimization of 3D Memory System Architecture on the Uncore

Near Data Processing: Impact and Optimization of 3D Memory System Architecture on the Uncore

Syed Minhaj Hassan, Sudhakar Yalamanchili and Saibal Mukhopadhyay. “Near Data Processing: Impact and Optimization of 3D Memory System Architecture on the Uncore.” 2015 International Symposium on Memory Systems (Memsys 2015). October 2015.

Abstract

A promising recent development that can provide continued scaling of performance is the ability to stack multiple DRAM layers on a multi-core processor die. This paper analyzes the interaction between the interconnection network and the memory hierarchy in such systems, and its impact on system performance. We explore the design considerations of a 3D system with DRAM-on-processor stacking and note that full advantages of 3D can only be achieved by configuring the memory with high number of channels. This significantly increases memory level parallelism which results in decreasing the trackingc per DRAM bank, reducing their queuing delays, but increasing it on the interconnection network, making remote accesses expensive. To reduce the latency and trackingcc on the network, we propose restructuring the memory hierarchy to a memory-side cache organization and also explore the ects of various address translations and OS page allocation strategies. Our results indicate that a carefully designed 3D memory system can already improve performance by 25-35% without looking towards new sophisticated techniques.

Download

paper [PDF]

Citation

@inproceedings{minhaj-memsys2015,
author={Syed Minhaj Hassan and Sudhakar Yalamanchili and Saibal Mukhopadhyay},
booktitle={2015 International Symposium on Memory Systems (Memsys 2015)},
title={Near Data Processing: Impact and Optimization of 3D Memory System Architecture on the Uncore},
year={2015},
month={October},
}