Processing-In-Memory (PIM) Architectures

Neil Jasper

NPAC

Processing-In-Memory (PIM) Architectures

Acknowledgements

Memory & CPU Bandwidth Gap

Key Points

This Talk: A Better Way!

Observations on This Talk

HPCC & TeraFlops

Petaflop Chain of Events

Results from Pasadena ‘94

PetaFlops Applications

Pasadena Architectures

Bodega Bay: Primary Memory

Bodega Bay: Secondary Memory

Bodega Bay: Aggregate I/O

Cost Considerations: Processors

Cost Considerations: Memory

The “Hidden Costs” of Modern Systems

The Overlooked Bandwidth

Modern “Alternative” RAMs

Processing In Memory (PIM):

PIM: Optimizing the System

Market Demand for Dense Processing

Current PIM Chips

Key Problem: Memory Density

Vendors with Known DRAM PIM Capability

EXECUBE: The First High Density PIM

Execube Processing Node

Tiling of Execube Processing Nodes

Lessons Learned from EXECUBE

New “Strawman” PIM

“Strawman” Chip Floorplan

Strawman Chip Interfaces

Strawman PIM Chip with I/O Macros

Strawman Properties

Strawman PIM “Memory Card”

Choosing the Processing Macro

Performance Per Transistor

SIA-Based PIM Chip Projections

Silicon Area for a Teraflop

Parallelism

Petaflop PIM System Size

Power Projections (Logic)

Power Per Sq. Cm

3D Stacking

Potential PIM Cube

Potential PIM Cube

Further Work: Hardware

Further Work:Algorithm Development

Further Work: Software Development

Current ND PIM Work In Progress

Conclusion