From Master Foilset for CPS615 Introduction -- Material from Culler and Koelbel Computational Science for Simulations -- Fall Semester 1998. byGeoffrey C. Fox, Nancy McCracken
Combining the (dis)advantages of shared and distributed memory
Lots of hierarchical designs are appearing.
Typically, "shared memory nodes" with 4 to 32 processors
Each processor has a local cache
Processors within a node access shared memory
Nodes can get data from or put data to other nodes' memories