Fixing 10-20 Terabytes of Memory, we can get
|
16000 way parallel natural evolution of today's machines with various architectures from distributed shared memory to clustered heirarchy
-
Peak Performance is 150 Teraflops with memory systems like today but worse with more levels of cache
|
5000 way parallel Superconducting system with 1 Petaflop performance but terrible imbalance between CPU and memory speeds
|
12 million way parallel PIM system with 12 petaflop performance and "distributed memory architecture" as off chip access with have serious penalities
|
There are many hybrid and intermediate choices -- these are extreme examples of "pure" architectures
|