Basic IMAGE version of Foils prepared 17 November 98

Foil 92 Cache Only COMA Machines

From Master Foilset for HPC Achitecture Overview CPS615 Introduction to Computational Science -- Fall Semester 1998. by Geoffrey C. Fox
(0 to 5):





© Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu

If you have any comments about this server, send e-mail to webmaster@npac.syr.edu.

Page produced by wwwfoil on Sun Apr 11 1999

Table of Contents for Master Foilset for HPC Achitecture Overview


1 Computer Architecture for Computational Science
2 Abstract of Computer Architecture Overview
3 Some NPAC Parallel Machines
4 Architectural Trends I
5 Architectural Trends
6 3 Classes of VLSI Design?
7 Architectural Trends: Bus-based SMPs
8 Bus Bandwidth
9 Economics
10 Important High Performance Computing Architectures
11 Some General Issues Addressed by High Performance Architectures
12 What is a Pipeline -- Cafeteria Analogy?
13 Example of MIPS R4000 Floating Point
14 MIPS R4000 Floating Point Stages
15 Sequential Memory Structure
16 Cache Issues I
17 Cache Issues II
18 Spatial versus Temporal Locality I
19 Spatial versus Temporal Locality II
20 Parallel Computer Memory Structure
21 Cache Coherence
22 Raw Uniprocessor Performance: Cray v. Microprocessor LINPACK n by n Matrix Solves
23 Raw Parallel Performance: LINPACK
24 Linear Linpack HPC Performance versus Time
25 Top 10 Supercomputers November 1998
26 Distribution of 500 Fastest Computers
27 CPU Technology used in Top 500 versus Time
28 Geographical Distribution of Top 500 Supercomputers versus time
29 Node Technology used in Top 500 Supercomputers versus Time
30 Total Performance in Top 500 Supercomputers versus Time and Manufacturer
31 Number of Top 500 Systems as a function of time and Manufacturer
32 Total Number of Top 500 Systems Installed June 98 versus Manufacturer
33 Two Basic Programming Models
34 Shared Address Space Architectures
35 Shared Address Space Model
36 Communication Hardware
37 History -- Mainframe
38 History -- Minicomputer
39 Scalable Interconnects
40 Message Passing Architectures
41 Message-Passing Abstraction e.g. MPI
42 First Message-Passing Machines
43 Mark2 Hypercube built by JPL(1985) Cosmic Cube (1983) built by Caltech (Chuck Seitz)
44 64 Ncube Processors (each with 6 memory chips) on a large board
45 ncube1 Chip -- integrated CPU and communication channels
46 Example of Message Passing System: IBM SP-2
47 Example of Message Passing System: Intel Paragon
48 Clusters of PC's 1986-1998
49 HP Kayak PC (300 MHz Intel Pentium II) vs Origin 2000
50 Cray Vector Supercomputers
51 Cray/SGI memory latencies
52 Architecture of Cray T3E
53 T3E Messaging System
54 Cray T3E Cache Structure
55 Cray T3E Cache Performance
56 Finite Difference Example for T3E Cache Use I
57 Finite Difference Example for T3E Cache Use II
58 How to use Cache in Example I
59 How to use Cache in Example II
60 SGI Origin 2000 I
61 SGI Origin II
62 SGI Origin Block Diagram
63 SGI Origin III
64 SGI Origin 2 Processor Node Board
65 Performance of NCSA 128 node SGI Origin 2000
66 Cache Coherent or Not?
67 Summary of Cache Coherence Approaches
68 SMP Example: Intel Pentium Pro Quad
69 Sun E10000 in a Nutshell
70 Sun Enterprise Systems E6000/10000
71 Starfire E10000 Architecture I
72 Starfire E10000 Architecture II
73 Sun Enterprise E6000/6500 Architecture
74 Sun's Evaluation of E10000 Characteristics I
75 Sun's Evaluation of E10000 Characteristics II
76 Scalability of E1000
77 MPI Bandwidth on SGI Origin and Sun Shared Memory Machines
78 Latency Measurements on Origin and Sun for MPI
79 Tera Multithreaded Supercomputer
80 Tera Computer at San Diego Supercomputer Center
81 Overview of the Tera MTA I
82 Overview of the Tera MTA II
83 Tera 1 Processor Architecture from H. Bokhari (ICASE)
84 Tera Processor Characteristics
85 Tera System Diagram
86 Interconnect / Communications System of Tera I
87 Interconnect / Communications System of Tera II
88 T90/Tera MTA Hardware Comparison
89 Tera Configurations / Performance
90 Performance of MTA wrt T90 and in parallel
91 Tera MTA Performance on NAS Benchmarks Compared to T90
92 Cache Only COMA Machines
93 Examples of Some SIMD machines
94 Consider Scientific Supercomputing
95 Toward Architectural Convergence
96 Convergence: Generic Parallel Architecture
97 SIMD CM 2 from Thinking Machines
98 Official Thinking Machines Specification of CM2
99 GRAPE Special Purpose Machines
100 Quantum ChromoDynamics (QCD) Special Purpose Machines
101 ASCI Red -- Intel Supercomputer at Sandia
Click outside pointer rectangle to move pointer
Click on Pointer to Hide
Click on Pointer + ALT to toggle message hiding
Click on Pointer + CNTL to abolish pointer
Click on Pointer + Shift to cycle families
Click outside + Alt is Change Image
Click outside + Control is Double Size
Click outside + Shift is Halve Size
Right Mouse Down on Pointer Toggles Index
Shift Right Mouse aligns top with scrolled Page
While With Mouse Down on Current Pointer
h hides This Message while m restores
i Toggles Index Aligned with Page Top
j Toggles Index Aligned with Scrolled View Top
a Abolishes Pointer while CNTL-Click restores
f cycles through pointer families
c cycles through members of a family
u increases Size Up and d decreases Down
Mouse Up-Down between changes of
Pointer to process new option