Full Index for Scripted Second set of lectures on CPS615 Parallel Computing Overview

Foilset Search Full Index for Scripted foilset

Second set of lectures on CPS615 Parallel Computing Overview

Given by Geoffrey C. Fox at CPS615 Basic Simulation Track for Computational Science on Fall Semester 95. Foils prepared 18 Sept 1995

This starts with a discussion of Parallel Computing using analogies from nature
It uses foils and material from CSEP chapter on Computer Architecture to discuss how and why to build a parallel computer including synchronization memory structure and network issues
SIMD and MIMD Architectures with a brief comparison of workstation networks with closely coupled systems
A look to the future is based on results from Petaflops workshop

This mixed presentation uses parts of the following base foilsets which can also be looked at on their own!

Master Material for Second set of lectures on CPS615 Parallel Computing Overview
Master Set A of Overview Material on Parallel Computing for CPS615 Foils
Master Set B of Overview Material on Parallel Computing for CPS615 Foils

Table of Contents for Second set of lectures on CPS615 Parallel Computing Overview

There are two types of foils -- html and image which are each available in basic and JavaScript enabled "focused" style
(basic:)(focus style:) Denote Foils where Image Critical
(basic:)(focus style:) Denote Foils where Image has important information
(basic:)(focus style:) Denote Foils where HTML is sufficient
Indicates Available audio which is greyed out if missing

CPS 615 Lectures 1995 Fall Semester -- set 2

This Presentation is made up by selecting foils from modules or or foilsets which are sometimes specific to CPS615 and sometimes generic
There is a master foilset CPS615master95-1,2.. and set of specific modules labelled CPS615A95 B95 etc.
See CPS 615 Basic Overview of Computational Science -- Simulation Track for complete course information

CPS615 -- Base Course for the Simulation Track of Computational Science
Fall Semester 1995 --
Lecture Stream 2

Abstract of Lecture Stream 2 of CPS615

Elementary Discussion of Parallel Computing

Including Analogies with Society

Elementary Discussion of
Parallel Computing

Single nCUBE2 CPU Chip

64 Node nCUBE Board

CM-5 in NPAC Machine Room

Basic METHODOLOGY of Parallel Computing

Concurrent Computation as a Mapping Problem -I

Concurrent Computation as a Mapping Problem - II

Concurrent Computation as a Mapping Problem - III

Finite Element Mesh From Nastran
(mesh only shown in upper half)

A Simple Equal Area Decomposition

Decomposition After Annealing
(one particularly good but nonoptimal decomposition)

Parallel Processing and Society

Concurrent Construction of a Wall
Using N = 8 Bricklayers
Decomposition by Vertical Sections

Quantitative Speed-Up Analysis for Construction of Hadrian's Wall

Amdahl's law for Real World Parallel Processing

Pipelining --Another Parallel Processing Strategy for Hadrian's Wall

Hadrian's Wall Illustrates that the Topology of Processor Must Include Topology of Problem

General Speed Up Analysis

Comparison of The Complete Problem to the subproblems formed in domain decomposition

Hadrian's Wall Illustrating an
Irregular but Homogeneous Problem

Some Problems are Inhomogeneous Illustrated by:
An Inhomogeneous Hadrian Wall with Decoration

Global and Local Parallelism Illustrated by Hadrian's Wall

Parallel I/O Illustrated by
Concurrent Brick Delivery for Hadrian's Wall
Bandwidth of Trucks and Roads
Matches that of Masons

Nature's Concurrent Computers

Comparison of Concurrent Processing in Society and Computing

General Overview of Computer Architecture

Computational Science CPS615
Simulation Track Overview
Foilsets B 1995

Abstract of CPS615 Foilsets B 1995

Overview of
Parallel Hardware Architecture

3 Major Basic Hardware Architectures

Examples of the Three Current Concurrent Supercomputer Architectures

Parallel Computer Architecture Issues

General Types of Synchronization

Granularity of Parallel Components

Types of Parallel Memory Architectures
-- Logical Structure

Types of Parallel Memory Architectures -- Physical Characteristics

Diagrams of Shared and Distributed Memories

Overview of Interconnection Networks

Survey of Issues in Communication Networks

Glossary of Useful Concepts in Communication Systems

Switch and Bus based Architectures

Classes of Communication Network include ...

Point to Point Networks (Store and Forward) -- I

Examples of Interconnection Topologies

Degree and Diameter of Ring and Mesh(Torus) Architectures

Details on Hypercube and Mapping to Meshs

Degree and Diameter of Hypercube and Tree Architectures

Rules for Making Hypercube Network Topologies

Mapping of Hypercubes into Three Dimensional Meshes

Mapping of Hypercubes into One Dimensional Systems

The One dimensional Mapping can be thought of as for one dimensional problem solving or one dimensional layout of chips forming hypercube

Hypercube Versus Mesh Topologies

Practical Network Issues

Point to Point Networks (Store and Forward) -- II

Latency and Bandwidth of a Network

Transfer Time in Microseconds for both Shared Memory Operations and Explicit Message Passing

Latency/Bandwidth Space for 0-byte message(Latency) and 1 MB message(bandwidth).

Switches versus Processor Networks

Circuit Switched Networks

Parallel Architectures in More Detail

General Issues

Let's Return to General Parallel Architectures in more detail

Overview of Computer Architecture Issues

Some Global Computer Architecture Issues

Two General Real World Architectural Issues

SIMD MIMD Shared versus Distributed

MIMD Distributed Memory Architecture

Some MIMD Architecture Issues

SIMD (Single Instruction Multiple Data) Architecture

SIMD Architecture Issues

Shared Memory Architecture

Shared versus Distributed Memory

Classic Vector Supercomputers

The General Structure of a full sized CRAY C-90

The General Structure of a NEC SX-3
Classic Vector Supercomputer

Comparison of MIMD and SIMD Parallelism seen on Classic Vector Supercomputers

Petaflop Performance in the Year 2015

What will happen in the year 2015 with .05 micron feature size and Petaflop Supercomputers using CMOS

CMOS Technology and Parallel Processor Chip Projections

Processor Chip Requirements for a Petaflop Machine Using 0.05 Micron Technology

Three Designs for a Year 2015 Petaflops machine with 0.05 micron technology

The Global Shared Memory Category I Petaflop Architecture

Category II Petaflop Architecture -- Network of microprocessors

Category III Petaflop Design -- Processor in Memory (PIM)

Necessary Latency to Support Three Categories

Chip Density Projections to year 2013

DRAM Chip count for Construction of Petaflop computer in year 2013 using 64 Gbit memory parts

Memory Chip Bandwidth in Gigabytes/sec

Power and I/O Bandwidth (I/O Connections) per Chip throught the year 2013

Clock Speed and I/O Speed in megabytes/sec per pin through year 2013

Full WebWisdom URL and this Foilset Search
This contains all WebWisdom links preceded by those referenced in this foilset

List of WebWisdom URL's Used in this Foilset

key cps615homepage95 URL http://www.npac.syr.edu/projects/cps615fall95/ * CPS 615 Basic Overview of Computational Science -- Simulation Track by gcf on Sept 1,1995
Times 2 Foils referenced Script Script

If you have any comments about this server, send e-mail to webmaster@npac.syr.edu.

Page produced by wwwfoil on Fri Aug 15 1997