Basic HTML version of Foils prepared December 6 98

Foil 13 How To Get Performance From Commodity Processors?

From Java Access to Numerical Libraries: Compiling Fortran to Java SC98 Orlando Java Grande Panel -- November 13 98. by Jack Dongarra, Christian Deane, Keith Seymour, Clint Whaley


1 Today's processors can achieve high-performance, but this requires extensive machine-specific hand tuning.
2 Routines have a large design space w/many parameters
  • blocking sizes, loop nesting permutations, loop unrolling depths, software pipelining strategies, register allocations, and instruction schedules.
  • Complicated interactions with the increasingly sophisticated microarchitectures of new microprocessors.
3 A few months ago no tuned BLAS for Pentium for Linux.
4 Need for quick deployment of optimized routines.
5 ATLAS - Automatic Tuned Linear Algebra Software

in Table To:


© Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu

If you have any comments about this server, send e-mail to webmaster@npac.syr.edu.

Page produced by wwwfoil on Sun Dec 6 1998