Basic HTML version of Foils prepared December 6 98

Foil 13 How To Get Performance From Commodity Processors?

From Java Access to Numerical Libraries: Compiling Fortran to Java SC98 Orlando Java Grande Panel -- November 13 98. by Jack Dongarra, Christian Deane, Keith Seymour, Clint Whaley


Today's processors can achieve high-performance, but this requires extensive machine-specific hand tuning.
Routines have a large design space w/many parameters
  • blocking sizes, loop nesting permutations, loop unrolling depths, software pipelining strategies, register allocations, and instruction schedules.
  • Complicated interactions with the increasingly sophisticated microarchitectures of new microprocessors.
A few months ago no tuned BLAS for Pentium for Linux.
Need for quick deployment of optimized routines.
ATLAS - Automatic Tuned Linear Algebra Software



© Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu

If you have any comments about this server, send e-mail to webmaster@npac.syr.edu.

Page produced by wwwfoil on Sun Dec 6 1998