Official HPCC Results from the Acceptance Tests

HPCC Results

Alamo

Nodes  CPUs  Cores  G-HPL     G-PTRANS  G-FFTE    G-Random  G-STREAM   EP-STREAM  EP-DGEMM  Random Ring Bandwidth  Random Ring Latency  % HPL Peak(*)
                    TFLOP/s   GB/s      GFLOP/s   GUP/s     GB/s       GB/s       GFLOP/s   GB/s                   usec                 Percent
32     64    256    2.291     29.81     93.54     0.92      3700.060   3.71       10.691    0.219                  14.86                84.12

India

Nodes  CPUs  Cores  G-HPL     G-PTRANS  G-FFTE    G-Random  G-STREAM   EP-STREAM  EP-DGEMM  Random Ring Bandwidth  Random Ring Latency  % HPL Peak(*)
                    TFLOP/s   GB/s      GFLOP/s   GUP/s     GB/s       GB/s       GFLOP/s   GB/s                   usec                 Percent
128    256   1024   11.3428   59.5363   225.7700  2.2630    3700.060   3.6133     12.0419   0.0852                 19.6622              94.513
84     168   672    7.3097    65.9453   135.6070  1.7364    2438.271   3.6284     11.7255   0.0968                 18.1360              92.812
32     64    512    2.8210    21.1257   70.3138   0.0507    865.193    3.3797     12.0083   0.1264                 15.0372              94.022

(*) Calculated using the base frequency of the processor (Intel Turbo Boost technology was enabled).
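The % HPL Peak column is the measured G-HPL rate divided by the theoretical peak. As a rough sketch, the peak implied by a table row can be recovered by inverting that ratio. The function name implied_peak_tflops is made up for illustration, and the per-core figure assumes that all 1024 cores listed for the 128-node India run took part in it.

```python
def implied_peak_tflops(g_hpl_tflops: float, pct_of_peak: float) -> float:
    """Theoretical peak implied by a measured HPL rate and its % of peak."""
    return g_hpl_tflops / (pct_of_peak / 100.0)


# Values taken from the 128-node India row of the results table.
peak = implied_peak_tflops(11.3428, 94.513)

# Assumption: all 1024 cores listed in that row were used for the run.
per_core_gflops = peak * 1000.0 / 1024

print(f"implied peak: {peak:.2f} TFLOP/s")
print(f"per core:     {per_core_gflops:.2f} GFLOP/s")
```

The same function can be applied to any other row to compare the implied peaks across systems.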

Hotel

Nodes  CPUs  Cores  G-HPL     G-PTRANS  G-FFTE    G-Random  G-STREAM   EP-STREAM  EP-DGEMM  Random Ring Bandwidth  Random Ring Latency  % HPL Peak(*)
                    TFLOP/s   GB/s      GFLOP/s   GUP/s     GB/s       GB/s       GFLOP/s   GB/s                   usec                 Percent
84     168   672    6.9401    83.3561   158.4620  2.3839    2512.2854  3.7385     11.2254   0.1280                 6.1591               90.36
32     64    256    2.6360    26.8054   95.6900   1.4378    959.4496   3.7479     11.2031   0.1502                 5.5892               90.09

XRay

Nodes  CPUs  Cores  G-HPL     G-PTRANS  G-FFTE    G-Random  G-STREAM   EP-STREAM  EP-DGEMM  Random Ring Bandwidth  Random Ring Latency  % HPL Peak(*)
                    TFLOP/s   GB/s      GFLOP/s   GUP/s     GB/s       GB/s       GFLOP/s   GB/s                   usec                 Percent
84     168   672    5.38      29.27     75.47     0.62      2017       3.00       8.93      0.047                  26.8                 83.40

HPCC Configuration

General Information

The HPCC version used on each system and how it was compiled.

India

  • HPCC version: 1.3.1
  • Compiler: Intel compiler 11.1.038 with OpenMPI-1.3.1.
  • Math library: Intel MKL.
  • A few important settings in Makefile:
    • CC=mpicc
    • LINKER=mpicc -mkl

Hotel

  • HPCC version: 1.3.1
  • Compiler: Intel compiler 11.1.038 with Intel MPI-4.0.0.
  • Math library: Intel MKL.
  • A few important settings in Makefile:
    • CC=mpicc
    • LINKER=mpicc -mkl

XRay

  • HPCC version: 1.3.1
  • Compiler: Cray compiler (xt-asyncpe-3.4)
  • A few important settings in Makefile:
    • CC=cc
    • CCNOOPT=$(HPL_DEFS) -DLONG_IS_64BITS
    • CCFLAGS=$(HPL_DEFS) -fast -Minfo=loop -Mneginfo=loop -DLONG_IS_64BITS -DFFTE_NP=4
    • CCFLAGS_STREAM=$(HPL_DEFS) -Mnontemporal -Mprefetch=distance:8,nta -Msafeptr -fastsse -Minfo=loop -Mneginfo=loop -DLONG_IS_64BITS

Configuration Files

Below are the contents of the HPCC input file, hpccinf.txt, used on each system.

India

HPLinpack benchmark input file
Innovative Computing Laboratory, University of Tennessee
HPL.out output file name (if any)
8 device out (6=stdout,7=stderr,file)
1 # of problems sizes (N)
409600 Ns
1 # of NBs
112 NBs
0 PMAP process mapping (0=Row-,1=Column-major)
1 # of process grids (P x Q)
32 Ps
32 Qs
16.0 threshold
1 # of panel fact
2 PFACTs (0=left, 1=Crout, 2=Right)
1 # of recursive stopping criterium
4 NBMINs (>= 1)
1 # of panels in recursion
3 NDIVs
1 # of recursive panel fact.
2 RFACTs (0=left, 1=Crout, 2=Right)
1 # of broadcast
3 BCASTs (0=1rg,1=1rM,2=2rg,3=2rM,4=Lng,5=LnM)
1 # of lookahead depth
0 DEPTHs (>=0)
2 SWAP (0=bin-exch,1=long,2=mix)
64 swapping threshold
0 L1 in (0=transposed,1=no-transposed) form
0 U in (0=transposed,1=no-transposed) form
1 Equilibration (0=no,1=yes)
16 memory alignment in double (> 0)
##### This line (no. 32) is ignored (it serves as a separator). ######
0 Number of additional problem sizes for PTRANS
1200 10000 30000 values of N
0 number of additional blocking sizes for PTRANS
40 9 8 13 13 20 16 32 64 values of NB
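To relate this input file to the machine, the following sketch estimates the memory taken by the HPL matrix (N x N double-precision values) and the number of MPI processes implied by the P x Q grid, using the India values above. The helper name hpl_footprint is hypothetical, and the estimate ignores HPL's workspace and the memory used by the other HPCC tests.

```python
def hpl_footprint(n: int, p: int, q: int) -> dict:
    """Estimate matrix memory for an N x N HPL run on a P x Q process grid."""
    bytes_total = 8 * n * n      # one 8-byte double per matrix entry
    procs = p * q                # one MPI process per grid position
    gib = 1024 ** 3
    return {
        "processes": procs,
        "total_gib": bytes_total / gib,
        "per_process_gib": bytes_total / gib / procs,
    }


# Ns, Ps and Qs from the India input file above.
india = hpl_footprint(n=409600, p=32, q=32)
print(india)
```

For India the 32 x 32 grid gives 1024 processes, which matches the core count of the 128-node run in the results table.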

 

Hotel

HPLinpack benchmark input file
Innovative Computing Laboratory, University of Tennessee
HPL.out output file name (if any)
8 device out (6=stdout,7=stderr,file)
1 # of problems sizes (N)
338688 Ns
1 # of NBs
112 NBs
0 PMAP process mapping (0=Row-,1=Column-major)
1 # of process grids (P x Q)
32 Ps
32 Qs
16.0 threshold
1 # of panel fact
2 PFACTs (0=left, 1=Crout, 2=Right)
1 # of recursive stopping criterium
4 NBMINs (>= 1)
1 # of panels in recursion
3 NDIVs
1 # of recursive panel fact.
2 RFACTs (0=left, 1=Crout, 2=Right)
1 # of broadcast
3 BCASTs (0=1rg,1=1rM,2=2rg,3=2rM,4=Lng,5=LnM)
1 # of lookahead depth
0 DEPTHs (>=0)
2 SWAP (0=bin-exch,1=long,2=mix)
64 swapping threshold
0 L1 in (0=transposed,1=no-transposed) form
0 U in (0=transposed,1=no-transposed) form
1 Equilibration (0=no,1=yes)
16 memory alignment in double (> 0)
##### This line (no. 32) is ignored (it serves as a separator). ######
0 Number of additional problem sizes for PTRANS
1200 10000 30000 values of N
0 number of additional blocking sizes for PTRANS
40 9 8 13 13 20 16 32 64 values of NB

XRay

HPLinpack benchmark input file
Innovative Computing Laboratory, University of Tennessee
HPL.out output file name (if any)
6 device out (6=stdout,7=stderr,file)
1 # of problems sizes (N)
373056 Ns
1 # of NBs
232 NBs
0 PMAP process mapping (0=Row-,1=Column-major)
1 # of process grids (P x Q)
24 Ps
28 Qs
16.0 threshold
1 # of panel fact
1 PFACTs (0=left, 1=Crout, 2=Right)
1 # of recursive stopping criterium
4 NBMINs (>= 1)
1 # of panels in recursion
2 NDIVs
1 # of recursive panel fact.
2 RFACTs (0=left, 1=Crout, 2=Right)
1 # of broadcast
1 BCASTs (0=1rg,1=1rM,2=2rg,3=2rM,4=Lng,5=LnM)
1 # of lookahead depth
0 DEPTHs (>=0)
2 SWAP (0=bin-exch,1=long,2=mix)
64 swapping threshold
0 L1 in (0=transposed,1=no-transposed) form
0 U in (0=transposed,1=no-transposed) form
1 Equilibration (0=no,1=yes)
8 memory alignment in double (> 0)
##### This line (no. 32) is ignored (it serves as a separator). ######
0 Number of additional problem sizes for PTRANS
1200 values of N
4 number of additional blocking sizes for PTRANS
23 31 33 63 values of NB
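The key parameters can be pulled out of such a file with a few lines of code. The sketch below is not HPL's own parser. It assumes every line starts with one or more integer values followed by a label, as in the listings above, and the function name parse_hpl_fields is made up for illustration. It is applied here to four lines copied from the XRay file.

```python
def parse_hpl_fields(text: str) -> dict:
    """Pull Ns, NBs, Ps and Qs out of 'value(s) label' lines of an HPL input file."""
    wanted = {"Ns", "NBs", "Ps", "Qs"}
    fields = {}
    for line in text.splitlines():
        values = []
        for tok in line.split():
            if tok.isdigit():
                values.append(int(tok))      # leading integers are the values
            else:
                # The first non-integer token is treated as the label.
                if tok in wanted and tok not in fields:
                    fields[tok] = values
                break
    return fields


# Lines copied from the XRay input file above.
sample = """373056 Ns
232 NBs
24 Ps
28 Qs"""

cfg = parse_hpl_fields(sample)
n, nb = cfg["Ns"][0], cfg["NBs"][0]
p, q = cfg["Ps"][0], cfg["Qs"][0]

print("processes in grid:", p * q)
print("full blocks per dimension:", n // nb, "remainder:", n % nb)
```

For XRay the grid size works out to 672, the core count listed in the results table, and N is an exact multiple of NB.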

HPLinpack benchmark input file
Innovative Computing Laboratory, University of Tennessee
HPL.out output file name (if any)
8 device out (6=stdout,7=stderr,file)