1 | In MPI, this is a single call |
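2 | CALL MPI_ALLREDUCE (MPI_IN_PLACE,TEST,1,MPI_REAL,MPI_MAX,comm,ierr) |
3 | The MPI_MAX operation specifies a global maximum; MPI_IN_PLACE lets TEST hold both the input and the result, because MPI does not allow the same variable to be passed as both send and receive buffer; ierr is the integer error code that the Fortran binding requires |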
4 | The implementation is quite subtle and usually involves a logarithmic tree, so P processors need about log2(P) communication steps rather than P |
5 | A clever extension delivers the maximum to all processors in the "same" time it takes to deliver it to a single processor, on hypercubes and other architectures |
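The two ideas above can be compared with a small single-process simulation. This is only a sketch, not MPI code and not how any particular MPI library implements MPI_ALLREDUCE. The source does not name the "clever extension"; the sketch assumes it is the recursive-doubling (butterfly) exchange, in which each rank swaps its partial maximum with the rank whose number differs in one bit, as along one dimension of a hypercube. The function names `tree_reduce_max` and `butterfly_allreduce_max` are hypothetical, and the list index stands in for the processor rank.

```python
def tree_reduce_max(values):
    """Binary-tree reduction: rank 0 ends up holding the global maximum.

    Returns (maximum, number_of_communication_steps).
    """
    vals = list(values)
    p = len(vals)
    steps = 0
    stride = 1
    while stride < p:
        # In this round, every rank that is a multiple of 2*stride
        # receives the partial maximum held by rank + stride.
        for rank in range(0, p, 2 * stride):
            partner = rank + stride
            if partner < p:
                vals[rank] = max(vals[rank], vals[partner])
        stride *= 2
        steps += 1
    return vals[0], steps


def butterfly_allreduce_max(values):
    """Recursive doubling: every rank ends up holding the global maximum.

    Assumes the number of ranks is a power of two (as on a hypercube).
    Returns (list_of_per_rank_results, number_of_communication_steps).
    """
    vals = list(values)
    p = len(vals)
    if p == 0 or (p & (p - 1)) != 0:
        raise ValueError("this sketch assumes a power-of-two number of ranks")
    steps = 0
    bit = 1
    while bit < p:
        # All pairs exchange at the same time: rank r trades its partial
        # maximum with rank r XOR bit, and both keep the larger value.
        vals = [max(vals[r], vals[r ^ bit]) for r in range(p)]
        bit <<= 1
        steps += 1
    return vals, steps


# Illustrative data: one local TEST value per rank, 8 ranks.
local_test = [3.0, 7.5, 1.2, 9.1, 4.4, 0.3, 8.8, 2.6]

root_max, tree_steps = tree_reduce_max(local_test)
all_max, butterfly_steps = butterfly_allreduce_max(local_test)

print("tree reduce:", root_max, "on rank 0 after", tree_steps, "steps")
print("butterfly  :", all_max[0], "on every rank after", butterfly_steps, "steps")
```

With 8 ranks both routines finish in 3 steps, but the tree leaves the result on rank 0 only, and a tree broadcast to the other ranks would cost about 3 more steps. The butterfly leaves the result on every rank in the same 3 steps, which is the sense in which the all-processor result comes in the "same" time as the one-processor result.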