Implementation of HPF Library
As with intrinsic routines, the onus is on the implementor to use the best algorithm
- Reductions and related operations use communication trees of depth log2 P (P = number of processors)
Early compilers called run-time libraries; some now inline the library routines
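
The log2 P depth quoted for reductions can be illustrated with a small simulation. This is only a sketch, not how any particular HPF run-time library is written: the function name tree_reduce is hypothetical, each list element stands in for the partial result held by one processor, and the inner loop stands in for messages that would be sent in parallel within one round.

```python
import operator


def tree_reduce(values, op):
    """Pairwise tree reduction over one value per simulated processor.

    Returns (result, rounds). `rounds` counts communication steps, which is
    the depth of the tree. Hypothetical helper for illustration only.
    """
    if not values:
        raise ValueError("need at least one value")
    vals = list(values)
    p = len(vals)
    stride = 1
    rounds = 0
    while stride < p:
        # In this round, processor i combines its value with the one held
        # by processor i + stride; all pairs would communicate in parallel.
        for i in range(0, p - stride, 2 * stride):
            vals[i] = op(vals[i], vals[i + stride])
        stride *= 2
        rounds += 1
    return vals[0], rounds


# Eight simulated processors holding 1..8: the sum arrives in 3 rounds.
total, rounds = tree_reduce(list(range(1, 9)), operator.add)
print(total, rounds)  # 36 3
```

Operands are always combined left to right, so the sketch also works for operators that are associative but not commutative. When P is not a power of two, the number of rounds is log2 P rounded up.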