1 |
Alternatively we could send even members of fa to b and odd indexed entries in fb to a.
|
2 | Communication overhead fcomm = Tparallel *Nproc/Tsequential -1 is now given by |
3 | |
4 | We have avoided load imbalance and halved the communication |
5 | compared to simple algorithm. In later foils we will find even better |
6 | methods that get rid of log2N term |