O(N2) Algorithms are extremely efficient on parallel machines as
-
Load balancing guaranteed with equal numbers of particles in each node
-
Communication overhead goes like 1/(grain size n = number of particles in node) . Typical time to communicate word/ typical time to do floating computation) ~ 0.25(from algorithm details) . 10( = tcomm/tfloat)/n
-
So very efficient along as around 50 particles or more per processor
|