1 | In the sequential code, a single processor updates in turn (16x16=256-60=196) internal points. Each update is j --> 0.25 *(j up + jdown + jleft + jright ) involves 4 floating point operations for each point |
2 | In the parallel case, each processor updates in turn the points for which it is responsible - this is "owner computes rule" |
3 | A corner processor updates nine points |
4 | (the small internal points) |
5 | A "general" central processor |
6 | updates sixteen points |