Now we decompose our 16 points (a trivial example) into four groups and put 4 points on each processor |
Instead of dimensioning PHI(4) in each processor, one dimensions PHI(6) and runs loops from 2 to 5; the end-points PHI(1) and PHI(6) are set either from boundary values or by communication with the neighboring processor |
Sequential: |
Parallel: |
PHI(6) for Processor 1 |
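
The decomposition above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the original course code: the four processors are simulated as four lists in one process, the update is assumed to be a simple nearest-neighbor averaging (Jacobi) sweep, and the boundary values `LEFT_BC` and `RIGHT_BC` and all function names are hypothetical. Python lists are 0-based, so the Fortran-style PHI(1..6) becomes indices 0..5 and the loop from 2 to 5 becomes indices 1..4.

```python
NPROC = 4                      # number of simulated processors
NLOC = 4                       # points owned by each processor
LEFT_BC, RIGHT_BC = 0.0, 1.0   # assumed fixed boundary values (hypothetical)


def sweep(phi):
    """One averaging sweep over every point except the two end-points."""
    new = phi[:]
    for i in range(1, len(phi) - 1):
        new[i] = 0.5 * (phi[i - 1] + phi[i + 1])
    return new


def scatter(phi):
    """Split 16 points (plus 2 boundary values) into four PHI(6)-style blocks."""
    return [phi[p * NLOC: p * NLOC + NLOC + 2] for p in range(NPROC)]


def exchange_end_points(blocks):
    """Set each block's end-points from a boundary value or from a neighbor.

    Stands in for the communication step; only owned points (indices 1..NLOC)
    are read, and only end-points (indices 0 and NLOC + 1) are written.
    """
    for p, blk in enumerate(blocks):
        blk[0] = LEFT_BC if p == 0 else blocks[p - 1][NLOC]
        blk[NLOC + 1] = RIGHT_BC if p == NPROC - 1 else blocks[p + 1][1]


def parallel_sweep(blocks):
    """Refresh end-points, then let each block update its own 4 points."""
    exchange_end_points(blocks)
    return [sweep(blk) for blk in blocks]


def gather(blocks):
    """Reassemble the owned points of all blocks into one 18-element list."""
    owned = [v for blk in blocks for v in blk[1:NLOC + 1]]
    return [LEFT_BC] + owned + [RIGHT_BC]
```

Starting both versions from the same 18-element list (16 points plus the two boundary values) and applying `sweep` to the full list and `parallel_sweep` to the scattered blocks the same number of times should give identical results after `gather`, because each block sees exactly the neighbor values that the sequential loop would have used.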