1 | Fault Tolerance |
2 | The master scheduler is also tasked with the responsibility of |
3 | ensuring that jobs complete successfully. |
4 | It does this by monitoring jobs until they successfully finish. |
5 | If a job fails, due to problems other than an application runtime |
6 | error, it will reschedule the job to run again. |