Basic HTML version of Foils prepared May 7 1996

Foil 28 The Workings of Typical Cluster Management Software - 5

From MetaComputing -- MRA Meeting Part II:The Practical Issues Tutorial for CRPC MRA Meeting at Cornell -- May 7 1996. by Mark Baker, Geoffrey Fox


1 Fault Tolerance
2 The master scheduler is also tasked with the responsibility of
3 ensuring that jobs complete successfully.
4 It does this by monitoring jobs until they successfully finish.
5 If a job fails, due to problems other than an application runtime
6 error, it will reschedule the job to run again.

in Table To:


Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu

If you have any comments about this server, send e-mail to webmaster@npac.syr.edu.

Page produced by wwwfoil on Sun Apr 11 1999