Xref: utzoo comp.unix.questions:24392 comp.sys.sequent:681 Path: utzoo!utgpu!news-server.csri.toronto.edu!clyde.concordia.ca!uunet!mcsun!ukc!icdoc!syma!william From: william@syma.sussex.ac.uk (William Craven) Newsgroups: comp.unix.questions,comp.sys.sequent Subject: Checkpoints for large jobs Keywords: checkpoint interrupt signal Message-ID: <3193@syma.sussex.ac.uk> Date: 6 Aug 90 16:06:18 GMT Organization: University of Sussex Lines: 23 We have a large number of long running background jobs on the Sequent Symmetry at Sussex University. Over the last few weeks we were having to frequently reboot the system. This of course killed the long running jobs and hence the users had to rerun their jobs. With the frequent rebooting it was beginning to annoy these users very much as they had to keep on rerunning their jobs and hence delaying getting their results. Because of this I was wondering whether there is a system which will allow a job to start off from when it last killed either by means of checkpointing or setjmp/longjmp. If there is such a scheme I would be grateful for pointers. As a side issue - can one reload the core file into a process ? If so how. Many thanks, William Craven UNIX Systems, william@syma.sussex.ac.uk Computing Service +44-273-606755 ext 2970 University of Sussex Brighton, BN1 9QJ United Kingdom