Hi,
The last four mornings when arriving at work I've had reports that
jobs submitted through lcgrb01.gridpp.rl.ac.uk remain in a
"Running" state long after they have finished.
Restarting edg-wl-jc has the effect that the same jobs then immediately
move to the "Done" state.
The restart looks like this
Restarting JobController daemon(s)
Stopping JobController... [ OK ]
Stopping CondorG... [FAILED]
Starting JobController... [ OK ]
Starting CondorG... [ OK ]
I've put /var/edgwl/jobcontrol/log/events.log at
/afs/rl.ac.uk/user/t/traylens/events.log
All help appreciated.
Steve
--
Steve Traylen
[log in to unmask]
http://www.gridpp.ac.uk/
|