Hello all,
One of our site's CE PBS server seems to be stalled. Lots of jobs are in
the "E" state for too long; actually they stay stalled that way.
Whereas, most of the jobs are in the Q state with a message like "Not
Running: Draining system to allow starving job to run". I'm afraid to
kill any of the jobs. What would be the correct procedure for
correcting this? I restarted the pbs_server process after restarting
pbs_mom from one of the working nodes that was executing the "E" state
job, but didn't work.
Any help?
Thanks,
./MS
-------------------------------------
Maniel Sotomayor
Software Designer
Hewlett-Packard Technology Center, PR
+1(787)819-7673
|