Hi Felix,
> I have torque/maui.
>
> All jobs that are not online are in the W status. And all jobs are
> trying to reach the next functional station.
>
> I have 66 WN's, 66,65,64,63 are full with jobs, 62 is not functioning
> but is online. The job is stuck in the W status for nr 62. And stays
> there. Isn't is possible for the job to jump tu the next station? Does
> it has to stay in W status only because WN is started?
>
> Is there any form of jumping this station and moving to another one?
It has often been reported that Torque/Maui can get stuck when a single WN
gets into a bad state. A brute-force recipe to deal with that:
1. remove the WN from /var/spool/pbs/server_priv/nodes
2. remove the corresponding jobs from /var/spool/pbs/server_priv/jobs
3. restart the PBS/Torque daemons
|