Hi Tomas,
> [...]
> ---
> Event: Done
> - Arrived = Wed Jun 18 11:48:10 2008 CEST
> - Exit code = 0
> - Host = skurut10-2.egee.cesnet.cz
> - Source = LRMS
> - Status code = OK
> - Timestamp = Wed Jun 18 11:48:10 2008 CEST
So, the job is reported as Done by the job wrapper itself,
but the LogMonitor daemon on the WMS does not see that state
reported by the grid_monitor running on the CE.
This could have various causes. For example, the batch system
may keep reporting the job as running, even after it finished.
I see that your Torque configuration keeps completed jobs listed
in the 'C' state: for how long do they stay there?
The "pbs" and "lcgpbs" job managers simply ignore that state
and wait for the job to disappear from the "qstat" output:
did you change the "lcgpbs" job manager in that respect?
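
To make the difference concrete, here is a rough Python sketch of the two
ways of deciding that a job has finished. It is not the job manager code;
the function names and the "treat_c_as_done" switch are made up for
illustration, and I assume the default six-column "qstat" listing (job id,
name, user, time use, state, queue).

```python
def job_state(qstat_output, job_id):
    """Return the one-letter state of job_id, or None if it is not listed."""
    wanted = job_id.split(".")[0]          # compare on the numeric part only
    for line in qstat_output.splitlines():
        fields = line.split()
        # Assumed layout: id, name, user, time use, state, queue.
        if len(fields) >= 6 and fields[0].split(".")[0] == wanted:
            return fields[4]
    return None


def is_finished(qstat_output, job_id, treat_c_as_done):
    """Decide whether the job is over.

    treat_c_as_done=False: wait until the job disappears from the listing,
    which is the behaviour described above for "pbs" and "lcgpbs".
    treat_c_as_done=True: hypothetical variant that also accepts 'C'.
    """
    state = job_state(qstat_output, job_id)
    if state is None:
        return True
    return treat_c_as_done and state == "C"
```

With the first behaviour, a job sitting in the 'C' state still counts as
not finished until Torque drops it from the listing, so the Done state
would be seen late by that amount of time.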