On Wed, 19 Oct 2005, Stijn De Weirdt wrote:
> i would like to know what is going wrong here (ie where the perl script
> hangs on. i bet on the sleep($wait_time). the job has been like this for
> 12h+, giving it already 6 retries. where does the output of the
> log_something ends up anyway?) or what can be done about it. if you need
> more info, let me know.
Hi Stijn,
Apparently the user payload (ie. job) was running until about 11 o'clock
this morning (about 1.5 hours before you wrote), when its proxy ran out.
Therefore I don't believe it was sitting in exactly the state which you
found it in for 12+ hours, but it's true that now it is attempting to
return the job output (with retires) even though there is no possibility
that it will succeed. It will take several hours to fail. That is clearly
not good - I'll look at it straight away.
Thanks alot,
David
--
-------------------------------------------------------------------------
David Smith e-mail: [log in to unmask] tel: +41 22 76 74462
Address: D. Smith, CERN G06610, Bat 28 R-007, 1211 Geneva 23, Switzerland
-------------------------------------------------------------------------
|