Hi Jaime,

I had a similar issue where I fell foul of the torque multiple staging "bug". I saw some error messages in /var/log/messages on the worker node where it said something about scp: file not found.

Do you see any errors like this in /var/log/messages on the worker node?

Sean

On 17 November 2012 03:06, Jaime Ibar <[log in to unmask]> wrote:
Hi all,
we have a weird issue in our cream ce service, if I submit a job the it
gets into the
worker node, in pbs appears as running, the wms marks the job as running
also but
it stays running forever.
I logged in  as normal user in the worker node and I see that the
executable file is missing.
For instance, if I have this in my jdl

...
Executable = "stest.sh";
...

this file is not copied to the worker node and the job stay running forever.
I didn't any hint in cream or woker node logs, all services are running ok,
I tried to disable iptables but the problem is still there.

Does anyone had this error?

--
Jaime Ibar

Institute for Biocomputation and Physics of Complex Systems
University of Zaragoza (BIFI)
-------
Instituto de Biocomputación y Física de Sistemas Complejos
de la Universidad de Zaragoza (BIFI)
-------

e-mail: jibar(at)bifi.es
phone: (+34)876555405
C/ Mariano Esquillor s/n, Edificio I+D
50018 Zaragoza
España





--
Sean Crosby
Research Computing System Administrator and Developer
ARC Centre of Excellence for Particle Physics at the Terascale
School of Physics | University of Melbourne Vic 3010
T: +61 3 8344 8093