On Thu, 3 May 2007, Alexander Piavka wrote:
> Hi all,
>
> At IL-BGU site i have a shared Torque server between lcgCE gCE.
> I've noticed that all jobs running on a WN are idle , these jobs are all
> gCE jobs, if lcgCE is run it finishes ok.
> The reason to these gCE sleeping jobs is that they fail to globus-url-copy for
> globus_url_retry_copy function in JobWrapper script:
> globus_url_retry_copy "file://${workdir}/${f}" "${__output_base_url}${ff}"
> The ${f} is __output_file[1]="testjob-results.tgz"
> Meaning this is SAM job and it has finished all and now tries to passwd
> the result testjob-results.tgz back to WMS.
> All these jobs are trying to globus-url-copy rb108.cern.ch
Can you try this from one of those WNs:
globus-url-copy file:/etc/group gsiftp://rb108.cern.ch/tmp/foo.$$
If it hangs, do these commands work from the WN:
telnet rb108.cern.ch 2811
telnet rb108.cern.ch 22000
The last command should return "Connection refused".
|