On Thu, 3 May 2007 [log in to unmask] wrote: > On Thu, 3 May 2007, Alexander Piavka wrote: > > > Hi all, > > > > At IL-BGU site i have a shared Torque server between lcgCE gCE. > > I've noticed that all jobs running on a WN are idle , these jobs are all > > gCE jobs, if lcgCE is run it finishes ok. > > The reason to these gCE sleeping jobs is that they fail to globus-url-copy for > > globus_url_retry_copy function in JobWrapper script: > > globus_url_retry_copy "file://${workdir}/${f}" "${__output_base_url}${ff}" > > The ${f} is __output_file[1]="testjob-results.tgz" > > Meaning this is SAM job and it has finished all and now tries to passwd > > the result testjob-results.tgz back to WMS. > > All these jobs are trying to globus-url-copy rb108.cern.ch > > > > Can you try this from one of those WNs: > > globus-url-copy file:/etc/group gsiftp://rb108.cern.ch/tmp/foo.$$ $ globus-url-copy -vb file:/etc/group gsiftp://rb108.cern.ch/tmp/foo.$$ 11662 bytes 4.47 KB/sec avg 4.47 KB/sec inst > If it hangs, do these commands work from the WN: > > telnet rb108.cern.ch 2811 > > telnet rb108.cern.ch 22000 > $ telnet rb108.cern.ch 2811 Trying 128.142.161.221... Connected to rb108.cern.ch (128.142.161.221). Escape character is '^]'. 220 rb108.cern.ch GridFTP Server 1.12 GSSAPI type Globus/GSI wu-2.6.2 (gcc32dbg, 1109600000-42) ready. QUIT 221 Goodbye. Connection closed by foreign host. $ telnet rb108.cern.ch 22000 Trying 128.142.161.221... telnet: connect to address 128.142.161.221: Connection refused telnet: Unable to connect to remote host: Connection refused > The last command should return "Connection refused". So globus-url-copy does not hang but last command should returns "Connection refused". Alex