Print

Print


On Thu, 3 May 2007 [log in to unmask] wrote:

> On Thu, 3 May 2007, Alexander Piavka wrote:
>
> >  Hi all,
> >
> > At IL-BGU site i have a shared Torque server between lcgCE gCE.
> > I've noticed that all jobs running on a WN are idle , these jobs are all
> > gCE jobs, if lcgCE is run it finishes ok.
> > The reason to these gCE sleeping jobs is that they fail to globus-url-copy for
> > globus_url_retry_copy function in JobWrapper script:
> >  globus_url_retry_copy "file://${workdir}/${f}" "${__output_base_url}${ff}"
> > The ${f} is __output_file[1]="testjob-results.tgz"
> > Meaning this is SAM job and it has finished all and now tries to passwd
> > the result testjob-results.tgz back to WMS.
> > All these jobs are trying to globus-url-copy rb108.cern.ch
>
>
>

> Can you try this from one of those WNs:
>
>     globus-url-copy file:/etc/group gsiftp://rb108.cern.ch/tmp/foo.$$

$ globus-url-copy -vb file:/etc/group gsiftp://rb108.cern.ch/tmp/foo.$$
           11662 bytes         4.47 KB/sec avg         4.47 KB/sec inst

> If it hangs, do these commands work from the WN:
>
>     telnet rb108.cern.ch 2811
>
>     telnet rb108.cern.ch 22000
>

$ telnet rb108.cern.ch 2811
Trying 128.142.161.221...
Connected to rb108.cern.ch (128.142.161.221).
Escape character is '^]'.
220 rb108.cern.ch GridFTP Server 1.12 GSSAPI type Globus/GSI wu-2.6.2
(gcc32dbg, 1109600000-42) ready.
QUIT
221 Goodbye.
Connection closed by foreign host.
$ telnet rb108.cern.ch 22000
Trying 128.142.161.221...
telnet: connect to address 128.142.161.221: Connection refused
telnet: Unable to connect to remote host: Connection refused

> The last command should return "Connection refused".

 So globus-url-copy does not hang but last command should returns
"Connection refused".

 Alex