On Thu, Feb 16, 2006 at 07:33:31PM +0100 or thereabouts, Guillermo Losilla Anadón wrote:
> Dear *,
>
> We are facing a strange problem after installing an RB at BIFI
> (rb-egee.bifi.unizar.es). It is configured to support DTEAM and the newly
> created FUSION VO among others.
> When submitting a job using a DTEAM certificate everything works OK (the job
> reaches the "Done" status). However, it fails when submitting jobs using a
> FUSION VO certificate:
Hi Guillermo,
Do you have a pointer to the fusion VO? There are some people in
the UK down the road working at JET (Joint European Torus) who
may want to join. You may already be in contact with them?
Steve
>
> edg-job-status https://rb-egee.bifi.unizar.es:9000/MZjHtYxQ_hiaBiezOvEh3A
>
> *************************************************************
> BOOKKEEPING INFORMATION:
>
> Status info for the Job : https://rb-egee.bifi.unizar.es:9000/MZjHtYxQ_hiaBiezOvEh3A
> Current Status: Ready
> Status Reason: 37 the provided RSL 'queue' parameter is invalid
> Destination: ce-egee.bifi.unizar.es:2119/jobmanager-lcgpbs-fusion
> reached on: Thu Feb 16 18:23:52 2006
> *************************************************************
>
> It ends up aborted after 3 attempts:
>
> edg-job-status https://rb-egee.bifi.unizar.es:9000/MZjHtYxQ_hiaBiezOvEh3A
>
> *************************************************************
> BOOKKEEPING INFORMATION:
>
> Status info for the Job : https://rb-egee.bifi.unizar.es:9000/MZjHtYxQ_hiaBiezOvEh3A
> Current Status: Aborted
> Status Reason: Job RetryCount (3) hit
> ...
>
> On the other hand, I checked that the "fusion" queue actually exists on the CE:
>
> [root@ce-egee root]# qstat -q
>
> server: ce-egee.bifi.unizar.es
>
> Queue            Memory CPU Time Walltime Node  Run Que Lm  State
> ---------------- ------ -------- -------- ----  --- --- --  -----
> atlas              --   48:00:00 72:00:00   --    1   0 --    E R
> lhcb               --   48:00:00 72:00:00   --    0   0 --    E R
> dteam              --   48:00:00 72:00:00   --    0   0 --    E R
> swetest            --   48:00:00 72:00:00   --    0   0 --    E R
> fusion             --   48:00:00 72:00:00   --    0   0 --    E R
>                                                 --- ---
>                                                   1   0
>
> More importantly, this simple execution works fine:
> [jmrb2002@ui-egee jmrb2002]$ globus-job-run ce-egee.bifi.unizar.es /bin/pwd
> /home/fusion001
>
> Hence the problem should be in the RB. I feel there is something missing in the
> RB configuration...
>
> Could anyone help, please?
> regards,
> Guillermo
>
>
> --
> Guillermo Losilla Anadón
> Instituto de Biocomputación y Física de Sistemas Complejos
> de la Universidad de Zaragoza (BIFI)
> http://bifi.unizar.es/~guillermo/
> e-mail: [log in to unmask]
> phone: (+34)976562212 ext.224
--
Steve Traylen
[log in to unmask]
http://www.gridpp.ac.uk/