Print

Print


Hi, Dani,

Many thanks for your reply. And I would like have the script  you
meitioned.  Thanks again.

BTW, do you use maui?

Regards,

Wei

Daniel Cano wrote:

> Hello,
> I had the same problem, and the one with ssh is still there, but I think
> it doesn't affect. Looks like if the where trying to exchange keys and
> they can't, because accesing by ssh from another network don't give this
> messages.
>
> The solution for the pbs is a bit more tricky. What I did is creating
> new queues from scratch and therefore removing those created by yaim. I
> used a script to do so, if you want it contact me. Anyway, with this
> trick everything seems to work fine.
>
> Cheers
>
> Dani
>
> Wei Xing wrote:
>
>> Hi, all,
>>
>> I have installed SL303+LCG2_3_0 by Yaim.
>>
>> I used lcg-CE-torque and lcg-WN-torque package  in  my CE  and  WN.
>> Now  there are  two problems:
>>
>> 1, ssh from CE to WN, and from WN to CE, I got error message "Server
>> GSSAPI Error" :
>>
>> a) From CE to WN
>> ====================================
>> [root@ce101 root]# ssh wn101
>> Server GSSAPI Error:
>> Miscellaneous failure
>> No such file or directory
>>
>> root@wn101's password:
>> ====================================
>> b) From WN to CE
>> ==============================
>> [root@wn101 root]# ssh ce101
>> Server GSSAPI Error:
>> Miscellaneous failure
>> No such file or directory
>>
>> root@ce101's password:
>> =============================
>>
>> Not sure if it is because of /etc/ssh/sshd_config? Since the
>> /etc/ssh/sshd_config (both in CE and WNs)  are generated by Yaim, it
>> should be a standard config file? Do I need modify it?
>>
>> 2, The pbs job manager does not work properly (see the following output
>> by qstat). Using qsub to submit a job to short queue, the job stayed in
>> the queue forever, but does not execute. I checked maui,  it works fine.
>> Any idea about it? is it related with ssh problem?
>>
>> Thanks in advance!
>>
>> Regards
>>
>> Wei Xing
>>
>>
>>
>>
>> ===============================================================
>> [dteam001@ce101 dteam001]$ qstat -a
>>
>> ce101.grid.ucy.ac.cy:
>>                                                            Req'd
>> Req'd   Elap
>> Job ID          Username Queue    Jobname    SessID NDS TSK Memory Time
>> S Time
>> --------------- -------- -------- ---------- ------ --- --- ------ -----
>> - -----
>> 3.ce101.grid.uc dteam001 short    test.sh       --   --  --    --  00:15
>> Q   --
>>
>> ==============================================================
>> [dteam001@ce101 dteam001]$ qstat -f 3.ce101.grid.ucy.ac.cy
>> Job Id: 3.ce101.grid.ucy.ac.cy
>>    Job_Name = test.sh
>>    Job_Owner = [log in to unmask]
>>    job_state = Q
>>    queue = short
>>    server = ce101.grid.ucy.ac.cy
>>    Checkpoint = u
>>    ctime = Wed Feb  9 12:09:33 2005
>>    Error_Path = ce101.grid.ucy.ac.cy:/home/dteam001/test.sh.e3
>>    exec_host = wn107.grid.ucy.ac.cy/0
>>    Hold_Types = n
>>    Join_Path = n
>>    Keep_Files = n
>>    Mail_Points = a
>>    mtime = Wed Feb  9 12:11:00 2005
>>    Output_Path = ce101.grid.ucy.ac.cy:/home/dteam001/test.sh.o3
>>    Priority = 0
>>    qtime = Wed Feb  9 12:09:33 2005
>>    Rerunable = True
>>    Resource_List.cput = 00:15:00
>>    Resource_List.walltime = 02:00:00
>>    Variable_List = PBS_O_HOME=/home/dteam001,PBS_O_LANG=en_US.UTF-8,
>>        PBS_O_LOGNAME=dteam001,
>>
>> PBS_O_PATH=/usr/java/j2sdk1.4.2_04/bin:/opt/globus/bin:/opt/globus/sbi
>>
>> n:/opt/edg/bin:/usr/local/bin:/usr/java/j2sdk1.4.2_04/bin:_undefined_/b
>>
>> in:/usr/kerberos/sbin:/usr/kerberos/bin:/usr/local/bin:/usr/local/sbin:
>>
>> /usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin:/opt/gpt/sb
>>
>> in:/usr/java/j2sdk1.4.2_04/bin:/opt/d-cache-client/bin:/root/bin:/opt/g
>>
>> pt/sbin:/usr/java/j2sdk1.4.2_04/bin:/opt/d-cache-client/bin:/opt/edg/bi
>>        n:/opt/edg/sbin:/opt/edg/bin:/opt/edg/sbin,
>>        PBS_O_MAIL=/var/spool/mail/root,PBS_O_SHELL=/bin/bash,
>>        PBS_O_HOST=ce101.grid.ucy.ac.cy,PBS_O_WORKDIR=/home/dteam001,
>>        PBS_O_QUEUE=short
>>    etime = Wed Feb  9 12:09:33 2005
>>
>> --
>> ============================================================
>> Wei Xing, M.Sc.
>> Research Associate                    Tel: 00357-22892663
>> Dept. of Computer Science             Fax: 00357-22892701
>> University of Cyprus                  email: [log in to unmask]
>> PO Box 20537
>> CY1678, Nicosia, CYPRUS
>


--
============================================================
Wei Xing, M.Sc.
Research Associate                    Tel: 00357-22892663
Dept. of Computer Science             Fax: 00357-22892701
University of Cyprus                  email: [log in to unmask]
PO Box 20537
CY1678, Nicosia, CYPRUS