Hi,
I think that we have a problematic WN , I've remade the list of our ssh
keys (CE and WN's) and a different problem has appeared.
I have found some information related to this on the net.
The BDII wrong information I think that comes from a system update that
we shouldn't have done.
*************************************************************
BOOKKEEPING INFORMATION:
Status info for the Job : https://rb02.lip.pt:9000/PELtcDlKJt5FwhAq8mOHQg
Current Status: Done (Failed)
Exit code: 0
Status Reason: Cannot read JobWrapper output, both from Condor and
from Maradona.
Destination: mallarme.cnb.uam.es:2119/jobmanager-pbs-biomed
reached on: Mon Aug 6 13:04:03 2007
*************************************************************
******************************************************************+
http://grid-deployment.web.cern.ch/grid-deployment/eis/docs/Maradona
I'm going to try this.
Thank you.
Alessandro Paolini wrote:
> Hi German,
> it seems there is an ssh problem between CE and WNs
>
> $ globus-job-run mallarme.cnb.uam.es/jobmanager-pbs -queue dteam
> /bin/hostname
> Permission denied, please try again.
> Permission denied, please try again.
> Permission denied (publickey,password,keyboard-interactive).
>
> please look at this faq
> http://goc.grid.sinica.edu.tw/gocwiki/ssh_problem_from_WN_to_CE
>
> Looking at the bdii, your CE i spublishing a wrong information: it
> seems to me that your CE is an LCG-CE, but there is this information
> (org.glite.ce) that is peculiar of a glite-CE
>
> $ ldapsearch -x -H ldap://mallarme.cnb.uam.es:2170 -b
> mds-vo-name=CNB-LCG2,o=grid 'objectClass=GlueService' GlueServiceType
> version: 2
>
> #
> # filter: objectClass=GlueService
> # requesting: GlueServiceType
> #
>
> # mallarme.cnb.uam.es:2119, CNB-LCG2, grid
> dn:
> GlueServiceUniqueID=mallarme.cnb.uam.es:2119,mds-vo-name=CNB-LCG2,o=grid
> GlueServiceType: org.glite.ce
>
> # rimbaud.cnb.uam.es:2136, CNB-LCG2, grid
> dn:
> GlueServiceUniqueID=rimbaud.cnb.uam.es:2136,mds-vo-name=CNB-LCG2,o=grid
> GlueServiceType: gridice
>
> # search result
> search: 2
> result: 0 Success
>
> # numResponses: 3
> # numEntries: 2
>
> Could you correct it?
>
> Cheers,
> Alessandro
> IT-ROC
>
> Germán Carrera ha scritto:
>> Hello,
>>
>> We are trying to solve some problems that we have with our site.
>>
>> The error what we have when we try to run a job using one of the
>> different
>> queues of the vo's that we support is the following.
>> Some help or indication will be appreciated.
>>
>> Thank you, Germán
>>
>> *************************************************************
>> BOOKKEEPING INFORMATION:
>>
>> Status info for the Job :
>> https://rb02.lip.pt:9000/6IoT1UwRzFJOgncqWXNVrg
>> Current Status: Aborted
>> Status Reason: Job RetryCount (3) hit
>> Destination: mallarme.cnb.uam.es:2119/jobmanager-pbs-biomed
>> reached on: Mon Aug 6 10:07:14 2007
>> *************************************************************
>>
>>
>> Other logs: (this a job with my user "biomed025"),
>>
>> 8/6 12:01:42 JMI: completed script validation: job manager type is fork.
>> 8/6 12:01:42 JMI: in globus_gram_job_manager_poll()
>> 8/6 12:01:42 JMI: local stdout filename =
>> /home/biomed025/.globus/.gass_cache/local/md5/64/4b52b501a63945aee673f4c74e6466/md5/32/277395ac7a1b6d3d10cf468a01c2
>>
>> 39/data.
>> 8/6 12:01:42 JMI: local stderr filename = /dev/null.
>> 8/6 12:01:42 JMI: poll: seeking:
>> https://mallarme.cnb.uam.es:20036/6846/1186394257/
>> 8/6 12:01:42 JMI: poll_fast: ******** Failed to find
>> https://mallarme.cnb.uam.es/6846/1186394257/
>> 8/6 12:01:42 JMI: poll_fast: returning -1 = GLOBUS_FAILURE (try Perl
>> scripts)
>> 8/6 12:01:42 JMI: cmd = poll
>>
>> *****************************************************************
>>
>> Aug 6 12:25:51 mallarme GRAM gatekeeper[5271]: Authenticated globus
>> user:
>> /C=ES/O=DATAGRID-ES/O=CNB/CN=German Carrera Corraleche
>> Aug 6 12:25:51 mallarme GRAM gatekeeper[5271]: Requested service:
>> jobmanager-fork
>> Aug 6 12:25:51 mallarme GRAM gatekeeper[5271]: Authorized as local
>> user:
>> biomed025
>> Aug 6 12:25:51 mallarme GRAM gatekeeper[5271]: Authorized as local uid:
>> 19425
>> Aug 6 12:25:51 mallarme GRAM gatekeeper[5271]: and local
>> gid: 1090
>> Aug 6 12:25:51 mallarme GRAM gatekeeper[5271]:
>> "/C=ES/O=DATAGRID-ES/O=CNB/CN=German Carrera Corraleche" mapped to
>> biomed025 (19425/1090)
>> Aug 6 12:25:54 mallarme gridinfo[5198]: JMA 2007/08/06 12:25:54
>> GATEKEEPER_JM_ID 2007-08-06.12:25:46.0000026329.0000000803 for
>> /C=ES/O=DATAGRID-ES/O=CNB/CN=German Carrera Corraleche on 193.136.6.69
>> Aug 6 12:25:54 mallarme gridinfo[5198]: JMA 2007/08/06 12:25:54
>> GATEKEEPER_JM_ID 2007-08-06.12:25:46.0000026329.0000000803 mapped to
>> biomed025 (19425, 1090)
>> Aug 6 12:25:54 mallarme gridinfo[5198]: JMA 2007/08/06 12:25:54
>> GATEKEEPER_JM_ID 2007-08-06.12:25:46.0000026329.0000000803 has
>> GRAM_SCRIPT_JOB_ID 96256.mallarme.cnb.uam.es manager type pbs
>> Aug 6 12:25:54 mallarme gridinfo[5198]: JMA 2007/08/06 12:25:54
>> GATEKEEPER_JM_ID 2007-08-06.12:25:46.0000026329.0000000803 JM exiting
>> Aug 6 12:25:55 mallarme sshd(pam_unix)[5366]: session opened for user
>> biomed025 by (uid=0)
>> Aug 6 12:25:55 mallarme sshd(pam_unix)[5366]: session closed for user
>> biomed025
>> Aug 6 12:25:55 mallarme sshd(pam_unix)[5380]: session opened for user
>> biomed025 by (uid=0)
>> Aug 6 12:25:55 mallarme sshd(pam_unix)[5380]: session closed for user
>> biomed025
>>
>
>
|