Dear all,
I encountered two problems with my CE (IA64).
1. When I submited jobs from UI with the command "edg-job-submit, the status
of all jobs were always "Ready" and the output of the command is as
following:
===================================="edg-job-submit and
edg-job-status"=============================================================
=======
[c2204] /home/horse > edg-job-submit --vo dteam -r
ce-lcg.sdg.ac.cn:2119/jobmanager-lcgpbs-dteam testJob.jd1
Selected Virtual Organisation name (from --vo option): dteam
Connecting to host lcg005.ihep.ac.cn, port 7772
Logging to host lcg005.ihep.ac.cn, port 9002
****************************************************************************
*****************
JOB SUBMIT OUTCOME
The job has been successfully submitted to the Network Server.
Use edg-job-status command to check job current status. Your job identifier
(edg_jobId) is:
- https://lcg005.ihep.ac.cn:9000/ndG9oYRRfRXbkAhivfDU1Q
****************************************************************************
*****************
[c2204] /home/horse > edg-job-status
https://lcg005.ihep.ac.cn:9000/ndG9oYRRfRXbkAhivfDU1Q
*************************************************************
BOOKKEEPING INFORMATION:
Status info for the Job :
https://lcg005.ihep.ac.cn:9000/ndG9oYRRfRXbkAhivfDU1Q
Current Status: Ready
Status Reason: unavailable
Destination: ce-lcg.sdg.ac.cn:2119/jobmanager-lcgpbs-dteam
reached on: Thu Nov 17 07:54:04 2005
*************************************************************
===========================================================================
======================================================
Following is contents of /var/log/globus-gatekeeper.log on CE regarding this
command.
------------------------------------------------------"/var/log/globus-gatek
eeper.conf of the command of
edg-job-submit"-------------------------------------------------------------
--------------------
[root@ce-lcg log]# more globus-gatekeeper.log
Notice: 6: /opt/edg/sbin/edg-gatekeeper pid=16995 starting at Thu Nov 17
15:10:02 2005
Notice: 6: GRAM contact:
ce-lcg.sdg.ac.cn:2119:/O=GRID-FR/C=CN/O=IHEP/OU=CC/CN=ce-lcg.sdg.ac.cn
Notice: 0: GATEKEEPER_ACCT_FD=6 (/var/log/globus-gatekeeper.log)
Notice: 6: Got connection 202.122.32.131 at Thu Nov 17 15:10:37 2005
Failed reading length 0
GSS authentication failure
globus_gss_assist token :3: read failure: Connection closed
Failure: GSS failed Major:01090000 Minor:00000000 Token:00000003
Failure: GSS failed Major:01090000 Minor:00000000 Token:00000003
----------------------------------------------------------------------------
----------------------------------------------------------------------------
----------------------------------------------------------------------------
2. When I tested the command of "globus-job-run" from UI, the
ce-lcg.sdg.ac.cn is still error! Just as following
============================================="globus-job-run"===============
==========================================================
[c2204] /home/horse > globus-job-run
ce-lcg.sdg.ac.cn:2119:/O=GRID-FR/C=CN/O=IHEP/OU=CC/CN=ce-lcg.sdg.ac.cn
/bin/ls GRAM Job submission failed because the gatekeeper failed to run the
job manager (error code 47)
===========================================================================
======================================================
Following is contents of /var/log/globus-gatekeeper.log on CE regarding this
command:
--------------------------------------------------------"/var/log/globus-gat
ekeeper.conf of the command of
globus-job-run"-------------------------------------------------------------
------------------
Notice: 6: Got connection 159.226.2.224 at Thu Nov 17 15:41:22 2005
Notice: 5: Trying to use delegated user proxy
Notice: 5: Authenticated globus user:
/O=GRID-FR/C=CN/O=IHEP/OU=CC/CN=Yongzheng [log in to unmask]
Notice: 0: GRID_SECURITY_HTTP_BODY_FD=8
Notice: 0: JOB_REPOSITORY_ID
2005-11-17.15:41:23.094585.0000016996.0000000050 (unique id used for Job
Repository)
Notice: 0: FORMAT: YYYY-MM-DD.hh:mm:ss.micros.pid.connection
Notice: 0: (Format: <date>.<time (with
microsecs)>.<pid>.<connection counter>)
Notice: 0: temporarily ALLOW empty credentials
Notice: 0: Using dlopen version of LCAS
Notice: 0: lcasmod_name = /opt/edg/lib/lcas/lcas.mod
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
LCAS 7: 2005-11-17.15:41:23.094585.0000016996.0000000050 : Initialization
LCAS version 1.1.22
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas.mod-lcas_init(): Reading LCAS database /opt/edg/etc/lcas/lcas.db
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
LCAS 5: 2005-11-17.15:41:23.094585.0000016996.0000000050 : LCAS
authorization request
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas.mod-lcas_get_fabric_authorization(): user is
/O=GRID-FR/C=CN/O=IHEP/OU=CC/CN
=Yongzheng [log in to unmask]
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas_userban.mod-plugin_confirm_authorization(): checking banned users in
/opt
/edg/etc/lcas/ban_users.db
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas.mod-lcas_get_fabric_authorization(): authorization granted by plugin
/opt/ed
g/lib/lcas/modules/lcas_userban.mod
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas_timeslots.mod-plugin_confirm_authorization(): Checking slot 1 out of 2
in
/opt/edg/etc/lcas/timeslots.db
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas_timeslots.mod-plugin_confirm_authorization(): Checking slot 2 out of 2
in
/opt/edg/etc/lcas/timeslots.db
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas_timeslots.mod-check_hour(): Hour (15:41:23) out of range:
(23:00:00)-
(24:00:00)
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas.mod-lcas_get_fabric_authorization(): authorization granted by plugin
/opt/ed
g/lib/lcas/modules/lcas_timeslots.mod
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas_plugin_example-plugin_confirm_authorization(): OK, what the heck, I'll
au
thorize Mr/Mrs /O=GRID-FR/C=CN/O=IHEP/OU=CC/CN=Yongzheng
[log in to unmask]
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas.mod-lcas_get_fabric_authorization(): authorization granted by plugin
/opt/ed
g/lib/lcas/modules/lcas_plugin_example.mod
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
lcas.mod-lcas_get_fabric_authorization(): succeeded
LCAS 0: 2005-11-17.15:41:23.094585.0000016996.0000000050 :
LCAS 7: 2005-11-17.15:41:23.094585.0000016996.0000000050 : Termination
LCAS
Notice: 0: temporarily ALLOW empty credentials
Notice: 0: Using dlopen version of LCMAPS
Notice: 0: lcmapsmod_name = /opt/edg/lib/lcmaps/lcmaps.mod
Notice: 0: dlopen error: /opt/edg/lib/lcmaps/lcmaps.mod: cannot open shared
object file: No such file or directory
Failure: Cannot open LCMAPS module of /opt/edg/lib/lcmaps/lcmaps.mod
Failure: Cannot open LCMAPS module of /opt/edg/lib/lcmaps/lcmaps.mod
----------------------------------------------------------------------------
----------------------------------------------------------------------------
-----------------------------------------------------------------------
Any response will be great appreciated!
Thank you in advance!
-Yongzheng
|