Yes, we have also seen this very frequently, and we are very interested in
having it fixed.
Regards, Antun
-----
Antun Balaz
Research Assistant
Web: http://scl.phy.bg.ac.yu/
Phone: +381 11 3713152
Fax: +381 11 3162190
Scientific Computing Laboratory
Institute of Physics Belgrade
Pregrevica 118, 11080 Belgrade, Serbia
-----
---------- Original Message -----------
From: Adam Padee
Sent: Thu, 10 Jul 2008 16:27:27 +0200
Subject: [LCG-ROLLOUT] qmgr dies unexpectedly at first request
> Hi,
>
> Recently I noticed that on my new gLite 3.1 CE, when I invoke qmgr
> and issue a command, it dies. Subsequent invocations are ok, but
> when I wait some time I get this error again. After each failure
> there is a strange message in the server logs, saying something like
> "cannot decode message". Below I paste the exact output from the
> command and relevant lines from the server logs:
>
> [root@ce server_logs]# qmgr
> Max open servers: 4
> Qmgr: list queue dteam
> [root@ce server_logs]# qmgr
> Max open servers: 4
> Qmgr: list queue dteam
> Queue dteam
> queue_type = Execution
> total_jobs = 0
> state_count = Transit:0 Queued:0 Held:0 Waiting:0 Running:0 Exiting:0
> resources_max.cput = 48:00:00
> resources_max.walltime = 72:00:00
> acl_group_enable = True
> acl_groups = dteam
> mtime = Thu Jul 10 15:32:24 2008
> enabled = True
> started = True
>
> Qmgr: quit
> [root@ce server_logs]#
>
> And there are the following messages in the server log:
>
> 07/10/2008 15:56:07;0080;PBS_Server;Req;dis_request_read;req header bad, dis error 7 (Premature end of message), type=Connect
> 07/10/2008 15:56:07;0080;PBS_Server;Req;req_reject;Reject reply code=15056(Bad DIS based Request Protocol MSG=cannot decode message), aux=0, type=Connect, from @
>
> Here are my versions of torque packages:
>
> [root@ce log]# rpm -qa |grep torque
> glite-yaim-torque-server-4.0.1-5
> torque-client-2.3.0-snap.200801151629.2cri.slc4
> torque-2.3.0-snap.200801151629.2cri.slc4
> torque-server-2.3.0-snap.200801151629.2cri.slc4
> glite-yaim-torque-utils-4.0.2-2
> [root@ce log]#
>
> Have any of you encountered similar behavior?
>
> Best regards,
> Adam
------- End of Original Message -------