Dear all,
Does anybody know what misty happened at Jun 22 5:09 (GMT+3:00)?
At this day and time there weren`t any updates or config changes on CE
node, but in globus-gatekeeper.log appeared this message(and still so):
=
PID: 582 -- Notice: 6: Got connection 128.142.173.150 at Sun Jun 22 05:09:05 2008
Failed reading length 0
GSS authentication failure
globus_gss_assist token :3: read failure: Connection closed
Failure: GSS failed Major:01090000 Minor:00000000 Token:00000003
=
And 30 minutes before all working well
=
PID: 3086 -- Notice: 6: Got connection 128.142.173.75 at Sun Jun 22 04:39:09 2008
TIME: Sun Jun 22 04:39:09 2008
PID: 3086 -- Notice: 5: Authenticated globus user: /DC=ch/DC=cern/OU=Organic Units/OU=Users/CN=samoper/CN=5
82979/CN=Judit Novak
...
=
ping to this IP(128.142.173.150) failed, to another - fine.
lcg-CAs are newest version, hostcert is ok
In system log on that day
"Users logging in through sshd:
sgmops005:
grid6.wdcb.ru (193.232.117.149): 84 times"
(It is WN. Logging from WN to CE is ok. gLite 3.1)
Simple tests:
$ globusrun -a -r grid8.wdcb.ru
GRAM Authentication test successful
jobsubmittion failed with
Got a job held event, reason: Globus error 12: the connection to the server failed (check host and port)
I`ve found only a little about it "Comments: One of your workers cannot run Globus jobs because the service called "gatekeeper" is not started or its port is closed by a firewall."
firewall port is opened and service is running.
Does anybody have something like that?
What further can I check to reveal the possible reason of the problem?
--
Best regards,
Alexander mailto:[log in to unmask]
|