Hi Maarten,
Many thanks indeed for your help.
I have left port 9001 (and only 9001) opened to world on my RB node.
This way I can use my RB but others cannot.
About the edg-wl-interlogd message, I would like to stress the fact that
it was issued once a minute for about two hours, when no gatekeeper
connection happened (hence no dteam job, site in JL state).
I have included below the relevant part of our /var/log/messages file on
CE, maybe it could help determine whether it was a normal behavior or not.
Best regards,
Dan
Jun 30 06:20:31 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:22:31 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:22:42 wipp-ce kernel: application bug: edg-gridftpd(13557) has
SIGCHLD set to SIG_IGN but calls wait().
Jun 30 06:22:42 wipp-ce kernel: (see the NOTES section of 'man 2 wait').
Workaround activated.
Jun 30 06:22:43 wipp-ce gridftpd[23023]:
2005-06-30.06:22:43.147533.0000023023.0000000332 : LCAS authorization
request
Jun 30 06:22:43 wipp-ce gridftpd[23023]:
2005-06-30.06:22:43.153384.0000023023.0000000332 : LCMAPS credential
mapping request
Jun 30 06:22:43 wipp-ce gridftpd[23023]: GSSAPI user
/C=UK/O=eScience/OU=QueenMaryLondon/L=Physics/CN=dave kant is authorized
as dteam001
Jun 30 06:22:43 wipp-ce gridftpd[23023]:
2005-06-30.06:22:43.188319.0000023023.0000000332 : LCMAPS credential
mapping request
Jun 30 06:22:43 wipp-ce gridftpd[23023]:
2005-06-30.06:22:43.188319.0000023023.0000000332 :
lcmaps_plugin_posix_enf-log_cred(): uid=18118(dteam001):pgid=2688(dteam)
Jun 30 06:22:43 wipp-ce gridftpd[23023]: cannot open pid file
/var/run/ftp.pids-all: Permission denied
Jun 30 06:22:43 wipp-ce sshd(pam_unix)[23027]: authentication failure;
logname= uid=0 euid=0 tty=NODEVssh ruser= rhost=eio23.pp.weizmann.ac.il
user=dteam001
Jun 30 06:22:46 wipp-ce sshd[23027]: Accepted hostbased for dteam001
from 192.168.1.23 port 39306 ssh2
Jun 30 06:22:46 wipp-ce sshd(pam_unix)[23030]: session opened for user
dteam001 by (uid=18118)
Jun 30 06:22:46 wipp-ce sshd(pam_unix)[23030]: session closed for user
dteam001
Jun 30 06:22:46 wipp-ce sshd(pam_unix)[23038]: authentication failure;
logname= uid=0 euid=0 tty=NODEVssh ruser= rhost=eio23.pp.weizmann.ac.il
user=dteam001
Jun 30 06:22:48 wipp-ce sshd[23038]: Accepted hostbased for dteam001
from 192.168.1.23 port 39307 ssh2
Jun 30 06:22:48 wipp-ce sshd(pam_unix)[23040]: session opened for user
dteam001 by (uid=18118)
Jun 30 06:22:48 wipp-ce sshd(pam_unix)[23040]: session closed for user
dteam001
Jun 30 06:24:31 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:25:10 wipp-ce gridinfo: [21690-21690] Job
1120101306:lcgpbs:internal_4224631491:21636.1120101302 (ID
2398.wipp-ce.weizmann.ac.il) has finished
Jun 30 06:25:10 wipp-ce gridinfo: [21690-21690] summary:
Jun 30 06:25:10 wipp-ce gridinfo: [21690-21690] Sorry, no accounting
information is collected from this type of batch system at the moment
Jun 30 06:25:10 wipp-ce gridinfo: [21690-21690] -- end of summary
Jun 30 06:26:31 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:28:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:28:52 wipp-ce su(pam_unix)[12353]: session closed for user rgma
Jun 30 06:30:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:32:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:32:40 wipp-ce gridinfo[18718]: JMA 2005/06/30 06:32:40
GATEKEEPER_JM_ID 2005-06-30.06:01:06.0000011906.0000000641 JM exiting
Jun 30 06:32:55 wipp-ce gridinfo[21643]: JMA 2005/06/30 06:32:55
GATEKEEPER_JM_ID 2005-06-30.06:15:01.0000011906.0000000648 JM exiting
Jun 30 06:33:15 wipp-ce gridinfo[13526]: JMA 2005/06/30 06:33:15
GATEKEEPER_JM_ID 2005-06-30.05:38:31.0000011906.0000000640 JM exiting
Jun 30 06:34:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:36:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:38:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:40:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:42:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:44:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:46:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:48:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:50:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:52:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:54:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:56:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 06:58:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:00:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:01:04 wipp-ce edg-rgma-gin: /etc/init.d/edg-rgma-gin shutdown
succeeded
Jun 30 07:01:04 wipp-ce su(pam_unix)[28087]: session opened for user
rgma by (uid=0)
Jun 30 07:01:07 wipp-ce edg-rgma-gin: /etc/init.d/edg-rgma-gin startup
succeeded
Jun 30 07:01:27 wipp-ce su(pam_unix)[28087]: session closed for user rgma
Jun 30 07:02:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:04:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:06:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:08:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:10:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:12:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:14:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:16:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:18:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:20:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:22:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:24:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:26:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:28:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:30:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:32:02 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:33:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:35:02 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:36:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:38:02 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:39:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:41:02 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:42:32 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:44:02 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:45:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:47:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:48:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:50:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:51:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:53:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:54:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:55:50 wipp-ce ntpd[4385]: synchronisation lost
Jun 30 07:56:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:57:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 07:59:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:00:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:01:04 wipp-ce edg-rgma-gin: /etc/init.d/edg-rgma-gin shutdown
succeeded
Jun 30 08:01:04 wipp-ce su(pam_unix)[1654]: session opened for user rgma
by (uid=0)
Jun 30 08:01:07 wipp-ce edg-rgma-gin: /etc/init.d/edg-rgma-gin startup
succeeded
Jun 30 08:01:26 wipp-ce su(pam_unix)[1654]: session closed for user rgma
Jun 30 08:02:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:03:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:05:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:06:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:08:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:09:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:11:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:12:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:14:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:15:33 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:17:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
event_queue_connect: edg_wll_ssl_connect
Jun 30 08:18:05 wipp-ce GRAM gatekeeper[3269]: Got connection
128.142.65.125 at Thu Jun 30 08:18:05 2005
FROM THIS POINT on, no more dg-wl-interlogd messages.
Maarten Litmaath wrote:
> Dan Schrager wrote:
>
>> Hi Maarten,
>>
>> 1) YES, it happened just then.
>
>
> Interesting, that would be a clue to get rid of those messages.
>
>> 2) a) Please explain, what does it mean "RB usable externally" ? RB
>> used by other sites that don't have RB node ? If so, I don't want it,
>> thank you very much, my name is not cern.ch...
>
>
> OK, so your RB shall only be used internally.
>
> BTW, I was mistaken not mentioning port 9001, which must also be visible
> when the RB is to be usable externally.
>
>> b) What does it mean "to submit jobs only locally, but to
>> external sites" ? If I want a job issued at my site to run at another
>> site (like grid is supposed to work ?) then I need to keep port 9002
>> (and only 9002) on my RB publicly opened ? Yes, that I want it, but
>> would this
>
>
> Sorry, it should be port 9001. In that case others cannot use your RB.
>
>> mean that other sites would be also able (in theory) to use my RB
>> node (like some sites use CERN's RB node) ? (see 2(a), that I don't
>> want...)
>>
>> Best regards,
>> Dan
>>
>>
>>
>> Maarten Litmaath wrote:
>>
>>> Dan Schrager wrote:
>>>
>>>> Hi everybody,
>>>>
>>>> 1) Does anyone know what is the meaning of the following message
>>>> repeated every minute or so for about two hours on the CE node in
>>>> /var/log/messages:
>>>>
>>>> Jun 30 08:08:03 wipp-ce edg-wl-interlogd[13825]: queue_thread:
>>>> event_queue_connect: edg_wll_ssl_connect
>>>
>>>
>>>
>>>
>>> We see those (annoying) messages all the time. Are you sure you only
>>> saw them during the period your site was flagged?
>>>
>>>> During that time our site was flagged as JL in the test zone.
>>>> Before and after that we were (and are) OK.
>>>>
>>>> 2) I have also just filtered out ports 9000-9002 on RB ( Logging &
>>>> Bookeeping and localloger), making them exclusively local
>>>> accessible. Could you please confirm (before tomorrow's next test)
>>>> that it is right to do so ?
>>>
>>>
>>>
>>>
>>> If your RB is to be usable externally, ports 9000 and 9002 must be
>>> visible.
>>
>
> And port 9001 too.
>
>>> If you want to submit jobs only locally, but to external sites, port
>>> 9002
>>
>
> That should be 9001 instead.
>
>>> ought to be visible, else the jobs will have incomplete logging info
>>> (but they can succeed).
>>
|