Burke, S (Stephen) wrote:
> Testbed Support for GridPP member institutes
>> [mailto:[log in to unmask]] On Behalf Of Mona Aggarwal said:
>> Unable to Register the Job:
>> https://gfe01.hep.ph.ic.ac.uk:9000/E8opy-OtrSfjrWRqWti6Cg
>> to the LB logger at: gfe01.hep.ph.ic.ac.uk:9002
>> Resource temporarily unavailable (Resource temporarily unavailable -
>> edg_wll_log_proto_client: Error get answer, timeout expired;)
>
> I think you can get that if it's overloaded - how many incoming
> connections do you have? Is the logger really listening on port 9002?
>
Not much!!!
# netstat -tap | grep edg-wl-logd
tcp 0 0 *:9002 *:*
LISTEN 2834/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
gfe03.hep.ph.ic.ac.uk:53707 CLOSE_WAIT 22575/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
lx08.hep.ph.ic.ac.uk:33634 CLOSE_WAIT 23512/edg-wl-logd
tcp 0 0 gfe01.hep.ph.ic.ac.uk:9002
lx08.hep.ph.ic.ac.uk:33640 ESTABLISHED 24461/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
lx08.hep.ph.ic.ac.uk:33628 CLOSE_WAIT 22576/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
fal-pygrid-32.lancs.a:54016 CLOSE_WAIT 21617/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
fal-pygrid-32.lancs.a:54024 CLOSE_WAIT 22577/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
fal-pygrid-32.lancs.a:54026 CLOSE_WAIT 23630/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
lx07.hep.ph.ic.ac.uk:43429 CLOSE_WAIT 21730/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
lx07.hep.ph.ic.ac.uk:43433 CLOSE_WAIT 22571/edg-wl-logd
tcp 0 0 gfe01.hep.ph.ic.ac.uk:9002
lx07.hep.ph.ic.ac.uk:43438 ESTABLISHED 23936/edg-wl-logd
tcp 30 0 gfe01.hep.ph.ic.ac.uk:9002
lx07.hep.ph.ic.ac.uk:43436 CLOSE_WAIT 23414/edg-wl-logd
-----------------------------------------------------------
It seems edg-wl-bkserverd is loading the RB heavily.
The command is:
/opt/edg/sbin/edg-wl-bkserverd --super-users-file
/etc/grid-security/lb_api_superusers.dat
-----------------------------------------------------------
Regards,
Mona
|