Hi,
Try looking at MyEGI:
1. https://nagios.egee.cesnet.cz/myegi/gridmap/NGI_CZ/ROC/
2. https://nagios.egee.cesnet.cz/myegi/services/?monitored=1&profile=5-1&facelist_values_services=3877
3. https://nagios.egee.cesnet.cz/myegi/history/?monitored=1&profile=5-1&facelist_values_services=3877
4. https://nagios.egee.cesnet.cz/myegi/status/3877/5-1/1298953525/
5. https://nagios.egee.cesnet.cz/myegi/metric/popup/3877/85/1298938557/
Cheers,
Wojciech
-----Original Message-----
From: LHC Computer Grid - Rollout [mailto:[log in to unmask]] On Behalf Of Tomas Kouba
Sent: 01 March 2011 16:47
To: [log in to unmask]
Subject: Re: [LCG-ROLLOUT] How to find info about failed job on CREAM reported by nagios
On 03/01/2011 01:04 PM, Steve Traylen wrote:
> On Tue, Mar 1, 2011 at 12:22 PM, Massimo Sgaravatto - INFN Padova
> <[log in to unmask]> wrote:
>> The clean way would be to have the id from NAGIOS (but I am completely
>> ignorant on NAGIOS).
>
> The notification from nagios should have a link to the job output. Can you
> post the URL from the notification?
From the nagios log:
[1298938557] SERVICE NOTIFICATION: msg-contact;cream1.farm.particle.cz;org.sam.CREAMCE-JobSubmit-ops;CRITICAL;ncg-notify-by-msg;CRITICAL: Job was aborted.
Unfortunately, we do not send other notifications. I will try to get some info from
the operations portal logs.
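
For reference, the notification line quoted above can be split into its fields with a few lines of Python. This is only a minimal sketch: it assumes the usual Nagios log layout of "[epoch] TYPE: contact;host;service;state;command;output", and the function name parse_notification is hypothetical, not part of Nagios or MyEGI.

```python
from datetime import datetime, timezone

# The notification line exactly as it appears in the nagios log above.
LINE = (
    "[1298938557] SERVICE NOTIFICATION: msg-contact;cream1.farm.particle.cz;"
    "org.sam.CREAMCE-JobSubmit-ops;CRITICAL;ncg-notify-by-msg;"
    "CRITICAL: Job was aborted."
)

def parse_notification(line):
    """Split one SERVICE NOTIFICATION log line into named fields.

    Hypothetical helper; assumes the six-field, semicolon-separated layout.
    """
    stamp, rest = line.split("] ", 1)      # "[1298938557" and the remainder
    kind, payload = rest.split(": ", 1)    # entry type and its field list
    # Limit the split so semicolons inside the plugin output stay intact.
    contact, host, service, state, command, output = payload.split(";", 5)
    epoch = int(stamp.lstrip("["))
    return {
        "epoch": epoch,
        "time_utc": datetime.fromtimestamp(epoch, tz=timezone.utc),
        "type": kind,
        "contact": contact,
        "host": host,
        "service": service,
        "state": state,
        "command": command,
        "output": output,
    }

info = parse_notification(LINE)
print(info["time_utc"].isoformat(), info["host"], info["service"], info["state"])
```

The epoch value in the log line (1298938557) is the same number that appears at the end of the fifth MyEGI link in Wojciech's reply, so the timestamp from the log can help locate the matching entry there.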
--
Tomas Kouba
Institute of Physics, Academy of Sciences of the Czech Republic