Hi John
>>Kashif, can you please send me some references to the ‘known issue with nagios’?
https://tomtools.cern.ch/jira/browse/SAM-2782
https://ggus.eu/ws/ticket_info.php?ticket=83385
Cheers
Kashif
________________________________________
From: Testbed Support for GridPP member institutes [[log in to unmask]] on behalf of John Gordon [[log in to unmask]]
Sent: Tuesday, August 21, 2012 7:56 AM
To: [log in to unmask]
Subject: Re: Broadcast Problem fixed?
Kashif, can you please send me some references to the ‘known issue with nagios’? A GGUS ticket for example. I haven’t seen anything on the operational tools list. GOCDB consulted them before implementing the multiple email support.
Thanks,
John
From: Testbed Support for GridPP member institutes [mailto:[log in to unmask]] On Behalf Of Kashif Mohammad
Sent: 21 August 2012 07:18
To: [log in to unmask]
Subject: Re: Broadcast Problem fixed?
Hi Govind
Only active Nagios instance send alert notifications to sites. Since current active instance is Lancaster so you will only get alerts from there.
As for multiple email address in GOCDB is concerned, it is a known issue with nagios that it can not send notification to multiple addresses. Probably it will be fix in next release. As a temporary workaround I asked sites to remove multiple email from GOCDB if receiving notification is a priority to them.
There is a another workaround for same problem at Nagios level which I might apply after holidays. This works with multiple address in GOCDB.
Cheers
Kashif
Cheers
Kashif
From: Govind Songara [mailto:[log in to unmask]]
Sent: Monday, August 20, 2012 07:28 PM
To: [log in to unmask]<mailto:[log in to unmask]> <[log in to unmask]<mailto:[log in to unmask]>>
Subject: Re: Broadcast Problem fixed?
With single email id, only get emails from Lancs nagios but no email from Oxford nagios.
Does anyone get it from both instance?
On Mon, Aug 20, 2012 at 6:16 PM, Elena Korolkova <[log in to unmask]<mailto:[log in to unmask]>> wrote:
Hello John
I meant email directly from nagios like the one below
Thanks
Elena
From: nagios <[log in to unmask]<mailto:[log in to unmask]>>
Subject: [SAM Nagios] UNKNOWN - lcgce1.shef.ac.uk/org.sam.CREAMCE-JobSubmit-/ops/Role=lcgadmin<http://lcgce1.shef.ac.uk/org.sam.CREAMCE-JobSubmit-/ops/Role=lcgadmin>
Date: 20 August 2012 18:10:05 GMT+01:00
To: ShefOps <[log in to unmask]<mailto:[log in to unmask]>>
Host: lcgce1.shef.ac.uk<http://lcgce1.shef.ac.uk>
Service: org.sam.CREAMCE-JobSubmit-/ops/Role=lcgadmin
Notification Type: PROBLEM
State: UNKNOWN
Additional Info: UNKNOWN: [Submitted-Cancel failed [timeout/dropped]] Problem cancelling job.
Nagios URL: https://gridppnagios.lancs.ac.uk/nagios/cgi-bin/extinfo.cgi?type=2&host=lcgce1.shef.ac.uk&service=org.sam.CREAMCE-JobSubmit-/ops/Role=lcgadmin
Documentation link: https://tomtools.cern.ch/confluence/display/SAM/CE
Additional info link:
Date/Time: Mon Aug 20 18:10:05 BST 2012
On 19 Aug 2012, at 09:29, John Gordon wrote:
> Elena, do you mean warnings directly from nagios, or the ROD tickets in GGUS raised in response to nagios probe failures?
>
> For the latter I raised a ticket and it is on hold waiting for a GGUS guy to come back from holiday. If it is the former then I am waiting for Kashif to come back from holiday and explain more.
>
> Regards,
>
> John
>
>> -----Original Message-----
>> From: Testbed Support for GridPP member institutes [mailto:TB-<mailto:TB->
>> [log in to unmask]<mailto:[log in to unmask]>] On Behalf Of Elena Korolkova
>> Sent: 19 August 2012 07:08
>> To: [log in to unmask]<mailto:[log in to unmask]>
>> Subject: Re: Broadcast Problem fixed?
>>
>> Hello
>>
>> there was also a problem to receive warnings from nagios for sites who have
>> multiple addresses.
>> Is this problem solved as well?
>>
>> Thanks
>> Elena
>>
>> On 17 Aug 2012, at 12:07, John Gordon wrote:
>>
>>> Thanks Rob and Matt.
>>>
>>> Now can anyone who DIDN'T receive it let me know.
>>>
>>> Thanks,
>>>
>>> John
>>>
>>> -----Original Message-----
>>> From: Testbed Support for GridPP member institutes
>>> [mailto:[log in to unmask]<mailto:[log in to unmask]>] On Behalf Of Rob Fay
>>> Sent: 17 August 2012 12:05
>>> To: [log in to unmask]<mailto:[log in to unmask]>
>>> Subject: Re: Broadcast Problem fixed?
>>>
>>> '[ EGI BROADCAST ] Test of UK Broadcasts' received at 11:00.
>>>
>>> GOCDB has [log in to unmask]<mailto:[log in to unmask]> and Alessandra configured for
>>> email for our site, so looks like that's worked to me (assuming
>> Alessandra received it as well).
>>>
>>> Cheers,
>>>
>>> Rob
>>>
>>> On 17/08/2012 11:58, John Gordon wrote:
>>>> The Ops portal say they have fixed the problem with broadcasts to
>>>> sites with multiple emails in GOCDB. I sent a broadcast to test this
>>>> to all UK sites. Can the first site with multiple email addresses to
>>>> read this please reply and tell me if you received my broadcast.
>>>>
>>>> Thanks,
>>>>
>>>> John
>>>>
>>>>
>>>> --
>>>> Scanned by iCritical.
>>>>
>>>>
>>>
>>> --
>>> Robert Fay [log in to unmask]<mailto:[log in to unmask]>
>>> System Administrator office: 220
>>> High Energy Physics Division tel (int): 43396
>>> Oliver Lodge Laboratory tel (ext): +44 (0)151 794 3396<tel:%2B44%20%280%29151%20794%203396>
>>> University of Liverpool http://www.liv.ac.uk/physics/hep/
>>> --
>>> Scanned by iCritical.
>>
>> __________________________________________________
>> Dr Elena Korolkova
>> Email: [log in to unmask]<mailto:[log in to unmask]>
>> Tel.: +44 (0)114 2223553<tel:%2B44%20%280%29114%202223553>
>> Fax: +44 (0)114 2223555<tel:%2B44%20%280%29114%202223555>
>> Department of Physics and Astronomy
>> University of Sheffield
>> Sheffield, S3 7RH, United Kingdom
> --
> Scanned by iCritical.
__________________________________________________
Dr Elena Korolkova
Email: [log in to unmask]<mailto:[log in to unmask]>
Tel.: +44 (0)114 2223553<tel:%2B44%20%280%29114%202223553>
Fax: +44 (0)114 2223555<tel:%2B44%20%280%29114%202223555>
Department of Physics and Astronomy
University of Sheffield
Sheffield, S3 7RH, United Kingdom
|