Daniela's given a good outline of how it works in practice, and I don't have much to add on that front.
For the more formal processes, I would point to https://wiki.egi.eu/wiki/Grid_operations_oversight/ROD and the linked https://wiki.egi.eu/wiki/PROC01, which describes what CoD should do.
The basic idea is to ensure that there is always someone checking everything is working at the level below. It goes:
Site -> RoD -> CoD -> NGI
If the NGI becomes unresponsive, or chronically underperforms, then EGI can uncertify problematic sites. In practice, as long as there is someone responsive at the NGI, EGI is going to leave it to the NGI.
Although it seems rather bureaucratic, and appears to carry a clear implication of superior power, CoD's primary function is to ensure the Grid is reliable for the end users. Without sites, there are no resources available, so it's not actually in EGI's interest to chastise sites.
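For anyone who prefers to see the escalation logic written down, here is a rough sketch of the thresholds Daniela recalls in her mail below. It is not taken from the dashboard code or from PROC01: the function name, argument names and action labels are all made up for illustration, and the 5-day, 72-hour and 30-day figures are the approximate values from her description.

```python
from datetime import timedelta

# Thresholds as recalled in Daniela's mail; she flags them as approximate.
ALARM_WITHOUT_TICKET_LIMIT = timedelta(hours=72)
TICKET_AGE_LIMIT = timedelta(days=30)
SITE_RESPONSE_LIMIT = timedelta(days=5)


def next_action(alarm_age, ticket_age=None, unanswered_for=None, reminded=False):
    """Return a (hypothetical) label for what happens next to an alarm.

    alarm_age      -- how long the alarm has been showing on the dashboard
    ticket_age     -- age of the ROD ticket, or None if no ticket exists yet
    unanswered_for -- how long the site has left the ticket unanswered
    reminded       -- whether ROD has already sent a reminder
    """
    if ticket_age is None:
        # No ticket yet: ROD should open one before COD notices.
        if alarm_age > ALARM_WITHOUT_TICKET_LIMIT:
            return "picked up by COD"
        return "ROD opens ticket"
    if ticket_age > TICKET_AGE_LIMIT:
        return "picked up by COD"
    if unanswered_for is not None and unanswered_for > SITE_RESPONSE_LIMIT:
        # First a reminder, then escalation if the site stays silent.
        return "ROD escalates to COD" if reminded else "ROD sends reminder"
    return "wait for site"
```

So a fresh alarm gets a ticket, a silent site gets a reminder and then an escalation, and anything left too long ends up with COD regardless.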
On 20 Feb 2012, at 18:00, Daniela Bauer wrote:
> Hi Stephen,
>
> I can't answer this question without incriminating myself, because
> I've never actually read the documentation. And it doesn't seem to be
> linked from the dashboard anyway.
>
> What it boils down to is to check the dashboard once or twice a day,
> issuing a ticket wrt any persistent alarms at sites you come across.
> If the site doesn't respond after a while (the default is about 5
> days if I remember), you are meant to issue a reminder and if there's
> still no answer after a while you are meant to escalate the ticket to
> COD which in turn issues tickets to the NGI_UK managers. I am not sure
> that ever happens, because we usually deal with it internally and the
> dashboard has still some glitches that don't quite enforce the
> escalation procedure as you could do ....
> On the other hand the dashboard is buggy enough, so you can file
> counter tickets against the dashboard people.
>
> Any alarm without a ticket older than 72 h or ticket older than 30
> days (?) gets automatically picked up by the COD dashboard who then
> again issue a ticket to the NGI.
>
> In principle I think COD can withdraw the "Certified" status of a
> site, but again, I've never seen that happen.
>
> In short: It's a job a trained monkey with a penchant for admin work
> could do, but somebody has to do it and it ticks all the right boxes
> when filling out an EGI time sheet. We all get along fabulously and we
> are all glad that we aren't Cyril L'Orphelin who seems to be running
> the dashboard/CIC portal by himself. :-)
>
> Was that helpful ?
>
> Cheers,
> Daniela
>
>
> On 20 February 2012 17:08, Stephen Jones <[log in to unmask]> wrote:
>> Daniela,
>>
>> Aside: as a matter of interest, is there a (concise) list of "ROD duties"
>> and sanctions that the fearful sounding "COD" could levy?
>>
>> I'd be interested to know the key people and processes involved and
>> how they all get along.
>>
>> Steve
>>
>>
>>
>> Daniela Bauer wrote:
>>>>>
>>>>> MANCHESTER:
>>>>> https://ggus.eu/ws/ticket_info.php?ticket=78776
>>>>> Ops tests failing due to "lack of space". Kashif mentions ticketing the
>>>>> dpm developers or somehow figuring out a workaround. Sounds like we need to
>>>>> refer it to the storage group.
>>>>>
>>>
>>>
>>> This is reasonably urgent, as a ticket from the ROD dashboard cannot
>>> be extended beyond 30 days or so (i.e. during my ROD duty, sigh)
>>> without drawing the ire of COD . Manchester currently shows at 0%
>>> availability/reliability this month which almost certainly requires
>>> some kind of formal explanation at the end of the month. It would be
>>> good if the site made some kind of statement in the ticket, right now
>>> it mainly consists of ROD entries, making it look abandoned.
>>>
>>> Cheers,
>>> Daniela
>>>
>>>
>>
>>
>>
>> --
>> Steve Jones [log in to unmask]
>> System Administrator office: 220
>> High Energy Physics Division tel (int): 42334
>> Oliver Lodge Laboratory tel (ext): +44 (0)151 794 2334
>> University of Liverpool http://www.liv.ac.uk/physics/hep/
>
>
>
> --
> -----------------------------------------------------------
> [log in to unmask]
> HEP Group/Physics Dep
> Imperial College
> Tel: +44-(0)20-75947810
> http://www.hep.ph.ic.ac.uk/~dbauer/