Hi Stephen,
I can't answer this question without incriminating myself, because
I've never actually read the documentation. And it doesn't seem to be
linked from the dashboard anyway.
What it boils down to is to check the dashboard once or twice a day,
issuing a ticket wrt any persistent alarms at sites you come across.
If the site doesn't respond after a while (the default is about a 5
days if I remember), you are meant to issue a reminder and if there's
still no answer after a while you are meant to escalate the ticket to
COD which in turn issues tickets to the NGI_UK managers. I am not sure
that ever happens, because we usually deal with it internally and the
dashboard has still some glitches that don't quite enforce the
escalation procedure as you could do ....
On the other hand the dashboard is buggy enough, so you can file
counter tickets again the dashboard people.
Any alarm without a ticket older than 72 h or ticket older than 30
days (?) gets automatically picked up by the COD dashboard who then
again issue a ticket to the NGI.
In principal I think COD can withdraw the "Certified" status of a
site, but again, I've never seen that happen.
In short: It's a job a trained monkey with a penchant for admin work
could do, but somebody has to do it and it ticks all the right boxes
when filling out an EGI time sheet. We all get along fabulously and we
are all glad that we aren't Cyril L'Orphelin who seems to be running
the dashboard/CIC portal by himself. :-)
Was that helpful ?
Cheers,
Daniela
On 20 February 2012 17:08, Stephen Jones <[log in to unmask]> wrote:
> Daniela,
>
> Aside: as a matter of interest, is there a (concise) list of "ROD duties"
> and sanctions that the fearful sounding "COD" could levy?
>
> I'd be interested to know the key people and processes involved and
> how they all get along.
>
> Steve
>
>
>
> Daniela Bauer wrote:
>>>>
>>>> MANCHESTER:
>>>> https://ggus.eu/ws/ticket_info.php?ticket=78776
>>>> Ops tests failing due to "lack of space". Kashif mentions ticketing the
>>>> dpm developers or somehow figuring out a workaround. Sounds like we need to
>>>> refer it to the storage group.
>>>>
>>
>>
>> This is reasonably urgent, as a ticket from the ROD dashboard cannot
>> be extended beyond 30 days or so (i.e. during my ROD duty, sigh)
>> without drawing the ire of COD . Manchester currently shows at 0%
>> availability/reliability this month which almost certainly requires
>> some kind of formal explanation at the end of the month. It would be
>> good if the site made some kind of statement in the ticket, right now
>> it mainly consists of ROD entries, making it looked abandoned.
>>
>> Cheers,
>> Daniela
>>
>>
>
>
>
> --
> Steve Jones [log in to unmask]
> System Administrator office: 220
> High Energy Physics Division tel (int): 42334
> Oliver Lodge Laboratory tel (ext): +44 (0)151 794 2334
> University of Liverpool http://www.liv.ac.uk/physics/hep/
--
-----------------------------------------------------------
[log in to unmask]
HEP Group/Physics Dep
Imperial College
Tel: +44-(0)20-75947810
http://www.hep.ph.ic.ac.uk/~dbauer/
|