Hello everybody,
The ticket review is going out quite late this week as I was off
yesterday (I had every intention of writing it last night, but then I
ate too much food and fell asleep with a full belly instead). I'll keep
it light this week.
Scanning the VO Nagios, what do I see...
https://vo-nagios.physics.ox.ac.uk/nagios/cgi-bin/status.cgi?host=all&servicestatustypes=16&hoststatustypes=15
Brunel's had gridpp failures for the last 4 days.
Lancaster's still failing for pheno (something in the authorisation
chain is broken, can't see what).
RalPP is having a brief problem with T2K (Job submission failed, no more
possible targets).
Bristol's in downtime.
Sheffield is having trouble with gridpp, and has started having trouble
with pheno, Sno+ and T2K for the last week.
And all the srms at the Tier 1 are failing their tests (which I believe
is the status quo, although this set of tests has only been failing for
the last 11 days).
To the Tickets!
26 Open UK Tickets this morning.
100IT
https://ggus.eu/index.php?mode=ticket_info&ticket_id=109573 (23/10)
100IT have got a ticket describing the Requirements for Fed Cloud sites,
detailing requirements such as what they need to put into the GOC DB and
publish in their BDII, and that they should support dteam. Still just
Assigned (23/10)
(The other 100IT ticket, 108356, is progressing nicely and might be of
interest to anyone thinking of playing with vmcatcher).
TIER 1
https://ggus.eu/index.php?mode=ticket_info&ticket_id=109276 (11/10)
Some users were having trouble using the FTS3 REST interface at RAL.
Ticket is progressing, but I just flagged it up as it has a few
possibly-of-interest technical tidbits about reloading CRLs. In progress
(28/10)
SHEFFIELD
https://ggus.eu/index.php?mode=ticket_info&ticket_id=109644 (27/10)
Sheffield failing SAM tests, likely caused by a reyaiming. Elena has
noticed errors in her bnotifier and bupdater logs (errors like "key
job_registry_add_remote not found"), and has asked for help - I've not
seen these errors before, has anyone else? In progress (27/10)
EFDA-JET
https://ggus.eu/index.php?mode=ticket_info&ticket_id=109571 (23/10)
Nagios failures at Jet after some certificate troubles last week. The
Jet admins have said they've fixed things, and they're looking all
green, so can this ticket be closed? In Progress (23/10)
MANCHESTER
https://ggus.eu/index.php?mode=ticket_info&ticket_id=109272 (11/10)
Atlas transfer failure ticket. The original problem looked to be at the
NDGF end, but atlas have spotted (possibly unrelated) problems with
other transfers - the example was one between Manchester and Liverpool.
Atlas also observed that the error rate seemed to be very different for
each space token. In progress (26/10)
And that's all folks!
Cheers,
Matt
|