Hello all,
We've come through the first quarter of 2017 already, and the time has
come for the fourth start-of-the-month review. So with GridPP38 just
around the corner let's look at what GGUS has for the UK...
30 UK tickets this month
SUSSEX
https://ggus.eu/?mode=ticket_info&ticket_id=122772 (11/7/16)
Atlas webdav/xroot ticket. Any luck, or would you like a hand at GridPP
this week? On hold (26/1)
https://ggus.eu/?mode=ticket_info&ticket_id=125503 (9/12/16)
Sno+ ticket about file access problems due to a wrong SE name in the
LFC. Any word on this too? I think a plan was put in place. In progress
(30/1)
RALPP
https://ggus.eu/?mode=ticket_info&ticket_id=126902 (2/3)
CMS ticket. I got a bit lost trying to follow it, but it's a moot point as
CMS indicate it can be closed. In progress (3/4)
BRISTOL
https://ggus.eu/?mode=ticket_info&ticket_id=126864 (28/2)
Request to enable LZ, Daniela has provided the requested information. In
progress (31/3)
https://ggus.eu/?mode=ticket_info&ticket_id=126865 (28/2)
A CMS ticket from Daniela, concerning ipv6 transfer failures to/from
Bristol. Things were looking better, although there is an outstanding
question that Winnie highlighted about the CERN setup that perhaps
Duncan or someone could answer? In progress (31/3)
BIRMINGHAM
https://ggus.eu/?mode=ticket_info&ticket_id=127319 (27/3)
A low-availability ticket. Whilst these are boring, it needs to be tended
to (i.e. put In Progress or On Hold). Assigned (27/3)
GLASGOW
https://ggus.eu/?mode=ticket_info&ticket_id=124052 (25/9/16)
LHCB ticket concerning incorrect job publishing, to be fixed in the next
generation of ARC CEs deployed at Glasgow. Sadly the time has come for
another update, even if it's a totally dry one. On Hold (31/1)
https://ggus.eu/?mode=ticket_info&ticket_id=127160 (16/3)
An availability ticket. Nothing more to say than that. On hold (16/3)
SHEFFIELD
https://ggus.eu/?mode=ticket_info&ticket_id=127210 (19/3)
Atlas transfer timeout failures. After coming out of downtime, failures
persist. Perhaps a similar problem to what we saw at Lancaster last
week? As per the post to the storage list those issues were apparently
soothed by increasing the DPM threads. In progress (3/4)
MANCHESTER
https://ggus.eu/?mode=ticket_info&ticket_id=127464 (3/4)
A very fresh atlas deletion error ticket. In progress (3/4)
https://ggus.eu/?mode=ticket_info&ticket_id=127384 (29/3)
LSST authorisation failure ticket. Alessandra has tracked down what are
hopefully all the config errors that crept in during the move from svn to
git, so this should be nearly sorted. In progress (31/3)
LIVERPOOL
https://ggus.eu/?mode=ticket_info&ticket_id=124819 (3/11/16)
AFS ticket. After the firewall ports were opened the submitter provided
some feedback, but no news back from the site. Perhaps just put this
ticket out of its misery (like what will soonish happen for AFS itself)?
In progress (13/2)
https://ggus.eu/?mode=ticket_info&ticket_id=127353 (28/3)
Steve bravely rolled out a small Centos7 test cluster and a Sno+ job
accidentally landed on it - they kept it that way to test things out but
sadly it looks like their tests failed and they have asked for their jobs
to not land on the test cluster anymore. In progress (2/4)
https://ggus.eu/?mode=ticket_info&ticket_id=126956 (6/3)
Availability ticket due to the annoying ARC monitoring issues. On hold
(27/3)
QMUL
https://ggus.eu/?mode=ticket_info&ticket_id=127352 (28/3)
Icecube jobs failing on a QM GPU node - the likely cause has been
spotted (old AMD libs sitting on the system with a new nvidia card in
it) but it might be a little while till this is fixed. Dan has proposed
using this as an opportunity to roll out a Centos7 test node which
Icecube were okay with. In progress (31/3)
https://ggus.eu/?mode=ticket_info&ticket_id=127144 (15/3)
LHCB saw problems with ce04, which Dan reckons were caused by load and
has asked if there are still problems. Waiting for reply (31/3)
https://ggus.eu/?mode=ticket_info&ticket_id=126261 (30/1)
A biomed ticket for ce04, although they rechecked if this was still a
problem during the aforementioned load problems. There seem to be other
errors too though - maybe related to the biomed infrastructure? In
progress (31/3)
https://ggus.eu/?mode=ticket_info&ticket_id=126650 (15/2)
cern@school errors due to a misconfig in the VO usernames (slurm only
does lowercase usernames!). Dan has rolled out the new users and Daniela
has rolled out some test jobs. In progress (31/3)
https://ggus.eu/?mode=ticket_info&ticket_id=127445 (1/4)
Another biomed submission error ticket, I'm not sure if this is a
duplicate of 126261. It looks like a similar error (on ce5 this time
though). Assigned (1/4)
BRUNEL
https://ggus.eu/?mode=ticket_info&ticket_id=127117 (13/3)
A request from CMS to upgrade the spacemon client. Raul was on it. Any
luck with this? Although I've just remembered that Raul is in a
different hemisphere so that question might fall on a deaf inbox. In
progress (14/3)
https://ggus.eu/?mode=ticket_info&ticket_id=127126 (14/3)
Availability ticket, again by the looks of it due to the ARC monitoring
playing up. On hold (27/3)
TIER 1
https://ggus.eu/?mode=ticket_info&ticket_id=127251 (21/3)
A ticket from an atlas user concerning trouble with transfers into castor
and some errors the user is seeing. John has requested more
information as the files themselves seem present and correct, but
someone who has some idea as to what the error messages listed by the
submitter mean would be handy. Waiting for reply (27/3)
https://ggus.eu/?mode=ticket_info&ticket_id=127449 (2/4)
One of the RAL ARCs wasn't working well for LHCB - but the problems
appear to have passed and the ticket can be closed now. In progress (3/4)
https://ggus.eu/?mode=ticket_info&ticket_id=126905 (2/3)
CVMFS commissioning for the SOLID experiment. With effort from Daniela
and Catalin things all look to be working for solid now, with
/cvmfs/solidexperiment.egi.eu exported nicely and the VO able to upload
to it. Looks like another ticket can be closed. Waiting for reply (29/3)
https://ggus.eu/?mode=ticket_info&ticket_id=127388 (29/3)
LHCB troubles accessing some files at RAL. Have these issues passed with
the other castor problems from the weekend? In progress (3/4)
https://ggus.eu/?mode=ticket_info&ticket_id=127240 (21/3)
CMS request to run staging tests in prep for Run 2. There was a request
from CMS for access to some monitoring plots, I assume for the transfer
rates between buffers, but it wasn't very clear. In progress (27/3)
https://ggus.eu/?mode=ticket_info&ticket_id=126184 (26/1)
Atlas request for site monitoring input. Alessandra went over this in
last week's atlas uk meeting. It's not too late to have your say in the
google docs. In progress (7/2)
https://ggus.eu/?mode=ticket_info&ticket_id=124876 (7/11/16)
ROD ticket concerning tests to the RAL echo instance. Alastair's counter
ticket
(https://www.ggus.org/index.php?mode=ticket_info&ticket_id=125026)
hasn't had an update since last year - I think it needs a kick. On Hold
(1/1)
https://ggus.eu/?mode=ticket_info&ticket_id=117683 (18/11/15)
Castor Glue 2 publishing. Rob reported some good progress. On Hold (2/3)
NGI
https://ggus.eu/?mode=ticket_info&ticket_id=126808 (24/2)
WMS usage ticket - mainly involving Imperial and the Tier 1. There was
some worry from Daniela regarding the closure of old WMS tickets due to
it being "no longer supported", but there were reassurances that
security bugs would be fixed. Are you feeling reassured? In progress (20/3)
And that's all the tickets! I'll hopefully catch many of you in Brighton
later this week!
Cheers,
Matt