Hello all,
It's the first Monday of another month already, so it's time for a full
round up of all the tickets, as we have done since the days of yore.
23 Open UK Tickets this month.
SUSSEX
https://ggus.eu/?mode=ticket_info&ticket_id=122772 (11/7/16)
Sussex have just the one ticket- the atlas xroot/httpd one. Last word
was that Leo had contacted Dan at QM for help. In progress (5/9)
RALPP
https://ggus.eu/?mode=ticket_info&ticket_id=130264 (28/8)
Biomed CE publishing ticket - Chris is waiting on the corresponding
Brunel ticket (130263) to see how the fix offered works out. In progress
(26/9)
OXFORD
https://ggus.eu/?mode=ticket_info&ticket_id=130032 (11/8)
LHCB jobs failing on upload. The ticket occurred at a time of multiple
SE troubles, and there's not been any news throughout September so first
port of call will be to see if the problems persist. In progress (23/8)
https://ggus.eu/?mode=ticket_info&ticket_id=130173 (22/8)
A ticket from Duncan about incomplete perfsonar results, with the likely
solution being to reinstall to CentOS7. Kashif is on it. In progress (26/9)
https://ggus.eu/?mode=ticket_info&ticket_id=129931 (4/8)
http SAM tests failing - for unknown reasons. Kashif hopes an update of
the headnode will fix things. On hold (19/9)
CAMBRIDGE
https://ggus.eu/?mode=ticket_info&ticket_id=130787 (28/9)
LHCB pilots failing at Cambridge. It looks like lhcb jobs are failing
due to hitting a CPU time limit, although no changes have been made site
side to break things. John proposed increasing the CPU limits. Waiting
for reply (28/9)
BRISTOL
https://ggus.eu/?mode=ticket_info&ticket_id=130646 (20/9)
Low CMS xroot HC rates. After some clarification on the problem Lukasz
is looking at the xroot logs. Did you see anything? In progress (26/9)
BIRMINGHAM
https://ggus.eu/?mode=ticket_info&ticket_id=129930 (4/8)
The Birmingham version of the Oxford http SAM test ticket. Although
symptoms are slightly different it's equally hard to debug. Any news, or
perhaps we can try to rally the troops for another bash at helping? In
progress (16/8)
MANCHESTER
https://ggus.eu/?mode=ticket_info&ticket_id=130868 (2/10)
A fresh ROD CE submission test failure ticket. Assigned (2/10)
LIVERPOOL
https://ggus.eu/?mode=ticket_info&ticket_id=130518 (12/9)
One of those ROD availability tickets we all loathe. Steve kept us all
in the loop, and hopefully things will go green soon. On hold (2/10)
LANCASTER
https://ggus.eu/?mode=ticket_info&ticket_id=130753 (26/9)
Setting up na62 at Lancaster, things seem to be working for them after
the usual back-and-forth and we just need some more jobs to flow. On
Hold (26/9)
QMUL
https://ggus.eu/?mode=ticket_info&ticket_id=130262 (28/8)
Another biomed publishing ticket, although this one is aimed at the SE.
After some feedback from Biomed it looks like it's the glue2 bit that's
broken. In progress (26/9)
IMPERIAL (well, Dirac)
https://ggus.eu/?mode=ticket_info&ticket_id=130202 (24/8)
Not really a IC ticket, but one to all na62 sites - about na62 jobs
waiting too long. Dan included a useful link in the ticket:
https://na62.gla.ac.uk/index.php?task=stats&view=sitehealth - there's
feedback from RAL in the ticket too. Is it just fairshare (fairly)
stopping na62 jobs running as quickly as they'd like? In progress (27/9)
BRUNEL
https://ggus.eu/?mode=ticket_info&ticket_id=130742 (26/9)
LHCB noticing pilots failing at Brunel - Raul points out that the CE is
being replaced so the problems should be going away. In progress (26/9)
https://ggus.eu/?mode=ticket_info&ticket_id=130263 (28/8)
Biomed ticket about negative running jobs being published at Brunel -
the ARC devs are involved and Raul has kindly offered to test what they
have. Any news from them? In progress (13/9)
TIER 1
https://ggus.eu/?mode=ticket_info&ticket_id=128991 (16/6)
Solidexperiment.org tape support. The Castor tape is reading for
testing, just waiting on word from the VO (aka Janusz). But Janusz is
busy in a control room somewhere... Waiting for reply (13/9)
https://ggus.eu/?mode=ticket_info&ticket_id=130467 (10/9)
CMS SAM tests failing at RAL, due to a lack of space on CASTOR. Chris
has identified a bunch of dark data and set about purging it, ready for
another round of consistency checks. In Progress (25/9)
https://ggus.eu/?mode=ticket_info&ticket_id=130782 (27/9)
A request from lhcb to deploy the latest version of heposlibs (which
contains a dependency on git). The request has been passed along at the
Tier 1. In progress (28/9)
https://ggus.eu/?mode=ticket_info&ticket_id=130193 (23/8)
CMS staging of files taking a too long from RAL tape- possibly due to a
bunch of corrupt files (although manual copies for some problem files
are working). George has asked CMS to try again, and if problems persist
send a list of dodgy files. Waiting for reply (29/9)
https://ggus.eu/?mode=ticket_info&ticket_id=130207 (24/8)
MICE seeing timeouts copying to CASTOR. Gareth provided an indepth
update in the ticket, the ticket is being kept open whilst the GenTape
disk pool is upgraded. In progress (6/9)
https://ggus.eu/?mode=ticket_info&ticket_id=127597 (7/4)
A CMS ticket to check networking and xroot performance, held up for a
while waiting on the RAL networking team. There is some recent movement
to repoke the networkers. On hold (2/10)
https://ggus.eu/?mode=ticket_info&ticket_id=124876 (7/11/16)
ECHO SAM tests failing due to a problem with the tests - no movement on
the counter-ticket (125026) since April - it likely needs a kick. On
hold (1/1)
https://ggus.eu/?mode=ticket_info&ticket_id=117683 (18/11/15)
Getting GLUE 2 working for CASTOR. Intermittently worked on as time
allows. The ticket could do with an update this quarter. On hold (6/7)
And that's all the tickets! I'll catch you all in tomorrow's meeting.
Cheers!
Matt
|