Heya all,
We're hitting Febuary with 43 open tickets - slowly whittling away at
them! It's the first Monday of the month, so let's dive into them all.
Cheers!
Matt
NGI
https://ggus.eu/ws/ticket_info.php?ticket=90451 (15/1)
Grouping of the core services. Progress is being made, although no
deadline for this work has been given. In Progress (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=91081 (1/2)
Our ROD team has been ticketed in order to make sure they're keeping
track of out of date services after the 1st February deadline. We'll
discuss this in the meeting. In progress (4/2)
TIER 1
https://ggus.eu/ws/ticket_info.php?ticket=86152 (17/9/2012)
"Correlated packet-loss on perfsonar host". This ticket is being
considered within the scope of wider scale networking issues at RAL, but
other aspects of the investigation are coming first. On hold (16/1)
https://ggus.eu/ws/ticket_info.php?ticket=91029 (30/1)
atlas were having a problem querying the FTS jobs, which if I'm reading
the ticket right might have been caused by some transfers between castor
and QMUL's storm going awry. Chris has offered to upgrade his storm to
EMI2 if it's thought that would help, and has asked atlas what they'd
like him to do. In Progress (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=90151 (8/1)
neiss have been enabled on the RAL WMS, but some problems still need to
be ironed out. In Progress (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=90528 (17/1)
The RAL WMS isn't assigning SNO+ jobs to Sheffield. Still being
investigated. In Progress (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=91060 (31/1)
A CMS ticket (although it's not got CMS as its "Concerned VO"), about
glexec problems on a few workers. There was a few days where identity
switching didn't work. More pool accounts have been requested, and when
that's done the issue should be solved. In progress (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=89733 (17/12/2012)
Chris' uncovering of a dodgey top-BDII node at RAL. A new BDII trinity
went live today, hopefully that'll have solved the problems. We're now
at the wait-and-see-if-it's-fixed stage. In progress (4/2)
RALPP
https://ggus.eu/ws/ticket_info.php?ticket=90244 (10/1)
Atlas migration from groupdisk. Waiting on atlas to finish moving data.
With Brian on the other side of the planet it might need someone else to
keep an eye on this and similar tickets. Waiting for reply (29/1)
https://ggus.eu/ws/ticket_info.php?ticket=90863 (27/1)
Atlas FTS errors on intra-site transfers/deletions. Looked to be a load
related problem, possibly caused by the deletions. Did it come back? In
progress (28/1)
OXFORD
https://ggus.eu/ws/ticket_info.php?ticket=86106 (14/9/2012)
Ye olde "low atlas sonar rates to BNL ticket" for Oxford. Have there
been any further investigation on this issue. Does the problem still
exist? We don't want to leave these tickets to rot. On hold (30/11)
https://ggus.eu/ws/ticket_info.php?ticket=90245 (10/1)
Oxford's atlas group disk migration ticket. Oxford seem to be mostly
drained. Waiting for reply (28/1)
https://ggus.eu/ws/ticket_info.php?ticket=91117 (3/2)
atlas FTS failures, the problem seemed to be caused by high load on a
dpm disk pool. Things looked to have calmed down (did you read-only the
server?), this ticket looks good for closing. In progress (4/2)
BRISTOL
https://ggus.eu/ws/ticket_info.php?ticket=90275 (10/1)
cms (I think it's cms) have ticketed sites about their cvmfs status.
Winnie is working on this, but has time constraints. On hold (29/1)
https://ggus.eu/ws/ticket_info.php?ticket=90328 (11/1)
Stephen ticketed Bristol over some strange values published by their SE.
Waiting to track down how a similar problem was fixed. In progress (31/1)
https://ggus.eu/ws/ticket_info.php?ticket=90361 (13/1)
Enabling the GridPP VOMS server ticket for the ngs VO - the Bristol
edition. Winnie's put the ticket on hold. On Hold (29/1)
BIRMINGHAM
https://ggus.eu/ws/ticket_info.php?ticket=86105 (14/9/2012)
Birmingham's "low atlas sonar rate to BNL" ticket. The same comments to
the Oxford version apply to this one. Maybe we're lucky and the
problem's evaporated! On hold (30/11/12)
GLASGOW
https://ggus.eu/ws/ticket_info.php?ticket=90862 (27/1)
Glasgow have a descrepency between the advertised space used according
the the SRM and their BDII. Inder investigation, Stephen has asked that
any findings get passed along to DPM support. In Progress (28/1)
https://ggus.eu/ws/ticket_info.php?ticket=89804 (18/12/12)
The Glaswegian atlas group disk migration ticket. After the initial
changes this seems quiet, maybe too quiet. On hold (10/1)
https://ggus.eu/ws/ticket_info.php?ticket=91106 (2/2)
Atlas shifters noticed the Glasgow SE down. Things are settled now, so
this ticket can probably be closed (remember that it's usually best NOT
to leave it to a VO to close a ticket). In progress (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=90966 (28/1)
The Glasgow WMS doesn't seem to be working for the londongrid VO. In
progress (29/1)
https://ggus.eu/ws/ticket_info.php?ticket=90386 (14/1)
enmr.eu report that they can't run jobs when they use proxies containing
VOMS group information. Hopefully this will be fixed when Glasgow roll
out their new argus server. In progress (21/1)
https://ggus.eu/ws/ticket_info.php?ticket=90362 (13/1)
Enabling the GridPP VOMS server ticket for the ngs VO - Glasgow style.
Hopefully this will be fixed with their new argus server. In progress (21/1)
https://ggus.eu/ws/ticket_info.php?ticket=89753 (17/12/2012)
Path MTU discovery problems from QMUL to Glasgow. Discovered to be a
problem within Clydenet, held until it's fixed. On hold (23/1)
ECDF
https://ggus.eu/ws/ticket_info.php?ticket=90878 (27/1)
lhcb report cvmfs problems. Turned out to be a missing nfs mount on some
workers causing jobs to have problems, things have been fixed and the
bad jobs removed. Andy asks if LHCB jobs are doing better at their site.
Waiting for reply (29/1)
https://ggus.eu/ws/ticket_info.php?ticket=86334 (24/9/2012)
Low atlas sonar rates to BNL ticket - Edinburgh edition. See my comments
for Birmingham and Oxford. Wahid gave a brief update, things have been
proceeding offline. On hold (16/1)
https://ggus.eu/ws/ticket_info.php?ticket=89356 (10/12/2012)
(I misread this ticket title as Nagios EU Sec-Ops, and lost 5 minutes to
imagining what kind of cool, futuristic cyber-police force such an
organisation with that name would be. I've been staring at GGUS for too
long I think)
Wahid has given a statement about the need for the tarball to undergo
more testing, and the ticket has been extended. On hold (31/1)
DURHAM
https://ggus.eu/ws/ticket_info.php?ticket=91072 (1/2)
Durham are having cream nagios test failures- "teething troubles" for
their updated services. In progress (1/2)
https://ggus.eu/ws/ticket_info.php?ticket=89825 (19/12/2012)
enmr.eu having trouble installing software on the Durham cluster. Ticket
"On hold" but there seems to be some progress going on as Durham get
their reinstalled services back up and running. On hold (2/2)
https://ggus.eu/ws/ticket_info.php?ticket=75488 (19/10/2011)
Ancient Compchem ticket. Mike reports that the new CE is up but needs
the VO software reinstalling. On hold (1/2)
https://ggus.eu/ws/ticket_info.php?ticket=90358 (13/1)
Durham's enabling the gridpp voms for the ngs VO ticket. On hold until
the current batch of work is complete. On hold (30/1)
https://ggus.eu/ws/ticket_info.php?ticket=90340 (12/1)
lhcb pilots aborting at Durham. Let's see how the reinstalled services
work for them, we might want to ask the VOs in these tickets directly
how things are going. On hold (1/2)
https://ggus.eu/ws/ticket_info.php?ticket=90393 (14/1)
Helloworld dteam jobs failing at Durham. All that has been written
previously for the Durham tickets probably applies here! On hold (1/2)
LIVERPOOL
https://ggus.eu/ws/ticket_info.php?ticket=90243 (10/1)
The scouser atlas groupdisk migration ticket. John has stated that they
stand ready to move space on atlas' word, which has yet to come. Waiting
for reply (11/1)
LANCASTER
https://ggus.eu/ws/ticket_info.php?ticket=90242 (10/1)
The red-rose version of the atlas groupdisk migration ticket. The
migration seems to have stalled atlas-side. On hold (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=90395 (14/1)
dteam helloworld jobs fail at Lancaster. Tracked down to a CE being
rubbish rather then a configuration error, the offending CE is due for
downtime this week to correct it's poor behaviour. On hold (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=84461 (23/7)
t2k transfer failures to Lancaster. The problem has been greatly
reduced, and the FTS channels have has their number of concurrent
transfers turned down. Waiting to see how this goes. Waiting for reply
(24/1)
https://ggus.eu/ws/ticket_info.php?ticket=85367 (20/8)
Pilot jobs for ilc failing at Lancaster, due to the same performance
issues seen above. Hopefully it'll be no more after the reinstall. On
hold (4/2)
https://ggus.eu/ws/ticket_info.php?ticket=88772 (22/11/2012)
One of Lancaster's clusters is giving out bad GlueCEPolicyMaxCPUTime,
tracked to a bug in the dynamic publishing
(https://ggus.eu/ws/ticket_info.php?ticket=88904). Waiting on a fix,
which I don't think made it out in the last update. On hold (3/12)
RHUL
https://ggus.eu/ws/ticket_info.php?ticket=89751 (17/12/2012)
Path MTU discovery problems to RHUL. The RHUL networking team are
following up with Janet. On hold (28/1)
IMPERIAL
https://ggus.eu/ws/ticket_info.php?ticket=89750 (17/12/2012)
IC's Path MTU discovery ticket. Again the ball is in Janet's court. On
hold (16/1)
BRUNEL
https://ggus.eu/ws/ticket_info.php?ticket=90359 (13/1)
Brunels ticket to enable the GridPP voms server for the ngs VO. Raul had
a go at fixing it but no joy. In progress (21/1)
EFDA-JET
https://ggus.eu/ws/ticket_info.php?ticket=88227 (6/11/2012)
No dynamic publishing at EFDA-JET for biomed. Ideas appear to have been
exhausted. In progress (23/1)
|