Hi Jeremy,
> -----Original Message-----
> From: Testbed Support for GridPP member institutes
> One other thing (already discussed by the deployment team on
> Tuesday), please feel free to mail me your suggestions on how
> to improve the GridView availability interface (shown here
> http://gridview.cern.ch/GRIDVIEW/same_index.php) as I am
> speaking to the developers tomorrow. Already on the list is
> getting rid of the abbreviations and clearer explanations of
> the tests used. It looks likely that figures from this tool
> will be among the first to be used by the WLCG Management
> Board to judge site availability. For your site select
> "Tier-2 site availability" and then select your site from the
> (annoying) list, select the daily report and then the time frame.
> Finally click display graphs. Let me know if you are unable
> to find your site in the Tier-2 list - we already see some
> are missing. Also let me know if the data looks completely
> wrong from your perspective (i.e. do you think the trend is correct?).
Well, from my point of view they could improve the categorizing of the
failures. According to the graphs I have an uptime over the last day of
barely more than 50%. When I check the failures on the SAM pages,
every single one is a top-level BDII timeout of one sort or another (as
an aside, I thought the T1 had put in a second BDII to help with the
issue; did that happen?). Since on Friday I'll go in and mark all these
failures non-relevant, they could retrospectively remove the tests from
their accounting. Of course that would have to be audited, but if they
start to try to hold me accountable on these numbers they won't get a
nice answer!
Chris.