Print

Print


Hi -

Birmingham is seeing a few of these messages:

Adder._updateOutputs() XML is inconsistent with filesTable

But I don't know what they mean or if I should be worried.  I notice that 
other sites have this message too... Any clues as to what the problem is?

Cheers,

Chris



On Tue, 4 Aug 2009, Graeme Stewart wrote:

> We have done ~20 hammercloud tests of various flavours in the UK in
> the last 6 months, plus all the STEP tests. We had one 6 hour problem
> with the pilot code two weeks ago. It was a bug and it was fixed. In
> general panda infrastructure availability is probably > 99%, so we
> were even a little unlucky to catch this. Please, we are not
> "debug[ing] ATLAS pilot job framework".
>
> In contrast, even better sites are having a hard time reaching 95% job
> success rate, so from the site side we have a way to go yet.
>
> My feeling is that the tests are useful for the sites - they are
> primarily for the sites to prepare themselves for data. They do not
> require people to be on "full alert" or to work out of normal hours.
> They are to make analysis support routine, actually. Sites will
> eventually reach a stage where they fully understand their
> infrastructure, then you probably do not learn so much from the tests
> but continuing to participate should not be a problem - in fact it
> keeps re-validating the site, which is reassuring.
>
> I haven't heard that people are finding the tests unhelpful or
> distracting, but please say if you think so.
>
> Cheers
>
> Graeme
>
> On Tue, Aug 4, 2009 at 11:26, Coles, J (Jeremy)<[log in to unmask]> wrote:
>> Hi John
>>
>> We'll review the situation at today's deployment team meeting. It is a
>> pity that the Panda server has not coped well in keeping things stable
>> for the testing period, but there are periods of stability. The obvious
>> danger is that many of the enthusiastic sysadmins out there lose
>> interest if the ATLAS submission problems dominate.
>>
>> Any of the sysadmins out there reading this can comment if they feel
>> their time is being wasted and the approach needs to change. Suggestions
>> are always welcomed. Let's also see how things go today/tomorrow.
>>
>> Regards,
>> Jeremy
>>
>>
>>
>> -----Original Message-----
>> From: Testbed Support for GridPP member institutes
>> [mailto:[log in to unmask]] On Behalf Of Gordon, JC (John)
>> Sent: 04 August 2009 10:07
>> To: [log in to unmask]
>> Subject: Re: HammerCloud - today at 10am onwards.
>>
>> Just a thought. Always happy to be contradicted by those more engaged.
>> From the last test, purely anecdotally, many of the issues raised on the
>> list seemed to be due to the pilot factories etc. This seemed to be a
>> waste of the alertness of the sysadmins.
>>
>> John
>>
>>> -----Original Message-----
>>> From: Testbed Support for GridPP member institutes [mailto:TB-
>>> [log in to unmask]] On Behalf Of Coles, J (Jeremy)
>>> Sent: 04 August 2009 09:44
>>> To: [log in to unmask]
>>> Subject: Re: HammerCloud - today at 10am onwards.
>>>
>>> Hi John
>>>
>>> Others will probably reply but my understanding was that the
>>> HammerCloud
>>> jobs are just part of the testing infrastructure. The actual testing
>> is
>>> of the sites and their bottlenecks not the pilot job framework.
>>> Therefore I think it is good if as many sites as possible remain
>>> involved - it is more about finding capacity/throughput limits than
>>> testing if a site is satisfactory and the subtle changes in the job
>> mix
>>> may bring out new issues each time.
>>>
>>> Jeremy
>>>
>>> -----Original Message-----
>>> From: Testbed Support for GridPP member institutes
>>> [mailto:[log in to unmask]] On Behalf Of Gordon, JC (John)
>>> Sent: 04 August 2009 09:32
>>> To: [log in to unmask]
>>> Subject: Re: HammerCloud - today at 10am onwards.
>>>
>>> Might it be an idea for sites which were satisfactory last time to
>>> ignore it and then look back and see what happened? You're not going
>> to
>>> be on full alert at all hours during data taking and it shouldn't need
>>> the whole UK to debug ATLAS pilot job framework.
>>>
>>> Just a thought,
>>>
>>> John
>>>
>>>> -----Original Message-----
>>>> From: Testbed Support for GridPP member institutes [mailto:TB-
>>>> [log in to unmask]] On Behalf Of Sam Skipsey
>>>> Sent: 04 August 2009 09:15
>>>> To: [log in to unmask]
>>>> Subject: HammerCloud - today at 10am onwards.
>>>>
>>>> Hi all,
>>>>
>>>> Just to remind you that we have another HammerCloud at 10am.
>>>>
>>>> Business as usual. (Indeed, we can assume that these will be fairly
>>>> regular for the next couple of weeks, I guess.)
>>>>
>>>> Sam
>>> --
>>> Scanned by iCritical.
>>> --
>>> Scanned by iCritical.
>> --
>> Scanned by iCritical.
>> --
>> Scanned by iCritical.
>>
>
>
>
>

-- 
West 326
Physics and Astronomy
University of Birmingham
Edgbaston
Birmingham
B15 2TT

(Office) 0121 414 4700
(Mobile) 0798 666 1959