PS I can change this to a different directory previously kept for
xrootd, but not on all the nodes as we do have xrootd as well on 200 of
them and these unfortunately come first in the pbs order of submission.
cheers
alessandra
Alessandra Forti wrote:
> Hi Graeme,
>
> they point both at the same space.
>
> ta
>
> cheers
> alessandra
>
> Graeme Stewart wrote:
>> On Wed, Jul 22, 2009 at 14:05, Duncan
>> Rand<[log in to unmask]> wrote:
>>
>>> Alessandra Forti wrote:
>>>
>>>> Hi Graeme,
>>>>
>>>>
>>>>> ANALY_MANC1: Bad. Build job problems, which seem to be on stage-out
>>>>> after the code compiles, e.g.,
>>>>> http://panda.cern.ch:25980/server/pandamon/query?job=1016502474.
>>>>>
>>>>>
>>>>>
>>>> all data servers links are saturated.
>>>>
>>>>
>>>>> ANALY_MANC2: Fair. Running out of local stage space? "Error details:
>>>>> pilot: Too little space left on local disk to run job: 2050048 kB
>>>>> (need > 2097152 kB)" - maybe need to clean up disks or run less jobs
>>>>> on these nodes? Otherwise ok.
>>>>>
>>>>>
>>>>>
>>>> indeed this is a recurring problem that we will solve when we
>>>> upgrade to
>>>> SL5. We have only two cpus per node but only 20GB in scratch. It
>>>> used to be
>>>> enough but these analisys seem to copy far more than 10GB per job.
>>>>
>>> We now get torque to create a per-job temporary directory which gets
>>> cleared
>>> up at the end of the job:
>>>
>>> http://www.clusterresources.com/torquedocs21/users/2.2files.shtml#tmpdir
>>>
>>>
>>
>> For historical reasons (remember the EDG RB?) the pre-pilot wrapper
>> script looks first for EDG_WL_SCRATCH, then TMPDIR:
>>
>> if [ -n "$EDG_WL_SCRATCH" ]; then
>> cd $EDG_WL_SCRATCH
>> elif [ -n "$TMPDIR" ]; then
>> cd $TMPDIR
>> fi
>>
>> I think this is now an anachronism and I will reverse the order, but
>> be kind to me if in the next few hours the pilot wrapper does the
>> wrong thing...
>>
>> Graeme
>>
>>
>
--
No man ever steps in the same river twice, for it's not the same river and he's not the same man. (Heraclitus)
Northgrid Tier2 Technical Coordinator
http://www.hep.manchester.ac.uk/computing/tier2
|