Print

Print


hello,

...Have you checked the available disk free space of the worker node?

Yiannis

On 5/4/07, Daniel Lorenz < [log in to unmask]> wrote:
Hello,

since today 0:00 SFT fail on at least one WN.

After the job start running on the WN, they immediately terminate. The
/pbs/undelivered/<job>..ER file says:

submit-helper script running on host gcn51 gave error: cache_export_dir
(/home/dteam006/.lcgjm/globus- cache-export.I23975) on gatekeeper did not
contain a cache_export_dir.tar archive

logging info says:
Event: Done
- exit_code               =    1
- host                    =     rb127.cern.ch
- level                   =    SYSTEM
- priority                =    asynchronous
- reason                  =    Cannot read JobWrapper output, both from Condor
and from Maradona.
- seqcode                 =
UI=000003:NS=0000000003:WM=000012:BH=0000000000:JSS=000009:LM=000019:LRMS=000000:APP=000000
- source                  =    LogMonitor
- src_instance            =    unique
- status_code             =    FAILED

The CRL was up to date.
I can copy file from the WN to the CE with
globus-url-copy.
The clocks are synchronized.
Users are mapped to the same id.

Has anybody an idea?

Thanks in advance,
Daniel Lorenz