Well, in case of LCG-CE I think it's best to try running job with
globus-job-run. Perhaps you could post output of running it?
atb,
Ilja
Adeel-ur-Rehman wrote:
> Dear Ilja,
>
> Sorry for the confusion. By globus-job-run, I mean to say edg-job-submit!
> Yes I'm talking about LCG-CE.
>
> -- Best Regards --
> Adeel
>
> -----Original Message-----
> From: LHC Computer Grid - Rollout [mailto:[log in to unmask]]
> On Behalf Of Ilja Livenson
> Sent: Saturday, August 25, 2007 8:29 PM
> To: [log in to unmask]
> Subject: Re: [LCG-ROLLOUT] Job Submission Failure
>
> Hi,
>
> are you sure you are talking about globus-job-run? It doesn't resubmit
> jobs, afaik, hence doesn't fail with the HitCount error.
>
> Ilja
>
> PS. You are talking about LCG CE, not gLite, right?
>
> Adeel-ur-Rehman wrote:
>
>> Dear All,
>>
>> At our site, since I upgraded it to the latest update of gLite 3.1, no
>>
> jobs are executing rather I am getting job submission failures. Reading the
> details of the error, it states "Got a job held event, reason: Unspecified
> gridmanager error". I can qsub test jobs, but globus-job-run Aborts the job
> after Retrying HitCount 3 times.
>
>>
>> And there is no offending ssh key problems between our CE and WNs.
>>
>> Any ideas??
>>
>>
>>
>> -- Best Regards --
>> Adeel-ur-Rehman
>>
>>
|