Dear Ilja,
Sorry for the confusion. By globus-job-run, I mean to say edg-job-submit!
Yes I'm talking about LCG-CE.
-- Best Regards --
Adeel
-----Original Message-----
From: LHC Computer Grid - Rollout [mailto:[log in to unmask]]
On Behalf Of Ilja Livenson
Sent: Saturday, August 25, 2007 8:29 PM
To: [log in to unmask]
Subject: Re: [LCG-ROLLOUT] Job Submission Failure
Hi,
are you sure you are talking about globus-job-run? It doesn't resubmit
jobs, afaik, hence doesn't fail with the HitCount error.
Ilja
PS. You are talking about LCG CE, not gLite, right?
Adeel-ur-Rehman wrote:
>
> Dear All,
>
> At our site, since I upgraded it to the latest update of gLite 3.1, no
jobs are executing rather I am getting job submission failures. Reading the
details of the error, it states "Got a job held event, reason: Unspecified
gridmanager error". I can qsub test jobs, but globus-job-run Aborts the job
after Retrying HitCount 3 times.
>
> And there is no offending ssh key problems between our CE and WNs.
>
> Any ideas??
>
>
>
> -- Best Regards --
> Adeel-ur-Rehman
>
|