Dear Maarten,
I tried to submit the job using an ordinary user account (i.e. adeel) from
UI which is only a member of dteam VO.
Regarding the reconfiguration of the CE, I only upgraded it to the latest
available update of glite-3.1.
Yes I checked the suggestions on the page
http://goc.grid.sinica.edu.tw/gocwiki/Unspecified_gridmanager_error
/var/spool/pbs/mom_logs on the WN don't state anything, so it seems that the
jobs are not actually executing.
I have tested the PBS stagein functionality by running the script attached
under a grid user account by specifying its corresponding queue name as an
argument, I got "test successful" message.
-- Best Regards --
Adeel
-----Original Message-----
From: [log in to unmask] [mailto:[log in to unmask]]
Sent: Saturday, August 25, 2007 9:51 PM
To: Adeel-ur-Rehman
Cc: [log in to unmask]
Subject: Re: [LCG-ROLLOUT] Job Submission Failure
On Sat, 25 Aug 2007, Adeel-ur-Rehman wrote:
> At our site, since I upgraded it to the latest update of gLite 3.1, no
jobs
> are executing rather I am getting job submission failures. Reading the
> details of the error, it states "Got a job held event, reason: Unspecified
> gridmanager error". I can qsub test jobs, but globus-job-run Aborts the
job
Did you try submitting a job as an "sgm" user? Did you reconfigure your CE?
> after Retrying HitCount 3 times.
>
> And there is no offending ssh key problems between our CE and WNs.
Did you check the suggestions on this page:
http://goc.grid.sinica.edu.tw/gocwiki/Unspecified_gridmanager_error
You can test the PBS stagein functionality by running the attached script
under a grid user account on the CE, with the queue name as argument.
Check /var/spool/pbs/mom_logs on the WN.
|