I am having problems getting LCG to submit jobs to PBS. The CE and
pbs-server are running on different hosts (the WNs are running the TAR_WN
installation).
When submitting an LCG job, I get the following in the pbs-server log:
09/06/2005 14:22:03;0008;PBS_Server;Job;111464.smokescreen;Job Queued at request of dteam002@n100, owner = dteam002@n100, job name = STDIN, queue = gridjobs
09/06/2005 14:22:05;0008;PBS_Server;Job;111464.smokescreen;MOM rejected modify request, error: 15001
09/06/2005 14:22:05;0080;PBS_Server;Req;req_reject;Reject reply code=15001, aux=0, type=11, from root@smokescreen
In the mom-logs on the WN I get:
pbs_mom;Req;del_files;cannot stat globus-cache-export.sR2688.gpg
pbs_mom;Req;;Type deletejob request received from PBS_Server@smokescreen, sock=11
pbs_mom;Req;;Type deletefiles request received from PBS_Server@smokescreen, sock=10
pbs_mom;Req;;Type modifyjob request received from PBS_Server@smokescreen, sock=11
pbs_mom;Req;req_reject;Reject reply code=15001, aux=0, type=11, from PBS_Server@smokescreen
Error 15001 is 'Unknown Job Identifier'.
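To make the reject records easier to pick out of a long log, here is a minimal Python sketch. It assumes the semicolon-separated layout seen in the pbs-server lines above (timestamp; event code; server; object type; object id; message). The names parse_server_line and PBS_ERRORS are illustrative only and not part of PBS, and the lookup table holds just the one code quoted in this post.

```python
import re

# Illustrative table: only the code quoted in this post is listed.
PBS_ERRORS = {15001: "Unknown Job Identifier"}

# Matches the "code=..., aux=..., type=..." part of a reject reply message.
REJECT_RE = re.compile(r"code=(\d+), aux=(-?\d+), type=(\d+)")


def parse_server_line(line):
    """Split one pbs-server log record into its six fields.

    Returns None if the line does not have the assumed layout.
    """
    parts = line.strip().split(";", 5)
    if len(parts) != 6:
        return None
    timestamp, event, server, obj_type, obj_id, message = parts
    record = {
        "timestamp": timestamp,
        "event": event,
        "server": server,
        "type": obj_type,
        "id": obj_id,
        "message": message,
    }
    match = REJECT_RE.search(message)
    if match:
        code = int(match.group(1))
        record["reject_code"] = code
        record["request_type"] = int(match.group(3))
        record["reject_text"] = PBS_ERRORS.get(code, "not in table")
    return record


if __name__ == "__main__":
    sample = ("09/06/2005 14:22:05;0080;PBS_Server;Req;req_reject;"
              "Reject reply code=15001, aux=0, type=11, from root@smokescreen")
    print(parse_server_line(sample))
```

The mom-log lines quoted above have fewer leading fields, so this sketch would need a different split for them.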
What might be wrong here?
--
--------------------------------------------------------
Johan Gunnarsson Systems expert
National Supercomputer Centre Linköping university
[log in to unmask] http://www.nsc.liu.se
--------------------------------------------------------