Hi all,
I've been testing our torque installation. However, submitted jobs are kept in the 'Q' state for too much time. Meanwhile, ALL, working nodes report to be on the "free" state.
Does somebody knows how to solve it ?
Here are some output logs;
*************************************************************************************
[root@ce root]# qstat -nsR
ce.prd.hp.com:
Req'd Req'd Elap
Job ID Username Queue NDS TSK Memory Time S Time BIG FAST PFS
--------------- -------- -------- --- --- ------ ----- - ----- ----- ----- -----
8.ce.prd.hp.com dteam001 dteam 1 -- -- 48:00 Q -- -- -- --
bl-wn9
--
9.ce.prd.hp.com lhcb001 lhcb 1 -- -- 48:00 Q -- -- -- --
bl-wn9
--
10.ce.prd.hp.co dteam001 dteam 1 -- -- 48:00 Q -- -- -- --
bl-wn9
--
11.ce.prd.hp.co dteam001 dteam 1 -- -- 48:00 Q -- -- -- --
bl-wn9
--
12.ce.prd.hp.co dteam001 dteam 1 -- -- 48:00 Q -- -- -- --
bl-wn9
--
*********************************************************************************
[root@ce root]# pbsnodes -a | head -n 15
bh-wn0.prd.hp.com
state = free
np = 3
properties = lcgpro
ntype = cluster
status = arch=linux,uname=Linux bh-wn0.prd.hp.com 2.4.21-20.EL.cern #1 TueSep 28 18:42:19 CEST 2004 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=14837558,totmem=1557528kb,availmem=544136kb,physmem=1027392kb,ncpus=1,loadave=0.00,rectime=1109003619
bh-wn1.prd.hp.com
state = free
np = 3
properties = lcgpro
ntype = cluster
status = arch=linux,uname=Linux bh-wn1.prd.hp.com 2.4.21-20.EL.cern #1 TueSep 28 18:42:19 CEST 2004 i686,sessions=? 0,nsessions=? 0,nusers=0,idletime=408004,totmem=1557528kb,availmem=1236400kb,physmem=1027392kb,ncpus=1,loadave=0.00,rectime=1109003628
********************************************************************************
Any help appreciated.
Thanks,
./MS
|