Dear All,
At qmul we have the problem that standing reservation does not seem to
work. They don't appear in showres -n
When running diagnose -r at qmul we get this.
"WARNING: reservation table is corrupt: active procs reserved does not
equal active procs detected (1064 != 1448)"
It seems that several jobs gets running on a node without the node being
reserved in maui, for example:
checkjob 763954
-----
checking job 763954
State: Running
Creds: user:cms012 group:cms class:lcg2_long qos:DEFAULT
WallTime: 00:01:12 of 3:00:00:00
SubmitTime: Tue Feb 6 07:33:20
(Time Queued Total: 7:43:04 Eligible: 7:23:42)
StartTime: Tue Feb 6 15:16:24
Total Tasks: 1
Req[0] TaskCount: 1 Partition: sl3
Network: [NONE] Memory >= 0 Disk >= 0 Swap >= 0
Opsys: [NONE] Arch: [NONE] Features: [sl3]
Dedicated Resources Per Task: PROCS: 1 MEM: 500M
Allocated Nodes:
[cn235:1]
IWD: [NONE] Executable: [NONE]
Bypass: 0 StartCount: 201
PartitionMask: [DEFAULT][sl3][installs]
WARNING: active job has no reservation
PE: 1.00 StartPriority: 443
----------------------------------------------------------
Any idea of what can cause this ? It it like if maui was not in synch
with what is actually happening on the cluster.
Cheers, Olivier.
--
- O. van der Aa - Imperial College London -
- LT2 Technical Coordinator -
- tel: +442075947810, +442071005426 -
- SIP: [log in to unmask] -
- fax: +442078238830 -
- http://surl.se/agtu -
|